mRNA及包含其的新冠病毒mRNA疫苗

文档序号:1916806 发布日期:2021-12-03 浏览:10次 >En<

阅读说明:本技术 mRNA及包含其的新冠病毒mRNA疫苗 (mRNA and novel coronavirus mRNA vaccine containing same ) 是由 王冰 俞航 于 2020-05-28 设计创作,主要内容包括:本发明提供了mRNA,其包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的一种、两种、三种或四种蛋白或其片段的mRNA,编码S蛋白的mRNA的序列如SEQ ID NO.18、SEQ ID NO.19或SEQ ID NO.20所示;编码E蛋白的mRNA的序列如SEQ ID NO.21所示;编码M蛋白的mRNA的序列如SEQ ID NO.22所示;编码N蛋白的mRNA的序列如SEQ ID NO.23所示。还提供了一种包含所述mRNA的脂质体纳米颗粒、一种针对新冠病毒的mRNA疫苗等。本发明的mRNA在细胞水平高效产生病毒蛋白,或由产生的蛋白自组装成病毒样颗粒。将包含本发明的mRNA制备成疫苗时,安全性高、有效性好、不会产生非中和抗体而不会产生抗体依赖增强感染效应。(The invention provides mRNA, which comprises mRNA for encoding one, two, three or four proteins or fragments thereof in S protein, E protein, M protein and N protein derived from SARS-CoV-2 virus, wherein the sequence of the mRNA for encoding the S protein is shown as SEQ ID NO.18, SEQ ID NO.19 or SEQ ID NO. 20; the sequence of mRNA for encoding the E protein is shown as SEQ ID NO. 21; the sequence of mRNA for coding the M protein is shown as SEQ ID NO. 22; the sequence of mRNA for coding the N protein is shown as SEQ ID NO. 23. Also provided are a liposomal nanoparticle comprising the mRNA, an mRNA vaccine against the novel coronavirus, and the like. The mRNA of the present invention efficiently produces viral proteins at the cellular level, or self-assembles from the produced proteins into virus-like particles. When the mRNA of the present invention is used to prepare a vaccine, it is highly safe and effective, and produces no non-neutralizing antibody and no antibody-dependent infection-enhancing effect.)

mRNA及包含其的新冠病毒mRNA疫苗

技术领域

本发明涉及一种mRNA及包含其的新冠病毒mRNA疫苗,本发明还涉及所述mRNA 及包含其的新冠病毒mRNA疫苗、脂质体纳米颗粒、药物组合物和试剂盒等。

背景技术

近年来,基于体外转录(IVT)的信使RNA(mRNA)的治疗正显示出巨大的潜力。 它的原理是将体外制备的mRNA包裹成药物递送到体内组织被细胞内吞,外源mRNA到 达细胞后被核糖体识别并根据其编码信息合成相应的蛋白质。早在1990年Wolff等证明 注射到小鼠体内的mRNA能被翻译蛋白质[7]。1992年Jirikowski等显示注射到下丘脑部 位的vasopressin mRNA缓解了老鼠的尿崩症状[8]。mRNA药物在理论上有诸多优势:与 DNA疗法相比,mRNA不需要入核,无基因组整合的插入突变风险;相对于蛋白质药物, mRNA利用细胞自身的翻译系统,可实现高效和剂量依赖性的活性蛋白表达,解决了一 些蛋白的不可成药性问题。然而mRNA一直受到体外制备、稳定性和递送问题的困扰。 直到近几年IVT(invitro transcription,体外转录)技术配合化学和酶学加帽法、修饰核 苷酸的引入、HPLC纯化技术使得mRNA可以在体外进行大规模制备[9,10]。同时脂质 体和脂质纳米颗粒在递送siRNA上获得成功后被证明也能用于mRNA的包裹和递送[11]。 这些技术的突破使得mRNA成药性大大提高,目前有超过25个mRNA药物包括mRNA 疫苗和蛋白替代的临床研究正在展开[12],市场上第一个mRNA产品的竞争已全面展开。 越来越多的研究人员在关注mRNA药物的应用,而中国在这一领域的研究才刚刚起步。

mRNA药物一个最有潜力的应用是疫苗,包括肿瘤疫苗和感染性疾病疫苗。编码抗原蛋白的mRNA分子经体外合成并形成制剂后即可用于人体免疫,这一过程不涉及活病 毒培养的相关操作,大大缩短了研发时间[13]。mRNA疫苗在近几年不断取得突破性进 展,在2013年的一项研究中,研究者针对H7N9流感病毒设计并制备了mRNA疫苗, 在小鼠实验中取得成效[14]。2015年,针对HIV的mRNA疫苗在非人灵长类动物中产生 体液免疫应答。2017年,寨卡病毒的mRNA疫苗在有效的保护了接受病毒攻击的小鼠[15], 并能降低孕期老鼠感染的风险[16]。除了在动物实验上取得成功以外,mRNA疫苗(如 流感疫苗和寨卡疫苗)已经开始了临床试验,Moderna公司的流感病毒mRNA疫苗I期 临床结果显示其免疫原性或优于传统疫苗[17]。同样是该公司的寨卡病毒疫苗 mRNA-1893去年进入了美国FDA的快速通道。IVT mRNA的技术优势能有效应对病毒 的高突变率,使得新爆发流行病疫苗的快速开发成为可能,有望成为提高新发传染病的 防治效率的突破方向。

传统的预防病毒的疫苗有重组蛋白疫苗、灭活疫苗、减毒活疫苗以及体外重组的病 毒样颗粒(virus-like particle,VLP)。在以往的经验中灭活或减毒疫苗因形态和组成与真 实病毒一样,能产生强烈的免疫反应,成为疫苗的首选。但是他们有不可避免的缺点:灭活或减毒疫苗生产周期较长、有些病毒如诺如病毒不能大规模培养、灭活病毒往往不 能诱导免疫反应而减毒疫苗亦有返祖的风险等。而体外重组病毒样颗粒疫苗是由病毒衣 壳蛋白或包膜蛋白自主包装形成的空衣壳结构,能快速刺激机体产生体液免疫和细胞免 疫应答,病毒样颗粒不含有病毒遗传物质和免疫抑制蛋白,目前安全性最高的一种新型 候选疫苗,已经有多种基于VLP的疫苗产品上市[18]。2002年SARS-CoV和2012年 MERS-CoV爆发后,多种疫苗方案被研究,包括灭活或减毒毒株、基于重组DNA的S 蛋白质以及体外重组的病毒样颗粒[19,20]。其中S蛋白质由于是介导病毒入侵的主要蛋 白质,也是中和抗体的主要靶点,受到疫苗开发的特别关注。动物试验表明这些疫苗均 有保护效果,但是安全性还是最大的担忧。例如基于全长S蛋白抗原的疫苗产生了大量 非中和抗体,这些非中和抗体在抗体依赖增强感染效应(antibody-dependent enhancement, ADE)中扮演了重要角色[21],反而加速了疾病进展,造成疫苗安全性存在重大问题。由 于机体在接受mRNA药物后能根据编码信息合成任何蛋白质,mRNA在疫苗抗原的选择 上极其灵活。但是考虑到病毒样颗粒的优势,目前在临床的病毒mRNA疫苗多以病毒样 颗粒为最终抗原展示形式,如寨卡病毒[15]。

mRNA疫苗虽有诸多优势,但大部分还是理论阶段,需要大量的基础和临床研究。有效的能在体内诱导合成病毒样颗粒的mRNA疫苗应满足两个条件,一是表达效率要高, 产生足够剂量的病毒样颗粒刺激机体产生免疫应答;二是产的病毒样颗粒应与真实病毒 从形态和结构组成上一致,使机体获得免疫能力能应对真实病毒。但是由于冠状病毒自 身的特点,研发也面临着很多挑战。冠状病毒为正链单股RNA病毒,它有一层脂双分子 膜形成包膜(envelope)结构,包膜中插入结构蛋白M(membrane)、E(envelope)、和S(spike)。 其中S棘突蛋白质是冠状病毒最重要的表面蛋白,决定了病毒的宿主范围和特异性。S 蛋白质是宿主中和抗体的重要作用位点,因此成为SARS-CoV和MERS-CoV的疫苗设计 中的关键靶点。冠状病毒还有核蛋白N(nucleoprotein)在内层包裹病毒的基因组。除了结 合基因组,N蛋白质对包膜的形态塑成也有作用,因此也被认为是结构蛋白之一。冠状 病毒一个特点是其形态和大小并不是完全固定的,事实上冠状病毒的直径在80-200nm之 间。因此即使是高分辨率冷冻电镜,也无法使用单颗粒分析来获得整个病毒的原子结构。 冠状病毒包膜内结构蛋白质的比例也不固定,取决于病毒在细胞内组装时各个结构蛋白 质的含量。这和寨卡病毒不同,寨卡病毒虽然也是包膜病毒,但是其形态固定,成刚性 二十面体结构,其结构蛋白单一且拷贝数固定,没有棘突结构。因此同样是合成病毒样 颗粒,冠状病毒的mRNA疫苗设计要比寨卡病毒复杂的多。首先,寨卡病毒的mRNA 疫苗只包含了一条mRNA,编码prM-E融合蛋白,而冠状病毒的mRNA疫苗必须是个组 合(cocktail),包含至少3条mRNA编码不同的结构蛋白质。其次,对冠状病毒包膜结 构的装配目前还存在诸多争议。根据对SARS-CoV的研究,M和E共表达已经足够形成 病毒样颗粒,但是没有棘突结构,共表达S与M和E可以引入S蛋白质,使得VLP带 有棘突。但是尽管形成病毒样颗粒,蛋白质组成比例却和真实病毒存在很大差别。另外N蛋白质虽然在内层主要与病毒基因组相互作用,有研究并表明但是它的存在对病毒样 颗粒的表达和分泌有加强作用。目前已有几种新冠病毒疫苗进入临床试验,均以新冠病 毒S蛋白为主要抗原,安全性和有效性还未经证明,仍有失败的风险。因此,急需继续 研发针对新冠病毒的多种抗原策略的新型冠状病毒疫苗。

发明内容

本发明所要解决的技术问题是为了克服现有技术中没有商品化的新冠病毒疫苗等缺 陷,提供了一种mRNA、一种DNA以及包含其的新冠病毒mRNA疫苗、脂质体纳米颗 粒、由其表达产生的病毒样颗粒、药物组合物和试剂盒。本发明的经过密码子优化后或 者进一步经过核苷酸修饰的组装新冠病毒所需的几种蛋白的mRNA,各自单独在细胞中 都可以高表达。并且经过本发明特定比例组成的mRNA,在细胞水平高效产生病毒蛋白, 或由产生的蛋白自组装成病毒样颗粒,能够实现病毒样颗粒的高表达,且大小和形态结 构均极为接近真实病毒,后续用于临床时可以使机体获得免疫能力能应对真实病毒。包 含本发明mRNA的纳米颗粒,多条mRNA同时被脂质纳米颗粒包装的效率/表达效率高 依旧很高,从而能够产生足够剂量的病毒样颗粒刺激机体产生免疫应答,免疫原性和稳 定性较高。将包含本发明的经过密码子优化后或者经过核苷酸修饰的组装新冠病毒所需 的几种蛋白的mRNA制备成疫苗(例如以表达病毒样颗粒的形式、仅表达S蛋白的疫苗 的形式或以仅表达S蛋白中RBD区域的疫苗的形式)时,安全性高、有效性好、不会产 生非中和抗体从而不会产生抗体依赖增强感染效应。

本领域技术人员公知,冠状病毒的形态和大小并非完全固定,其包膜内结构蛋白质 的比例也不固定,因此同样是合成病毒样颗粒,冠状病毒的mRNA疫苗设计要比现有技术中的其他病毒复杂的多。而本发明人通过大量实验和摸索,意外发现对密码子进行特 定优化后,能够实现病毒样颗粒的完整表达。本发明人在实验中还发现,体外转录生成 mRNA的翻译效率和稳定性受其不同的化学修饰(对每种mRNA使用不同的修饰核苷酸 在细胞内的命运大相径庭),5’和3’非翻译区序列(UTRs),5’加帽方式(使用不同的cap0 或cap1类似物)和3’poly(A)尾的长度影响。本发明人经过大量研究,发现通过选用特定 的核苷化学修饰、特定的UTR序列及特定的优化加帽方式,能够进一步实现mRNA转 染细胞半小时后即开始表达较高水平蛋白,并且表达可持续一周。同时本发明人通过大 量实验,最终发现多种修饰核苷酸通过特定的组合能够进一步取得较好的免疫原性和稳 定性。另外,本发明中S蛋白质是长达1273个氨基酸,属于较大蛋白质,结合5’和3’UTR, 最终mRNA总长超过4000nt。本发明人在实验中发现长链mRNA的合成一直是个挑战, 而通过优化编码蛋白质(例如S蛋白质)的mRNA序列,同时优化UTR序列和修饰核 苷酸,进行蛋白质(例如S蛋白质)的表达筛选,还能够克服超长基因mRNA的制备与 纯化的问题。

为了解决上述技术问题,本发明第一方面提供了mRNA,其包含编码来源于 SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的一种、两种、三种或四种蛋 白、其片段、变体或衍生物的mRNA,

其中,编码所述S蛋白的mRNA的序列如SEQ ID NO.18、SEQ ID NO.19或SEQ IDNO.20所示;编码所述E蛋白的mRNA的序列如SEQ ID NO.21所示;编码所述M蛋 白的mRNA的序列如SEQ ID NO.22所示;编码所述N蛋白的mRNA的序列如SEQ ID NO.23所示。

较佳地,所述片段为所述S蛋白的RBD结构域的片段,其mRNA的序列优选如SEQ IDNO.37所示。

较佳地,所述mRNA还包含5’-帽结构,优选为3′-O-Me-m7G(5')ppp(5')G、 m7G(5')ppp(5')(2'OMeA)pG或m7(3'OMeG)(5')ppp(5')(2'OMeA)pG。

本发明中,所述3′-O-Me-m7G(5')ppp(5')G的结构一般如下所示:

本发明中,所述m7G(5')ppp(5')(2'OMeA)pG的结构一般如下所示:

本发明中,所述m7(3‘OMeG)(5’)ppp(5‘)(2’OMeA)pG的结构一般如下所示:

较佳地,所述mRNA序列还包含3’-聚腺苷酸,其序列优选包含约25至约400个腺 苷核苷酸的序列,优选约50至约400个腺苷核苷酸的序列,更优选约50至约300个腺 苷核苷酸的序列,进一步更优选约50至约250个腺苷核苷酸的序列,进一步优选约60 至约250个腺苷核苷酸的序列,最优选120个聚腺苷酸组成的序列。

较佳地,所述mRNA序列还包含5’-UTR,所述5’-UTR的序列优选如SEQ ID NO. 15所示。

较佳地,所述mRNA序列还包含3’-UTR,所述3’-UTR的序列优选来源于提供 稳定的mRNA的基因的3’UTR或来源于其同源物、片段或变体,更优选如SEQ ID NO. 16或SEQ IDNO.17所示。

较佳地,所述mRNA序列还包含多核苷酸修饰,所述多核苷酸优选5-methyl-CTP、pseudo-UTP、N1-Methylpseudo-UTP和5-Methoxy-UTP中的一种或多种。本发明中,所 述的5-methyl-CTP可以购于ApexBio,#B7967。所述的pseudo-UTP可以购于ApexBio, #B7972。所述的N1-Methylpseudo-UTP可以购于ApexBio,#B8049。所述的5-Methoxy-UTP 可以购于ApexBio,#B8061。

更佳地,所述N蛋白的mRNA包括5-methyl-CTP、pseudo-UTP、N1-Methylpseudo-UTP或5-Methoxy-UTP的修饰,或、包括5-methyl-CTP和pseudo-UTP共同的修饰。

更佳地,所述E蛋白的mRNA包括5-methyl-CTP、pseudo-UTP或 N1-Methylpseudo-UTP的修饰。

更佳地,当编码所述S蛋白的mRNA的序列如SEQ ID NO.18所示时,所述S蛋白 的mRNA包括5-methyl-CTP、pseudo-UTP或N1-Methylpseudo-UTP的修饰,或包括 5-methyl-CTP和pseudo-UTP的修饰,优选包括pseudo-UTP或N1-Methylpseudo-UTP的 修饰。

更佳地,当编码所述S蛋白的mRNA的序列如SEQ ID NO.19所示时,所述S蛋白 的mRNA包括pseudo-UTP的修饰,或包括5-methyl-CTP和pseudo-UTP的修饰。

更佳地,当编码所述S蛋白的mRNA的序列如SEQ ID NO.20所示时,所述S蛋白 的mRNA包括pseudo-UTP或N1-Methylpseudo-UTP的修饰。

较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白和M蛋白 的mRNA,所述S蛋白、E蛋白、M蛋白由三条独立的mRNA分别表达得到,表达所述 S蛋白、E蛋白、M蛋白的mRNA的摩尔比优选为1:(2~0.5):(2~0.5),例如为1:1:1。

较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的M蛋白和E蛋白的mRNA, 所述M蛋白和E蛋白的mRNA优选连接后进行表达,所述连接优选通过编码2A肽段的 mRNA的序列(经过蛋白表达后,所得的2A肽经过“自剪切”,最后能够得到独立的M 蛋白和E蛋白)进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或 SEQ ID NO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO.38或SEQ ID NO.39所示,编码所述2A肽段的mRNA序列进一步优选如SEQ ID NO.40或SEQ ID NO.41所示。更佳地,所述连接后的mRNA的序列优选如SEQ ID NO.35或36所示,其 DNA序列优选如SEQ ID NO.28或29所示。

较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的所述的S蛋白的mRNA。

较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的所述的S蛋白的RBD结 构域的mRNA。

较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的M蛋白、E蛋白和S蛋白 的mRNA,所述M蛋白和E蛋白的mRNA连接后进行表达,所述连接优选通过编码2A 肽段的所述2A肽段的序列进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO. 42或SEQ IDNO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO.38 或SEQ ID NO.39所示,编码所述2A肽段的mRNA序列进一步优选如SEQ ID NO.40或SEQ ID NO.41所示。更佳地,所述连接后的mRNA的序列优选如SEQ ID NO.35或36 所示,其DNA序列优选如SEQ ID NO.28或29所示。进一步更佳地,连接后的mRNA 与所述S蛋白的mRNA的摩尔比优选为1.5:1~3:1,例如为2:1。

较佳地,当所述的mRNA包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M 蛋白和N蛋白中的两种、三种或四种蛋白或其片段的mRNA时,所述mRNA编码的蛋 白自组装成病毒样颗粒。

为了解决上述技术问题,本发明第二方面提供了一种DNA,其包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的至少一种蛋白(例如一种、 两种、三种、四种)或其片段的DNA,

其中,编码所述S蛋白的DNA的序列如SEQ ID NO.3、SEQ ID NO.4或SEQ ID NO. 5所示;编码所述E蛋白的DNA的序列如SEQ ID NO.8所示;编码所述M蛋白的DNA 的序列如SEQID NO.11所示;编码所述N蛋白的DNA的序列如SEQ ID NO.13所示。

较佳地,所述片段为所述S蛋白的RBD结构域的片段,其DNA序列优选如SEQ IDNO.30所示。

为了解决上述技术问题,本发明第三方面提供了组合物,其包含多个或多于一个如 本发明第一方面所述的mRNA或如本发明第二方面所述的DNA。

为了解决上述技术问题,本发明第四方面提供了一种脂质体纳米颗粒,其包含本发 明第一方面所述的mRNA、如本发明第二方面所述的DNA或如本发明第三方面所述的 组合物。

较佳地,所脂质体纳米颗粒还包括阳离子脂质和辅助脂质,所述阳离子脂质优选为 DLin-MC3-DMA或DOTMA,所述辅助脂质优选为DSPC和/或胆固醇。

本发明中,所述DLin-MC3-DMA的结构式一般如下所示:

本发明中,所述DOTMA的结构式一般如下所示:

本发明中,所述DSPC的结构式一般如下所示:

较佳地,所脂质体纳米颗粒为长循环阳离子脂质体纳米颗粒,优选为经PEG或其衍生物修饰的长循环阳离子脂质体纳米颗粒;所述PEG的相对分子质量优选为2000~5000,例如为2000、3000、4000或5000。在本发明某一较佳实施例中,所述脂质体纳米颗粒为 包括DMPE-PEG2000的长循环阳离子脂质体纳米颗粒。

为了解决上述技术问题,本发明第五方面提供了一种病毒样颗粒,其包含由本发明 第一方面所述的mRNA所表达的相应蛋白自组装而成、如本发明第二方面所述的DNA 所表达的相应蛋白自组装而成和/或如本发明第三方面所述的组合物所表达的相应蛋白 自组装而成,优选是将所述的mRNA、所述的DNA和/或所述的组合物转入细胞后表达 相应的蛋白,所述细胞优选为293T和/或293A。

较佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和 N蛋白中的两种、三种或四种蛋白或其片段的mRNA所表达的蛋白自组装而成,优选在 细胞中表达所述蛋白,所述细胞优选为293T和/或293A。

更佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的S蛋白、E蛋白和M蛋白的mRNA,所述S蛋白、E蛋白、M蛋白由三条独立的mRNA分别表达的蛋白自组装得到, 表达所述S蛋白、E蛋白、M蛋白的mRNA的摩尔比优选为1:(2~0.5):(2~0.5),例如 为1:1:1。

更佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的M蛋白和E蛋白的mRNA 所表达的蛋白自组装而成,优选在细胞中表达所述蛋白,所述细胞优选为293T和/或293A。 其中,所述M蛋白和E蛋白的mRNA优选连接后进行表达,所述连接优选通过编码2A 肽段的mRNA的序列进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO.42 或SEQ ID NO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO.38或 SEQ ID NO.39所示,编码所述2A肽段的mRNA序列进一步优选如SEQ ID NO.40或SEQ ID NO.41所示。更佳地,所述连接后的mRNA序的列优选如SEQ ID NO.35或36所示, 其DNA序列优选如SEQ ID NO.28或29所示。

更佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的M蛋白、E蛋白和S蛋白的mRNA所表达的蛋白自组装而成,优选在细胞中表达所述蛋白,所述细胞优选为293T 和/或293A。其中,所述M蛋白和E蛋白的mRNA连接后进行表达,所述连接优选通过 编码2A肽段的mRNA的序列进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或SEQ IDNO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO. 38或SEQ ID NO.39所示,编码所述2A肽段的RNA序列进一步优选如SEQ ID NO.40 或SEQ ID NO.41所示。更佳地,所述连接后的mRNA的序列优选如SEQ ID NO.35或 36所示,其DNA序列优选如SEQ ID NO.28或29所示。进一步更佳地,连接后的mRNA 与所述S蛋白的mRNA的摩尔比优选为1.5:1~3:1,例如为2:1。

为了解决上述技术问题,本发明第六方面提供了一种针对新冠病毒的mRNA疫苗,其包含如本发明第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三 方面所述的组合物和/或如本发明第四方面所述的脂质体纳米颗粒。

较佳地,所述mRNA疫苗诱导细胞产生病毒样颗粒,以激活免疫系统。

较佳地,所述mRNA疫苗还包括本领域常规使用的佐剂。

为了解决上述技术问题,本发明第七方面提供了一种药物组合物,其包含如本发明 第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、如本发明第四方面所述的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、 和/或如本发明第六方面所述的mRNA疫苗,和任选地药用载体。

为了解决上述技术问题,本发明第八方面提供了一种试剂盒,其包含如本发明第一 方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、 如本发明第四方面所述的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本 发明第六方面所述的mRNA疫苗和/或如本发明第七方面所述的药物组合物。

为了解决上述技术问题,本发明还提供了一种编码2A肽段的mRNA,其序列优选 如SEQ ID NO.40或SEQ ID NO.41所示。

为了解决上述技术问题,本发明还提供了一种编码2A肽段的DNA,其序列如SEQ IDNO.38或SEQ ID NO.39所示。

为了解决上述技术问题,本发明还提供了如本发明第一方面所述的mRNA或如本发明第二方面所述的DNA或如本发明第三方面所述的组合物在制备如本发明第四方面所 述的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本发明第六方面所述的 mRNA疫苗、如本发明第七方面所述的药物组合物、和/或如本发明第八方面所述的试剂 盒中的应用。

为了解决上述技术问题,本发明还提供了一种预防和/或治疗新冠病毒感染的方法, 其包括(任选地向有需要的受试者)施用如本发明第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、如本发明第四方面所述的脂质体 纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本发明第六方面所述的mRNA疫苗、 如本发明第七方面所述的药物组合物、和/或如本发明第八方面所述的试剂盒的步骤。

为了解决上述技术问题,本发明还提供了一种如本发明第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、如本发明第四方面所述 的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本发明第六方面所述的 mRNA疫苗、如本发明第七方面所述的药物组合物、和/或如本发明第八方面所述的试剂 盒在预防和/或治疗新冠病毒感染中的应用。

本发明中,所示编码2A肽段的序列可以是编码天然病毒的2A肽段的序列,也可以是经过优化的序列(例如T2A和P2A,T2A的mRNA序列可以是如SEQ ID NO.40所示, P2A的mRNA序列可以是如SEQ ID NO.41所示。其对应的DNA序列可以是如SEQ ID NO.38和SEQ IDNO.39所示,经翻译后所得多肽的氨基酸序列可以是如SEQ ID NO.42 和SEQ ID NO.43所示)。这段多肽可高效“自剪切”成前后两个片段,可以使得这段序 列前后两部分的序列独立表达成两个单独的蛋白,达到一段序列上协同表达两个独立蛋 白的目的。

术语解释

本发明中,所述的mRNA又称信使RNA,通常是由DNA的一条链作为模板转录而 来的、携带遗传信息的能指导蛋白质合成的一类单链核糖核酸。以细胞中基因为模板, 依据碱基互补配对原则转录生成mRNA后,mRNA就含有与DNA分子中某些功能片段 相对应的碱基序列,作为蛋白质生物合成的直接模板。

本发明中,所述的mRNA疫苗通常是通过将编码病毒抗原的mRNA直接导入到人 体内,在细胞内表达病毒蛋白抗原,从而激活人体免疫系统,产生针对病毒的中和抗体。

本发明中,所述的抗原(antigen,缩写Ag)一般是指能引起抗体生成的物质,是任何可诱发免疫反应的物质。

本发明中,所述的抗体一般是指机体在抗原物质刺激下,由B细胞分化成的浆细胞所产生的、可与相应抗原发生特异性结合反应的免疫球蛋白。

本发明中,所述的中和抗体一般是指微生物入侵人体后,会刺激产生很多种抗体,但只有部分抗体能迅速识别微生物,并在其入侵人体细胞前将其“抓住”,保护人体不被 感染。这个过程就叫中和作用,发挥作用的抗体就是中和抗体。

本发明中,所述的脂质体纳米颗粒一般是指利用脂质体将药物分子(小分子化合物、 RNA、DNA或蛋白质药物)包装成百纳米大小的复合物,将药物递送到体内,具有增加 药物溶解度、延长药物在体内的滞留时间、增强药物的靶向性及降低毒性等优势。

本发明中,所述的病毒样颗粒(virus-like particles,VLPs)通常是含有某种病毒的一 个或多个结构蛋白的空心颗粒,没有病毒核酸,不能自主复制,在形态上与真正病毒粒 子相同或相似,俗称伪病毒。

本发明中,所述的新冠病毒S蛋白(Spike蛋白)又称棘突或刺突蛋白。S蛋白是冠状病毒最重要的致病靶点蛋白,包含两个亚基,S1和S2。其中,S1主要包含有受体结 合区(RBD结构域),冠状病毒正是通过RBD结构域与细胞表面受体结合来感染细胞。 S蛋白也因此主要承担病毒与宿主细胞膜受体结合及膜融合功能。同时,也是宿主中和 抗体的重要作用位点以及疫苗设计的关键靶点。

在符合本领域常识的基础上,上述各优选条件,可任意组合,即得本发明各较佳实例。

本发明所用试剂和原料均市售可得。

本发明的积极进步效果在于:

(1)本发明的经过密码子优化后或者进一步经过核苷酸修饰的组装新冠病毒所需的 几种蛋白的mRNA,各自单独在细胞中都可以高表达。此外,进一步经过本发明特定比 例组成的mRNA,在细胞水平高效产生病毒蛋白,或由产生的蛋白自组装成病毒样颗粒, 能够实现病毒样颗粒的高表达,且大小和形态结构均极为接近真实病毒,后续用于临床 时可以使机体获得免疫能力能应对真实病毒。

(2)包含本发明mRNA的纳米颗粒,多条mRNA同时被脂质纳米颗粒包装的效率 /表达效率高依旧很高,从而能够产生足够剂量的病毒样颗粒刺激机体产生免疫应答,免 疫原性和稳定性较高。

(3)将包含本发明的经过密码子优化后或者进一步经过核苷酸修饰的组装新冠病毒 所需的几种蛋白的mRNA制备成疫苗(例如以表达病毒样颗粒的形式、仅表达S蛋白的 疫苗的形式或以仅表达S蛋白中RBD区域的疫苗的形式)时,安全性高、有效性好、不 会产生非中和抗体从而不会产生抗体依赖增强感染效应。

附图说明

图1显示了本发明实施例的概述。实施例中用mRNA表达新冠病毒结构蛋白S、M、 E和N以及S蛋白的RBD结构域。用脂质体将mRNA包被成纳米粒(LNP),进行细胞 转染或动物免疫。体外细胞转染的多种mRNA可高表达病毒蛋白,并在合适比例下,自 组装成病毒样颗粒(VLP)。LNP免疫小鼠后,激活小鼠免疫系统,产生抗体。

图2显示了mRNA用脂质体包被转染293A细胞后,Western Blot检测蛋白表达的 结果图。其中,泳道(lane)1为经cap1修饰的mRNA表达的蛋白,泳道2为经 cap1+5mC+pseudoU修饰的mRNA表达的蛋白,泳道3为经cap1+pseudoU修饰的mRNA 表达的蛋白,泳道4为经cap1+5moU修饰的mRNA表达的蛋白,泳道5为经 cap1+N1-m-pseudoU修饰的mRNA表达的蛋白,泳道6为经cap1+5mC修饰的mRNA表 达的蛋白。A为N蛋白的mRNA和NBL mRNA表达蛋白的WB结果图,B为EBL mRNA 和MBL mRNA表达蛋白的WB结果图,C为SGS mRNA、STFmRNA、SBLmRNA表达蛋白的WB结果图,D为SDC50、SDC54、SDC58、SDC60表达蛋白的WB结果图,E 为SGS-RBD结构域表达蛋白的WB结果图,F为MP2AE和MT2AE表达蛋白的WB结 果图。

图3显示了VLP颗粒的电镜照片。

图4显示了mRNA脂质纳米颗粒包装示意图。

图5显示了用ZetaView检测LNP色谱分布图。A的上图是包被了表达S蛋白的SGSmRNA的LNP过滤前的粒径和分布图,下图是同一LNP过滤后的粒径分布图。B的上 图是表达S蛋白的RBD结构域的mRNA的LNP过滤前的粒径和分布图,下图是同一LNP 过滤后的粒径分布图。C的上图是包被了表达M、E和S蛋白mRNA的LNP过滤前的粒 径和分布图,下图是同一LNP过滤后的粒径分布图。

图6显示了经过第一次免疫后一周,用酶联免疫吸附法测试血清中抗体滴度的结果 图。

图7显示小鼠血清中和抗体滴度实验结果。表达S蛋白全长的mRNA(Spike)和 表达病毒样颗粒的mRNA组合(SME),诱导产生的抗体滴度均大于104,单独产生RBD 结构域的mRNA(RBD)诱导产生的中和抗体滴度略高于空白对照(Ctrl)。

具体实施方式

下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施 例范围之中。下列实施例中未注明具体条件的实验方法,按照常规方法和条件,或按照商品说明书选择。

本发明针对新型冠状病毒开发的mRNA疫苗,主要采用(1)表达多种病毒蛋白, 在体内组装成病毒样颗粒;(2)表达S蛋白全长的mRNA;(3)表达S蛋白RBD结构 域这三种方式,如图1所示。

实施例1 mRNA制备

为新型冠状病毒(SARS-CoV-2)4个结构基因S、M、E、N进行了密码子优化,并 为每条基因设计了多条编码序列。每种序列将被克隆到mRNA合成载体上。对每种序列, 制备两种mRNA,一种编码不含标签的野生型蛋白质,一种编码C端带有Flag标签,用 于后期表达验证。具体步骤为:

委托上海生工进行合成携带针对SARS-Cov-2的S蛋白(Spike protein)、M蛋白、E蛋白、N蛋白(氨基酸序列分别如SEQ ID NO.1、SEQ ID NO.9、SEQ ID NO.6和SEQ ID NO.12所示,四种蛋白的天然基因序列分别如SEQ ID NO.2(连接有3’-UTR-2)、SEQ ID NO.10、SEQID NO.7和SEQ ID NO.14(连接有3’-UTR-2)所示)的经过密码子优 化的基因序列,其中:S蛋白基因优化后的序列分别如SEQ ID NO.3(SGS,连接有 3’-UTR-2)、SEQ ID NO.4(SBL或S-benchling,连接有3’-UTR-1)、SEQ ID NO.5(STF, 连接有3’-UTR-1)、SEQ ID NO.24(SDC50,连接有3’-UTR-2)、SEQ ID NO.25(SDC54, 连接有3’-UTR-2)、SEQ ID NO.26(SDC58,连接有3’-UTR-2)、SEQ ID NO.27(SDC60, 连接有3’-UTR-2)所示;M蛋白基因优化后的序列如SEQ ID NO.11(MBL,连接有 3’-UTR-1)所示;E蛋白基因优化后的序列如SEQ IDNO.8(EBL,连接有3’-UTR-1) 所示;N蛋白基因优化后的序列如SEQ ID NO.13(NBL,连接有3’-UTR-2)所示。随 后将上述密码子优化过的基因序列亚克隆到含有T7启动子和5’非编码区(5’UTR,序列 如SEQ ID NO.15所示)、3’非编码区(3’UTR,序列如SEQ ID NO.16(3’-UTR-1)或 SEQ ID NO.17所示(3’-UTR-2))的载体(载体有两种:一种是在pUC19基础上加入 5’-UTR和3’-UTR-1区域;一种是在pUC57基础上加入5’-UTR和3’-UTR-2区域)中。 S、E、N和M蛋白分别在C-末端被HA和Flag标记。扩增载体后,通过限制性内切酶 进行酶切使载体线性化(操作步骤均为本领域常规)。进一步纯化酶切片段,并将其用作 体外转录(IVT)的模板,合成修饰的mRNA,具体为:使用具有1-2μg模板的HyperScribe T7高产率RNA合成试剂盒(ApexBio)和加帽cap0或cap1类似物(购于ApexBio)(每 个修饰的核苷酸7.5mM)进行IVT。将反应在37℃下孵育2-4小时,然后进行DNase (Thermo)处理。使用poly(A)加尾试剂盒(ApexBio)将3′poly(A)尾巴进一步添 加到IVT RNA产品中。通过使用RNAClean andConcentrator试剂盒(ApexBio)纯化 mRNA。其中,所得S蛋白基因优化后的mRNA序列分别如SEQ ID NO.18(SGS mRNA)、 SEQ ID NO.19(SBL mRNA)和SEQ ID NO.20(STF mRNA)、以及SEQ ID NO.31 (SDC50)、SEQ ID NO.32(SDC54)、SEQ ID NO.33(SDC58)、SEQ ID NO.34(SDC60) 所示,所得M蛋白基因优化后的mRNA序列如SEQ ID NO.22(MBL mRNA)所示,所 得E蛋白基因优化后的mRNA序列如SEQ ID NO.21(EBL mRNA)所示,所得N蛋白 基因优化后的mRNA序列如SEQ ID NO.23(NBL mRNA)所示。

实施例2 体外转录中掺入的修饰核苷酸

在实施例1中所述的体外转录合成修饰的mRNA过程中,在反应体系中按照一定比例加入修饰核苷酸,修饰核苷酸随机插入mRNA序列中。本实施例尝试使用的修饰核苷 酸包括5-methyl-CTP(简称5mC,ApexBio,#B7967)、pseudo-UTP(简称pseudoU, ApexBio,#B7972)、N1-Methylpseudo-UTP(简称N1-m-pseudo,ApexBio,#B8049)、 5-Methoxy-UTP(5moU,ApexBio,#B8061);用于mRNA的5’加帽的修饰核苷酸有 3′-O-Me-m7G(5')ppp(5')G(ARCA,cap0,APExBIO公司产品,#B8175)、 m7G(5')ppp(5')(2'OMeA)pG(APExBIO公司产品EZ Cap#B8176,cap1,)和 m7(3'OMeG)(5')ppp(5')(2'OMeA)pG(APExBIO公司产品EZ Cap#B8178,cap1类似物)。

具体实验步骤为:

(1)体外mRNA序列中插入多种上述修饰核苷酸;体外转录过程中随机插入修饰 核苷酸,按照修饰核苷酸与未修饰核苷酸1:5的摩尔比,掺入反应体系,使用APExBIO 公司试剂盒#K1047。按照试剂盒使用说明配置反应体系,37℃反应2-4小时。

(2)转录过程如加入5’加帽核苷酸;则在转录反应体系中同时加入5’ m7(3'OMeG)(5')ppp(5')(2'OMeA)pG、m7G(5')ppp(5')(2'OMeA)pG或 3′-O-Me-m7G(5')ppp(5')G,与GTP的摩尔比为8:1。

(3)3’端加入120个聚腺苷酸的序列;使用poly(A)拖尾试剂盒(APExBIO,#K1053)将3′poly(A)尾巴添加到IVT RNA产品中,按照试剂盒使用说明配置反应体系,37℃ 反应1小时。

(4)DNase消化DNA模板;使用NEB公司DNase I(货号#M0303S)进行DNA 模板消化,37℃反应1小时。

(5)mRNA纯化;用Thermo Fisher RiboPure Kit(#AM1924)纯化转录后,DNA 模板已消化的mRNA,用1mM柠檬酸钠,pH6.4的溶液洗脱。进行琼脂糖胶核酸电泳 检测mRNA,并用NanoDrop测定浓度。

实施例3 mRNA转染细胞

使用lipofectamine 2000(lipo2K,ThermoFisher Scientific#11668019)以质量体积比 1:2(mRNA:lipo2K,1g mRNA+2L lipo2K)的比例将实施例1和2中所得S、M、E、 N的mRNA分别转染到293A细胞中,24hr后用Western Blot检测蛋白表达。所得结果 如图2所示。

图2中,数字代表mRNA序列中插入了不同的修饰核苷酸:1.cap1;2. cap1+5mC+pseudoU;3.cap1+pseudoU;4.cap1+5moU;5.cap1+N1-m-pseudo;6.cap1+5mC。 细胞表达的N蛋白和E蛋白均带有HA序列标签,用抗HA的抗体做western blot检测细 胞中蛋白表达情况,GAPDH蛋白作为阳性对照。其中:

如图2的A所示,N蛋白较小,各序列和修饰优化的mRNA,均可以在细胞内表达 N蛋白,其中cap1+5mC+pseudoU(lane 2)和cap1+5moU(lane 4)两种修饰时的mRNA 对于蛋白表达相对较低。

图2的B中,EBL序列表达很强,用抗HA标签肽的抗体检测,信号很强。MBL序 列接flag标签肽,用抗flag标签肽的抗体进行检测,发现cap1(lane 1)、cap1+pseudoU (lane3)以及cap1+N1-m-pseudo(lane 5)、cap1+5mC(lane 6)四种修饰组合表达较好, cap1+5mC+pseudoU(lane 2)和cap1+5moU(lane 4)两种修饰组合的mRNA,E蛋白表 达量很低。

图2的C和D中,表达S蛋白的序列分别连接HA标签肽或flag标签肽,用抗HA 或flag标签肽的抗体进行检测,蛋白表达差异极大。由图2的C中可以看出,未经优化 的天然S基因序列在293A细胞中几乎不表达,或表达量极低。STF和SBL两种优化序 列比天然S基因序列的蛋白表达略有提高,cap1+pseudoU(lane 3)和cap1+N1-m-pseudo (lane 5)修饰的STF表达的蛋白量相对较高,cap1+pseudo(lane 3)修饰的SBL表达的 蛋白量相对较高。SGS基因优化序列使得蛋白表达大为上升,表达量最好的是添加 cap+pseudoU(lane 3)和cap1+N1-m-pseudoU(lane 5)两种方式修饰的SGS序列,cap1 (lane 1)、cap1+5mC+pseudoU(lane2)、cap1+5mC(lane 6)修饰的SGS序列的表达量 也较高。由图2的D中可以看出,SDC50、SDC54、SDC58、SDC60这些优化后的序列 所表达的蛋白中,杂蛋白很多。

图2的E中,带有HA标签肽的SGS-RBD优化mRNA序列(mRNA序列如SEQ ID NO.37所示,其对应的DNA序列如SEQ ID NO.30所述,均采用pseudoU多核苷酸修 饰,5’加帽的结构为Cap1,3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所 示的5’UTR和序列如SEQ IDNO.16或17所示的3’UTR)可以在细胞中高表达S蛋白 RBD结构域。

图2的F中,用一条mRNA序列串联表达两种蛋白M和E(即将M蛋白和E蛋白 的mRNA相连接后再进行表达,中间可使用不同的表达2A肽段的mRNA进行连接(蛋 白表达后,2A肽经过“自剪切”,最后能够得到独立的M和E蛋白。在天然病毒2A序 列基础上,经过优化所得的表达2A肽段的T2A和P2A多肽对应的DNA序列如SEQ ID NO.38和SEQ ID NO.39所示,T2AmRNA序列如SEQ ID NO.40所示,P2A mRNA序 列如SEQ ID NO.41所示,可翻译成多肽(对于序列为SEQ ID NO.42和SEQ ID NO.43)),mRNA相连接后所得MT2AE的mRNA序列如SEQ IDNO.35所示(对应的DNA序列 如SEQ ID NO.28所示),mRNA相连接后所得MP2AE的mRNA序列如SEQ ID NO.36 所示(对应的DNA序列如SEQ ID NO.29所示),采用pseudoU多核苷酸修饰,5’加帽 的结构为Cap1,3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所示的5’UTR 和序列如SEQ ID NO.16或17所示的3’UTR),western blot显示位置接近的双带代表了 M和E蛋白。MP2AE优化mRNA序列得到的两个蛋白量更接近。

实施例4 病毒样颗粒的制备和观测

为了产生病毒样颗粒(VLP),将表达的S、M、E蛋白的mRNA(SGS mRNA、MBL mRNA、EBLmRNA组成的SME mRNA,均采用pseudoU多核苷酸修饰,5’加帽的结 构为Cap1,3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所示的5’UTR和序 列如SEQ ID NO.16或17所示的3’UTR),按照摩尔比1:0.5:0.5的比例,用lipo2K包被, 共转染到293A细胞中,并在转染后48小时收集上清液。或者采用串联表达M蛋白、E 蛋白的mRNA(即将M蛋白和E蛋白的mRNA相连接后再进行后续步骤,中间可使用 不同的连接肽进行连接,mRNA相连接后的序列如SEQ IDNO.35或36所示),与表达 S蛋白mRNA(SEQ ID NO.3)按照摩尔比2:1的比例,用lipo2K包被,共转染到293A 细胞中,并在转染后48小时收集上清液。

将所收集的上清液使用100kDa截止浓度的Amicon Ultra-15(Millipore)进行浓缩, 然后将其置于合适的溶液中(20mM HEPES,pH 7.4,120mM NaCl)。在4℃,31,000rpm(Beckman超速离心机,转子型号SW32)超速离心90分钟后,立即用5mL注射器提 取30-40%(w/v)之间的蔗糖溶液,该溶液包含病毒样颗粒(VLP)。使用100kDa截止 浓度的AmiconUltra-15离心管进行溶液交换,将含VLP的溶液置换成PBS缓冲液。为 了制备用于负染透射电子显微镜(TEM)的栅格,将5μL VLP溶液在辉光放电碳涂层的 栅格上吸收2分钟。将网格以逐滴滴加方式染色60秒,然后加载到Talos L120C显微镜 (Thermofisher)上以可视化VLP。结果如图3显示,图3的a中显示了由mRNA转录 出的S、E和M蛋白,自组装成的新冠病毒样颗粒电镜照片;图3的b中单个病毒样颗 粒放大照片,并测量表面棘突大小;图3的c为新冠病毒样颗粒的卡通模式图。由图3 可知,电镜下VLP颗粒直径约90nm,表面形成与天然病毒类似的三聚体棘突,棘突大 小约12×13nm,与天然病毒的大小和结构都极为接近。

实施例5 mRNA包被方法

按照之前报道,将实施例2中所得含有修饰核苷酸的mRNA(分别为表达S蛋白RBD结构域的mRNA;可以表达S蛋白的SGS mRNA;表达S、M、E三种蛋白的SME mRNA, 按摩尔比为1:0.5:0.5混合表达;均采用pseudoU多核苷酸修饰,5’加帽的结构为Cap1, 3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所示的5’UTR和序列如SEQ ID NO. 16或17所示的3’UTR)利用DLin-MC3-DMA(APExBIO,#A8791),在低pH下可离 子化(阳离子),两种辅助脂质(DSPC和胆固醇)和聚乙二醇化脂质(DMPE-PEG2000) 包被成纳米颗粒(示意图如图4所示)。通过将溶解在超纯水中的mRNA,与pH 3.0的 100μmM柠檬酸盐缓冲液1:1(v/v)混合,制备了mRNA的水溶液。调节四种脂质组 分[可离子化的脂质:胆固醇:DSPC:DMPE-PEG2000]比例(50:10:38.5:1.5),在乙醇(99.5%) 中溶解为脂质溶液。将mRNA和脂质溶液在NanoAssemblr(Precision Nanosystems)微 流体混合系统中以Aq:EtOH=3:1的体积混合比和恒定的12mL/min的总流速混合, 得到包含mRNA的脂质体纳米颗粒(LNP)。

为了表征上述所制得的LNP,在制备后,将25μL的样品级分注入975的10μmM磷 酸盐缓冲液(pH 7.4)中,并用于在ZetaSizer(Malvern Instruments Inc.)上测量强度平 均粒径(Z平均值)。立即将样品馏分转移至Slide-a-lyzer G2透析盒(10000MWCO,ThermoFischer Scientific Inc.)中,并在4℃下针对PBS(pH7.4)透析过夜。PBS缓冲液的体积 为样品体积的650-800倍。收集样品部分,并从该体积中将25μL注入975μL 10μmM 磷酸盐缓冲液(pH 7.4)中,并再次测量粒径(透析后粒径),LNP在透析前后,直径约 100nm,均处于均一稳定的状态,见图5和表1。透析过的样品用来进行小鼠注射免疫。 图5中显示了表达S蛋白的SGS mRNA对应的结果图,以及表达S蛋白RBD结构域的 mRNA和表达S、M、E三种蛋白的对应mRNA的包装结果。这些结果表面,mRNA经 过脂质体包装后,粒径在100-110nm之间,包装效率大于90%。

表1中显示了用ZetaView检测LNP包被后mRNA样品粒径及分布。S-RBD mRNA 可以表达S蛋白RBD结构域,SGS mRNA可以表达S蛋白,SME mRNA表达S、M、E 三种蛋白,可以形成病毒样颗粒。包被后的LNP粒径均在100-110nm之间,符合纳米颗 粒大小预期。稀释后数量在100-300之间,稀释比例适合。在用1xPBS透析并用0.22μM 或0.45μM滤膜过滤后,粒径和数量保持稳定,可用于后续动物实验。

表1

实施例6 小鼠免疫实验

将包被好的表达新冠病毒VLP(含有实施例5中所述的表达S蛋白的SGS mRNA,或表达S、M、E三种蛋白的SME mRNA)或RBD(含有实施例5中所述的表达S蛋白RBD结 构域的mRNA)的上述脂质体纳米颗粒,配合免疫佐剂,进行Balb/c小鼠(肌肉(i.m.) 注射,具体信息如下表2所示。在第42天收集血液样品,并且在荧光抗体病毒中和检测中 分析血清,具体如以下实施例7所述。

表2

组别 品系 数量 途径体积 疫苗剂量 接种时间
1 Balb/c 8 i.m.50μl×3 Control PBS D0,致敏;D14,加强免疫;D35,加强免疫
2 Balb/c 8 i.m.50μl×1 mRNA 10μg D0,致敏
3 Balb/c 8 i.m.50μl×2 mRNA 10μg D0,致敏;D14,加强免疫;
4 Balb/c 8 i.m.50μl×3 mRNA 10μg D0,致敏;D14,加强免疫;D35,加强免疫

实施例7 酶联免疫吸附法测定血清中抗体效价

分别用2μg/ml抗原蛋白(溶于PBS),100ng,包被96孔ELISA板,50μl/孔,4 度过夜避光。其中S蛋白抗原购自Sino Biological,货号40589-V08B1;S蛋白RBD结 构域,购自Novoprotein,货号DRA36。PBST(0.05%Tween)洗3次,200μl/孔,每次 倒扣ELISA板并轻击清洗干净。加100μl/孔2%BSA(溶于PBST)封闭,室温孵育1hr。 PBST(0.05%Tween)洗3次,200μl/孔,每次倒扣ELISA板并轻击清洗干净。加小鼠 血清(稀释100倍作为起始浓度,之后5倍梯度稀释,共6个梯度)于PBS中,混合均 匀后取100μl分别加到ELISA板中,室温孵育2hr。将实施例6中的小鼠进行眼周采血 100μl约20μl血清。洗板后,分别加HRP-anti-mouseIgG(1:5000稀释于PBS),100μl/ 孔,室温孵育1hr。洗板后,分别加TMB底物(ThermoFisher,货号34022),50μl/孔, 室温等待5-15min(避光),将呈现蓝色。分别加1M硫酸终止反应,150μl/孔,蓝色将 变黄。酶标仪读取OD450。

所得结果如图6所示,其中可见mRNA表达病毒样颗粒产生的抗体滴度最高,达到107。表达S蛋白的mRNA产生的抗体滴度达到106。表达S蛋白RBD结构域的mRNA, 产生的抗体滴度为104。可见使用本发明的mRNA表达的病毒样颗粒均可有效激活小鼠 免疫系统,促使血清中产生抗体,有效发挥疫苗的作用。

实施例8 中和抗体检测实验

通过病毒中和测定进行病毒中和抗体应答(特异性B细胞免疫应答)的检测。所述测定 的结果称为病毒中和滴度(VNT)。按照WHO标准,如果各自的VNT是至少0.5IU/ml,则认为抗体滴度是保护性的。因此,在第42天从实施例6中所述接种的小鼠中采集血液样品,并且制备血清。这些血清用于荧光抗体滴度中和(FAVN)检测,所述检测使用人源CACO-2 细胞。培养细胞用假病毒颗粒(表达新冠病毒S蛋白,内核为EGFP DNA)进行感染。之 后不久,以连续两倍稀释四次一式四份检测热灭活的血清,检测其以50μl的体积中和假 病毒颗粒的100TCID50(组织培养感染剂量50%)的潜力。因此,将血清稀释液在37℃(在 具有5%CO2的潮湿的培养箱中)用病毒温育1小时,然后加入胰蛋白酶解的CACO-2细胞(4 ×105个细胞/ml;50μl/孔)。将感染的细胞培养物在潮湿的培养箱中在37℃和5%CO2培养 48小时。在室温用80%丙酮固定细胞后,用荧光检测EGFP表达,用量标志细胞的感染情 况。

根据图7的结果可知,本发明的疫苗(以表达病毒样颗粒的形式、仅表达S蛋白的疫苗的形式或者以仅表达S蛋白中RBD区域的疫苗的形式)均可有效激活小鼠免疫系统, 血清中产生抗体,且其安全性高、有效性好。其中表达病毒样颗粒的mRNA组合产生的 中和抗体滴度最高。不会产生非中和抗体从而不会产生抗体依赖增强感染效应。

参考文献

1.Huang C,Wang Y,Li X,Ren L,Zhao J,et al.2020.Lancet

2.Zhu N,Zhang D,Wang W,Li X,Yang B,et al.2020.N Engl J Med

3.de Wit E,van Doremalen N,Falzarano D,Munster VJ.2016.Nat RevMicrobiol 14:523-34

4.Potter CW.2001.J Appl Microbiol 91:572-9

5.Smith W,Andrewes CH,Laidlaw PP.1933.Lancet 2:66-8

6.Barberis I,Myles P,Ault SK,Bragazzi NL,Martini M.2016.J Prev MedHyg 57: E115-E20

7.Wolff JA,Malone RW,Williams P,Chong W,Acsadi G,et al.1990.Science247: 1465-8

8.Jirikowski GF,Sanna PP,Maciejewski-Lenoir D,Bloom FE.1992.Science255:996-8

9.Zangi L,Lui KO,von Gise A,Ma Q,Ebina W,et al.2013.Nat Biotechnol31:898-907

10.Kariko K,Muramatsu H,Ludwig J,Weissman D.2011.Nucleic Acids Res39:e142

11.Reichmuth AM,Oberli MA,Jaklenec A,Langer R,BlankschteinD.2016.Ther Deliv 7: 319-34

12.Sahin U,Kariko K,Tureci O.2014.Nat Rev Drug Discov 13:759-80

13.Pardi N,Hogan MJ,Porter FW,Weissman D.2018.Nat Rev Drug Discov 17:261-79

14.Hekele A,Bertholet S,Archer J,Gibson DG,Palladino G,etal.2013.Emerg Microbes Infect 2:e52

15.Richner JM,Himansu S,Dowd KA,Butler SL,Salazar V,et al.2017.Cell169:176

16.Richner JM,Jagger BW,Shan C,Fontes CR,Dowd KA,et al.2017.Cell 170:273-83 e12

17.Feldman RA,Fuhr R,Smolenov I,Mick Ribeiro A,Panther L,etal.2019.Vaccine 37: 3326-34

18.Chroboczek J,Szurgot I,Szolajska E.2014.Acta Biochim Pol 61:531-9

19.Yong CY,Ong HK,Yeap SK,Ho KL,Tan WS.2019.Front Microbiol 10:1781

20.Baric RS,Sheahan T,Deming D,Donaldson E,Yount B,et al.2006.Adv ExpMed Biol 581:553-60

21.Yip MS,Leung HL,Li PH,Cheung CY,Dutry I,et al.2016.Hong Kong Med J22: 25-31

22.Millet JK,Tang T,Nathan L,Jaimes JA,Hsu HL,et al.2019.J Vis Exp

23.Islam MA,Xu Y,Tao W,Ubellacker JM,Lim M,et al.2018.Nat Biomed Eng2: 850-64

SEQUENCE LISTING

<110> 上海蓝鹊生物医药有限公司

<120> mRNA及包含其的新冠病毒mRNA疫苗

<130> P20011191C

<160> 43

<170> PatentIn version 3.5

<210> 1

<211> 1273

<212> PRT

<213> SARS-COV-2

<400> 1

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val

1 5 10 15

Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe

20 25 30

Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu

35 40 45

His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp

50 55 60

Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp

65 70 75 80

Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu

85 90 95

Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser

100 105 110

Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile

115 120 125

Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr

130 135 140

Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr

145 150 155 160

Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu

165 170 175

Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe

180 185 190

Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr

195 200 205

Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu

210 215 220

Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr

225 230 235 240

Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser

245 250 255

Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro

260 265 270

Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala

275 280 285

Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys

290 295 300

Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val

305 310 315 320

Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys

325 330 335

Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala

340 345 350

Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu

355 360 365

Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro

370 375 380

Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe

385 390 395 400

Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly

405 410 415

Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys

420 425 430

Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn

435 440 445

Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe

450 455 460

Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys

465 470 475 480

Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly

485 490 495

Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val

500 505 510

Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys

515 520 525

Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn

530 535 540

Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu

545 550 555 560

Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val

565 570 575

Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe

580 585 590

Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val

595 600 605

Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile

610 615 620

His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser

625 630 635 640

Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val

645 650 655

Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala

660 665 670

Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala

675 680 685

Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser

690 695 700

Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile

705 710 715 720

Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val

725 730 735

Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu

740 745 750

Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr

755 760 765

Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln

770 775 780

Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe

785 790 795 800

Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser

805 810 815

Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly

820 825 830

Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp

835 840 845

Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu

850 855 860

Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly

865 870 875 880

Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile

885 890 895

Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr

900 905 910

Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn

915 920 925

Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala

930 935 940

Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn

945 950 955 960

Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val

965 970 975

Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln

980 985 990

Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val

995 1000 1005

Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn

1010 1015 1020

Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys

1025 1030 1035

Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro

1040 1045 1050

Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val

1055 1060 1065

Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His

1070 1075 1080

Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn

1085 1090 1095

Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln

1100 1105 1110

Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val

1115 1120 1125

Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro

1130 1135 1140

Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn

1145 1150 1155

His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn

1160 1165 1170

Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu

1175 1180 1185

Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu

1190 1195 1200

Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp Leu

1205 1210 1215

Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile Met

1220 1225 1230

Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys

1235 1240 1245

Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro

1250 1255 1260

Val Leu Lys Gly Val Lys Leu His Tyr Thr

1265 1270

<210> 2

<211> 3819

<212> DNA

<213> SARS-COV-2

<400> 2

atgtttgttt ttcttgtttt attgccacta gtctctagtc agtgtgttaa tcttacaacc 60

agaactcaat taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac 120

aaagttttca gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc 180

aatgttactt ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat 240

aaccctgtcc taccatttaa tgatggtgtt tattttgctt ccactgagaa gtctaacata 300

ataagaggct ggatttttgg tactacttta gattcgaaga cccagtccct acttattgtt 360

aataacgcta ctaatgttgt tattaaagtc tgtgaatttc aattttgtaa tgatccattt 420

ttgggtgttt attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat 480

tctagtgcga ataattgcac ttttgaatat gtctctcagc cttttcttat ggaccttgaa 540

ggaaaacagg gtaatttcaa aaatcttagg gaatttgtgt ttaagaatat tgatggttat 600

tttaaaatat attctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt 660

tcggctttag aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact 720

ttacttgctt tacatagaag ttatttgact cctggtgatt cttcttcagg ttggacagct 780

ggtgctgcag cttattatgt gggttatctt caacctagga cttttctatt aaaatataat 840

gaaaatggaa ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag 900

tgtacgttga aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc 960

caaccaacag aatctattgt tagatttcct aatattacaa acttgtgccc ttttggtgaa 1020

gtttttaacg ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac 1080

tgtgttgctg attattctgt cctatataat tccgcatcat tttccacttt taagtgttat 1140

ggagtgtctc ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt 1200

gtaattagag gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat 1260

tataattata aattaccaga tgattttaca ggctgcgtta tagcttggaa ttctaacaat 1320

cttgattcta aggttggtgg taattataat tacctgtata gattgtttag gaagtctaat 1380

ctcaaacctt ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt 1440

aatggtgttg aaggttttaa ttgttacttt cctttacaat catatggttt ccaacccact 1500

aatggtgttg gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca 1560

ccagcaactg tttgtggacc taaaaagtct actaatttgg ttaaaaacaa atgtgtcaat 1620

ttcaacttca atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg 1680

cctttccaac aatttggcag agacattgct gacactactg atgctgtccg tgatccacag 1740

acacttgaga ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca 1800

ggaacaaata cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc 1860

cctgttgcta ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct 1920

aatgtttttc aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat 1980

gagtgtgaca tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct 2040

cctcggcggg cacgtagtgt agctagtcaa tccatcattg cctacactat gtcacttggt 2100

gcagaaaatt cagttgctta ctctaataac tctattgcca tacccacaaa ttttactatt 2160

agtgttacca cagaaattct accagtgtct atgaccaaga catcagtaga ttgtacaatg 2220

tacatttgtg gtgattcaac tgaatgcagc aatcttttgt tgcaatatgg cagtttttgt 2280

acacaattaa accgtgcttt aactggaata gctgttgaac aagacaaaaa cacccaagaa 2340

gtttttgcac aagtcaaaca aatttacaaa acaccaccaa ttaaagattt tggtggtttt 2400

aatttttcac aaatattacc agatccatca aaaccaagca agaggtcatt tattgaagat 2460

ctacttttca acaaagtgac acttgcagat gctggcttca tcaaacaata tggtgattgc 2520

cttggtgata ttgctgctag agacctcatt tgtgcacaaa agtttaacgg ccttactgtt 2580

ttgccacctt tgctcacaga tgaaatgatt gctcaataca cttctgcact gttagcgggt 2640

acaatcactt ctggttggac ctttggtgca ggtgctgcat tacaaatacc atttgctatg 2700

caaatggctt ataggtttaa tggtattgga gttacacaga atgttctcta tgagaaccaa 2760

aaattgattg ccaaccaatt taatagtgct attggcaaaa ttcaagactc actttcttcc 2820

acagcaagtg cacttggaaa acttcaagat gtggtcaacc aaaatgcaca agctttaaac 2880

acgcttgtta aacaacttag ctccaatttt ggtgcaattt caagtgtttt aaatgatatc 2940

ctttcacgtc ttgacaaagt tgaggctgaa gtgcaaattg ataggttgat cacaggcaga 3000

cttcaaagtt tgcagacata tgtgactcaa caattaatta gagctgcaga aatcagagct 3060

tctgctaatc ttgctgctac taaaatgtca gagtgtgtac ttggacaatc aaaaagagtt 3120

gatttttgtg gaaagggcta tcatcttatg tccttccctc agtcagcacc tcatggtgta 3180

gtcttcttgc atgtgactta tgtccctgca caagaaaaga acttcacaac tgctcctgcc 3240

atttgtcatg atggaaaagc acactttcct cgtgaaggtg tctttgtttc aaatggcaca 3300

cactggtttg taacacaaag gaatttttat gaaccacaaa tcattactac agacaacaca 3360

tttgtgtctg gtaactgtga tgttgtaata ggaattgtca acaacacagt ttatgatcct 3420

ttgcaacctg aattagactc attcaaggag gagttagata aatattttaa gaatcataca 3480

tcaccagatg ttgatttagg tgacatctct ggcattaatg cttcagttgt aaacattcaa 3540

aaagaaattg accgcctcaa tgaggttgcc aagaatttaa atgaatctct catcgatctc 3600

caagaacttg gaaagtatga gcagtatata aaatggccat ggtacatttg gctaggtttt 3660

atagctggct tgattgccat agtaatggtg acaattatgc tttgctgtat gaccagttgc 3720

tgtagttgtc tcaagggctg ttgttcttgt ggatcctgct gcaaatttga tgaagacgac 3780

tctgagccag tgctcaaagg agtcaaatta cattacaca 3819

<210> 3

<211> 3819

<212> DNA

<213> Artificial Sequence

<220>

<223> S蛋白基因优化后的序列(S-GS)

<400> 3

atgttcgtct tcctggtcct gctgcctctg gtctcctcac agtgcgtcaa tctgacaact 60

cggactcagc tgccacctgc ttatactaat agcttcacca gaggcgtgta ctatcctgac 120

aaggtgttta gaagctccgt gctgcactct acacaggatc tgtttctgcc attctttagc 180

aacgtgacct ggttccacgc catccacgtg agcggcacca atggcacaaa gcggttcgac 240

aatcccgtgc tgccttttaa cgatggcgtg tacttcgcct ctaccgagaa gagcaacatc 300

atcagaggct ggatctttgg caccacactg gactccaaga cacagtctct gctgatcgtg 360

aacaatgcca ccaacgtggt catcaaggtg tgcgagttcc agttttgtaa tgatcccttc 420

ctgggcgtgt actatcacaa gaacaataag agctggatgg agtccgagtt tagagtgtat 480

tctagcgcca acaactgcac atttgagtac gtgagccagc ctttcctgat ggacctggag 540

ggcaagcagg gcaatttcaa gaacctgagg gagttcgtgt ttaagaatat cgacggctac 600

ttcaaaatct actctaagca cacccccatc aacctggtgc gcgacctgcc tcagggcttc 660

agcgccctgg agcccctggt ggatctgcct atcggcatca acatcacccg gtttcagaca 720

ctgctggccc tgcacagaag ctacctgaca cccggcgact cctctagcgg atggaccgcc 780

ggcgctgccg cctactatgt gggctacctc cagccccgga ccttcctgct gaagtacaac 840

gagaatggca ccatcacaga cgcagtggat tgcgccctgg accccctgag cgagacaaag 900

tgtacactga agtcctttac cgtggagaag ggcatctatc agacatccaa tttcagggtg 960

cagccaaccg agtctatcgt gcgctttcct aatatcacaa acctgtgccc atttggcgag 1020

gtgttcaacg caacccgctt cgccagcgtg tacgcctgga ataggaagcg gatcagcaac 1080

tgcgtggccg actatagcgt gctgtacaac tccgcctctt tcagcacctt taagtgctat 1140

ggcgtgtccc ccacaaagct gaatgacctg tgctttacca acgtctacgc cgattctttc 1200

gtgatcaggg gcgacgaggt gcgccagatc gcccccggcc agacaggcaa gatcgcagac 1260

tacaattata agctgccaga cgatttcacc ggctgcgtga tcgcctggaa cagcaacaat 1320

ctggattcca aagtgggcgg caactacaat tatctgtacc ggctgtttag aaagagcaat 1380

ctgaagccct tcgagaggga catctctaca gaaatctacc aggccggcag caccccttgc 1440

aatggcgtgg agggctttaa ctgttatttc ccactccagt cctacggctt ccagcccaca 1500

aacggcgtgg gctatcagcc ttaccgcgtg gtggtgctga gctttgagct gctgcacgcc 1560

ccagcaacag tgtgcggccc caagaagtcc accaatctgg tgaagaacaa gtgcgtgaac 1620

ttcaacttca acggcctgac cggcacaggc gtgctgaccg agtccaacaa gaagttcctg 1680

ccatttcagc agttcggcag ggacatcgca gataccacag acgccgtgcg cgacccacag 1740

accctggaga tcctggacat cacaccctgc tctttcggcg gcgtgagcgt gatcacaccc 1800

ggcaccaata caagcaacca ggtggccgtg ctgtatcagg acgtgaattg taccgaggtg 1860

cccgtggcta tccacgccga tcagctgacc ccaacatggc gggtgtacag caccggctcc 1920

aacgtcttcc agacaagagc cggatgcctg atcggagcag agcacgtgaa caattcctat 1980

gagtgcgaca tcccaatcgg cgccggcatc tgtgcctctt accagaccca gacaaactct 2040

cccagaagag cccggagcgt ggcctcccag tctatcatcg cctataccat gtccctgggc 2100

gccgagaaca gcgtggccta ctctaacaat agcatcgcca tcccaaccaa cttcacaatc 2160

tctgtgacca cagagatcct gcccgtgtcc atgaccaaga catctgtgga ctgcacaatg 2220

tatatctgtg gcgattctac cgagtgcagc aacctgctgc tccagtacgg cagcttttgt 2280

acccagctga atagagccct gacaggcatc gccgtggagc aggataagaa cacacaggag 2340

gtgttcgccc aggtgaagca aatctacaag acccccccta tcaaggactt tggcggcttc 2400

aatttttccc agatcctgcc tgatccatcc aagccttcta agcggagctt tatcgaggac 2460

ctgctgttca acaaggtgac cctggccgat gccggcttca tcaagcagta tggcgattgc 2520

ctgggcgaca tcgcagccag ggacctgatc tgcgcccaga agtttaatgg cctgaccgtg 2580

ctgccacccc tgctgacaga tgagatgatc gcacagtaca caagcgccct gctggccggc 2640

accatcacat ccggatggac cttcggcgca ggagccgccc tccagatccc ctttgccatg 2700

cagatggcct ataggttcaa cggcatcggc gtgacccaga atgtgctgta cgagaaccag 2760

aagctgatcg ccaatcagtt taactccgcc atcggcaaga tccaggacag cctgtcctct 2820

acagccagcg ccctgggcaa gctccaggat gtggtgaatc agaacgccca ggccctgaat 2880

accctggtga agcagctgag cagcaacttc ggcgccatct ctagcgtgct gaatgacatc 2940

ctgagccggc tggacaaggt ggaggcagag gtgcagatcg accggctgat caccggccgg 3000

ctccagagcc tccagaccta tgtgacacag cagctgatca gggccgccga gatcagggcc 3060

agcgccaatc tggcagcaac caagatgtcc gagtgcgtgc tgggccagtc taagagagtg 3120

gacttttgtg gcaagggcta tcacctgatg tccttccctc agtctgcccc acacggcgtg 3180

gtgtttctgc acgtgaccta cgtgcccgcc caggagaaga acttcaccac agcccctgcc 3240

atctgccacg atggcaaggc ccactttcca agggagggcg tgttcgtgtc caacggcacc 3300

cactggtttg tgacacagcg caatttctac gagccccaga tcatcaccac agacaacacc 3360

ttcgtgagcg gcaactgtga cgtggtcatc ggcatcgtga acaataccgt gtatgatcca 3420

ctccagcccg agctggacag ctttaaggag gagctggata agtatttcaa gaatcacacc 3480

tcccctgacg tggatctggg cgacatcagc ggcatcaatg cctccgtggt gaacatccag 3540

aaggagatcg accgcctgaa cgaggtggct aagaatctga acgagagcct gatcgacctc 3600

caggagctgg gcaagtatga gcagtacatc aagtggccct ggtacatctg gctgggcttc 3660

atcgccggcc tgatcgccat cgtgatggtg accatcatgc tgtgctgtat gacatcctgc 3720

tgttcttgcc tgaagggctg ctgtagctgt ggctcctgct gtaagtttga cgaggatgac 3780

tctgaacctg tgctgaaggg cgtgaagctg cattacacc 3819

<210> 4

<211> 3819

<212> DNA

<213> Artificial Sequence

<220>

<223> S蛋白基因优化后的序列 (SBL)

<400> 4

atgttcgttt tcctcgttct gctgcctctt gtcagctctc agtgtgtgaa cctgacaact 60

agaacacaac tacctcccgc ctacacaaac tctttcaccc ggggcgtgta ctacccagac 120

aaagtgttca ggagctctgt gttgcacagc acccaagacc tgtttttgcc attctttagt 180

aatgtgacct ggtttcacgc tatccatgtg tcgggcacca acgggaccaa aagattcgac 240

aaccccgttc tgccgttcaa cgacggcgtg tacttcgcta gcactgagaa gtccaacatt 300

attcgcgggt ggatcttcgg aactaccttg gactccaaaa cacagtctct actcatcgtg 360

aacaacgcga ctaacgtggt gattaaggtg tgtgaatttc agttctgcaa tgatccattt 420

ttaggagtgt actaccacaa aaataataaa tcatggatgg agtctgaatt tcgcgtatac 480

agtagcgcta ataactgtac attcgaatat gttagccaac cctttttgat ggacttagag 540

gggaagcagg gaaattttaa gaatttgcga gaatttgtgt tcaaaaatat cgatgggtat 600

ttcaagatct actccaagca tactcccata aatctggtgc gcgacttacc tcaagggttc 660

agcgcactgg agccactggt agacctgcca atcggcatca acatcacccg attccagacc 720

ctgcttgctc tgcaccgttc atatctgaca ccaggagatt cgtcttccgg atggacagca 780

ggggccgctg cttactatgt tggttatctt cagcctcgga cctttctgct caagtataat 840

gagaatggga ccattaccga cgctgttgat tgtgctctcg atcccctgtc agaaaccaag 900

tgcacactaa aatctttcac agtcgaaaag gggatctacc agacttctaa ctttcgtgta 960

cagcccaccg agagcatcgt caggttccca aatatcacta acctgtgtcc ttttggcgag 1020

gtgttcaacg ctacaagatt tgctagcgtg tacgcctgga acagaaaaag aatatcaaat 1080

tgcgtagccg attacagcgt cttatataac tctgcatcct tctcaacttt caagtgttat 1140

ggagtgagcc cgactaagct gaatgatttg tgctttacaa atgtttatgc cgattcattc 1200

gtgatccggg gcgacgaggt cagacagatc gcccctggcc aaacaggtaa gattgctgat 1260

tacaactaca aattacctga cgattttaca ggatgcgtta tcgcttggaa ctctaacaat 1320

ctcgattcta aggtcggcgg caattacaat tatctttatc gccttttcag gaagtcaaat 1380

cttaagccat tcgagcgaga catcagtacc gagatatacc aggcggggtc caccccgtgt 1440

aacggtgtcg agggtttcaa ctgctacttt ccactgcagt cctatgggtt ccagcccacc 1500

aatggcgtgg gttaccagcc ctaccgagta gtcgtattgt cttttgagct cttgcacgcc 1560

cccgccacgg tgtgcggtcc aaagaaatca actaacttag ttaagaataa atgtgtgaat 1620

tttaacttta acggcctgac agggacagga gtcctgacag aatccaataa gaagttcctt 1680

ccctttcagc agtttggacg cgacatcgca gacaccacag acgccgtgcg tgacccccaa 1740

actctcgaaa ttctcgatat cacaccctgc agttttggcg gggtcagtgt cattacccct 1800

gggaccaata ctagtaacca ggtcgcagtg ctttaccaag atgtcaactg taccgaggtt 1860

cctgtggcta ttcacgcaga ccaactgact ccgacttggc gggtgtatag tacaggctcc 1920

aatgtgtttc agacccgggc aggctgcctg attggggccg agcatgtaaa taactcctac 1980

gagtgcgata tccccatagg tgctggaata tgtgccagtt atcagaccca gacgaactcg 2040

ccaagacgag ctaggtccgt agcctctcag agcataatcg cgtacactat gagcctgggg 2100

gccgaaaatt ccgtggcata tagcaacaac agcattgcta ttcctactaa ctttacaatt 2160

tcagtcacga cggagatcct gccagtctcc atgactaaaa cctccgtgga ctgtacgatg 2220

tacatttgtg gcgattcaac tgaatgctct aacctgctct tacagtacgg ttctttttgt 2280

acccagctga accgggcatt gacgggcatc gcagttgagc aggacaagaa tactcaggag 2340

gtgtttgcgc aagtgaagca aatttataaa actcctccca ttaaggactt tggcggtttc 2400

aacttctcgc agatcctacc tgacccatca aaacctagca agaggtcttt cattgaagac 2460

cttctgttca acaaggtcac actggctgac gccggcttca ttaaacagta cggagattgt 2520

ctaggtgata ttgcagcgcg cgatctgatt tgcgcacaga agtttaacgg cctgacggtc 2580

ttaccccctc tccttaccga cgaaatgatt gcccagtaca ccagcgccct gctcgctggc 2640

acgattacta gcggatggac atttggggcc ggcgctgccc tccagatacc atttgccatg 2700

cagatggcgt ataggtttaa cggcatagga gtaacccaga acgtgctgta cgagaaccaa 2760

aaactgatag ccaatcaatt caatagtgcc ataggaaaga tacaggacag tctcagcagc 2820

accgcgtccg ctctcggaaa gctacaagat gtggtcaacc agaacgcgca ggcattgaat 2880

acactggtga agcagctctc ctcgaatttt ggagcaatca gcagcgtgct gaatgatatc 2940

ctgtctcggc tggacaaggt tgaagccgaa gtccagatcg acaggttaat caccggtcgg 3000

ctgcagagtc tccagacata tgttacccag caactcatca gagctgccga aatacgcgcc 3060

agtgccaatc ttgcagccac taagatgtcc gagtgcgtgt tggggcaaag taaaagggtt 3120

gatttctgtg gaaaaggata tcatcttatg agtttccctc aatccgcccc tcacggagtt 3180

gtcttcctgc atgtgaccta cgtgccagcg caggagaaga acttcacgac cgcccccgcc 3240

atctgccatg atggcaaggc ccattttccc cgcgaaggag tgttcgtatc caatggcacc 3300

cactggttcg tgacgcagag aaatttttat gagccgcaaa ttatcactac cgacaacaca 3360

ttcgtttccg gcaattgcga tgtcgtaatc gggatcgtga ataatacagt ctatgatcct 3420

cttcagccag aactcgattc attcaaagag gagctggata aatatttcaa gaaccacacc 3480

tcccccgatg tggatctggg tgacatatca ggaattaacg caagcgtcgt gaacattcag 3540

aaggaaatcg acaggctcaa tgaagtagca aagaacttga atgagtctct catcgacttg 3600

caggaactcg gcaaatatga gcagtacatt aaatggccgt ggtatatctg gctaggcttt 3660

atcgccggtc tgattgcaat tgtgatggtt actatcatgt tgtgctgcat gacaagttgc 3720

tgttcatgcc ttaaaggctg ctgctcctgc gggtcatgtt gtaaattcga tgaggacgac 3780

tctgagcccg tgctgaaagg ggtgaaactg cactacacg 3819

<210> 5

<211> 3819

<212> DNA

<213> Artificial Sequence

<220>

<223> S蛋白基因优化序列3 (STF)

<400> 5

atgttcgtgt tcctggtgct gctgcctctg gtgtccagcc agtgtgtgaa cctgaccacc 60

agaacacagc tgcctccagc ctacaccaac agctttacca gaggcgtgta ctaccccgac 120

aaggtgttca gatccagcgt gctgcactct acccaggacc tgttcctgcc tttcttcagc 180

aacgtgacct ggttccacgc catccacgtg tccggcacca atggcaccaa gagattcgac 240

aaccccgtgc tgcccttcaa cgacggggtg tactttgcca gcaccgagaa gtccaacatc 300

atcagaggct ggatcttcgg caccacactg gacagcaaga cccagagcct gctgatcgtg 360

aacaacgcca ccaacgtggt catcaaagtg tgcgagttcc agttctgcaa cgaccccttc 420

ctgggcgtct actaccacaa gaacaacaag agctggatgg aaagcgagtt ccgggtgtac 480

agcagcgcca acaactgcac cttcgagtac gtgtcccagc ctttcctgat ggacctggaa 540

ggcaagcagg gcaacttcaa gaacctgcgc gagttcgtgt ttaagaacat cgacggctac 600

ttcaagatct acagcaagca cacccctatc aacctcgtgc gggatctgcc tcagggcttc 660

tctgctctgg aacccctggt ggatctgccc atcggcatca acatcacccg gtttcagaca 720

ctgctggccc tgcacagaag ctacctgaca cctggcgata gcagcagcgg atggacagct 780

ggtgccgccg cttactatgt gggctacctg cagcctagaa ccttcctgct gaagtacaac 840

gagaacggca ccatcaccga cgccgtggat tgtgctctgg atcctctgag cgagacaaag 900

tgcaccctga agtccttcac cgtggaaaag ggcatctacc agaccagcaa cttccgggtg 960

cagcccaccg aatccatcgt gcggttcccc aatatcacca atctgtgccc cttcggcgag 1020

gtgttcaatg ccaccagatt cgcctctgtg tacgcctgga accggaagcg gatcagcaat 1080

tgcgtggccg actactccgt gctgtacaac tccgccagct tcagcacctt caagtgctac 1140

ggcgtgtccc ctaccaagct gaacgacctg tgcttcacaa acgtgtacgc cgacagcttc 1200

gtgatccggg gagatgaagt gcggcagatt gcccctggac agacaggcaa gatcgccgac 1260

tacaactaca agctgcccga cgacttcacc ggctgtgtga ttgcctggaa cagcaacaac 1320

ctggactcca aagtcggcgg caactacaat tacctgtacc ggctgttccg gaagtccaat 1380

ctgaagccct tcgagcggga catctccacc gagatctatc aggccggcag caccccttgt 1440

aacggcgtgg aaggcttcaa ctgctacttc ccactgcagt cctacggctt tcagcccaca 1500

aatggcgtgg gctatcagcc ctacagagtg gtggtgctga gcttcgaact gctgcatgcc 1560

cctgccacag tgtgcggccc taagaaaagc accaatctcg tgaagaacaa atgcgtgaac 1620

ttcaacttca acggcctgac cggcaccggc gtgctgacag agagcaacaa gaagttcctg 1680

ccattccagc agtttggccg ggatatcgcc gataccacag acgccgttag agatccccag 1740

acactggaaa tcctggacat caccccttgc agcttcggcg gagtgtctgt gatcacccct 1800

ggcaccaaca ccagcaatca ggtggcagtg ctgtaccagg acgtgaactg taccgaagtg 1860

cccgtggcca ttcacgccga tcagctgaca cctacatggc gggtgtactc caccggcagc 1920

aatgtgtttc agaccagagc cggctgtctg atcggagccg agcacgtgaa caatagctac 1980

gagtgcgaca tccccatcgg cgctggcatc tgtgccagct accagacaca gacaaacagc 2040

cccagacggg ccagatctgt ggccagccag agcatcattg cctacacaat gtctctgggc 2100

gccgagaaca gcgtggccta ctccaacaac tctatcgcta tccccaccaa cttcaccatc 2160

agcgtgacca cagagatcct gcctgtgtcc atgaccaaga ccagcgtgga ctgcaccatg 2220

tacatctgcg gcgattccac cgagtgctcc aacctgctgc tgcagtacgg cagcttctgc 2280

acccagctga atagagccct gacagggatc gccgtggaac aggacaagaa cacccaagag 2340

gtgttcgccc aagtgaagca gatctacaag acccctccta tcaaggactt cggcggcttc 2400

aatttcagcc agattctgcc cgatcctagc aagcccagca agcggagctt catcgaggac 2460

ctgctgttca acaaagtgac actggccgac gccggcttca tcaagcagta tggcgattgt 2520

ctgggcgaca ttgccgccag ggatctgatt tgcgcccaga agtttaacgg actgacagtg 2580

ctgcctcctc tgctgaccga tgagatgatc gcccagtaca catctgccct gctggccggc 2640

acaatcacaa gcggctggac atttggagct ggcgccgctc tgcagatccc ctttgctatg 2700

cagatggcct accggttcaa cggcatcgga gtgacccaga atgtgctgta cgagaaccag 2760

aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggacag cctgagcagc 2820

acagcaagcg ccctgggaaa gctgcaggac gtggtcaacc agaatgccca ggcactgaac 2880

accctggtca agcagctgtc ctccaacttc ggcgccatca gctctgtgct gaacgatatc 2940

ctgagcagac tggacaaggt ggaagccgag gtgcagatcg acagactgat caccggaagg 3000

ctgcagtccc tgcagaccta cgttacccag cagctgatca gagccgccga gattagagcc 3060

tctgccaatc tggccgccac caagatgtct gagtgtgtgc tgggccagag caagagagtg 3120

gacttttgcg gcaagggcta ccacctgatg agcttccctc agtctgcccc tcacggcgtg 3180

gtgtttctgc acgtgacata cgtgcccgct caagagaaga atttcaccac cgctccagcc 3240

atctgccacg acggcaaagc ccactttcct agagaaggcg tgttcgtgtc caacggcacc 3300

cattggttcg tgacccagcg gaacttctac gagccccaga tcatcaccac cgacaacacc 3360

ttcgtgtctg gcaactgcga cgtcgtgatc ggcattgtga acaataccgt gtacgaccct 3420

ctgcagcccg agctggacag cttcaaagag gaactggata agtactttaa gaaccacaca 3480

agccccgacg tggacctggg cgatatcagc ggaatcaatg ccagcgtcgt gaacatccag 3540

aaagagatcg accggctgaa cgaggtggcc aagaatctga acgagagcct gatcgacctg 3600

caagaactgg ggaagtacga gcagtacatc aagtggccct ggtacatctg gctgggcttt 3660

atcgccggac tgattgccat cgtgatggtc acaatcatgc tgtgttgcat gaccagctgc 3720

tgtagctgcc tgaagggctg ttgtagctgt ggcagctgct gcaagttcga cgaggacgat 3780

tctgagcccg tgctgaaggg cgtgaaactg cactacaca 3819

<210> 6

<211> 75

<212> PRT

<213> SARS-COV-2

<400> 6

Met Tyr Ser Phe Val Ser Glu Glu Thr Gly Thr Leu Ile Val Asn Ser

1 5 10 15

Val Leu Leu Phe Leu Ala Phe Val Val Phe Leu Leu Val Thr Leu Ala

20 25 30

Ile Leu Thr Ala Leu Arg Leu Cys Ala Tyr Cys Cys Asn Ile Val Asn

35 40 45

Val Ser Leu Val Lys Pro Ser Phe Tyr Val Tyr Ser Arg Val Lys Asn

50 55 60

Leu Asn Ser Ser Arg Val Pro Asp Leu Leu Val

65 70 75

<210> 7

<211> 228

<212> DNA

<213> SARS-COV-2

<400> 7

atgtactcat tcgtttcgga agagacaggt acgttaatag ttaatagcgt acttcttttt 60

cttgctttcg tggtattctt gctagttaca ctagccatcc ttactgcgct tcgattgtgt 120

gcgtactgct gcaatattgt taacgtgagt cttgtaaaac cttcttttta cgtttactct 180

cgtgttaaaa atctgaattc ttctagagtt cctgatcttc tggtctaa 228

<210> 8

<211> 225

<212> DNA

<213> Artificial Sequence

<220>

<223> E蛋白基因优化序列(EBL)

<400> 8

atgtacagct ttgtctcaga ggaaaccggc acgctgattg taaacagcgt gttactattc 60

ctcgccttcg ttgtgtttct ccttgttaca ctggcaatac tgactgccct gcggttgtgc 120

gcttactgct gtaatatcgt gaacgtgtct ttggtgaagc ccagtttcta tgtatattcc 180

agagtcaaaa atctcaactc ctctagggtg cctgacctgc ttgtc 225

<210> 9

<211> 222

<212> PRT

<213> SARS-COV-2

<400> 9

Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu

1 5 10 15

Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp Ile

20 25 30

Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr Ile

35 40 45

Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys

50 55 60

Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly Ile

65 70 75 80

Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr Phe

85 90 95

Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe

100 105 110

Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr Ile

115 120 125

Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val Ile

130 135 140

Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys Asp

145 150 155 160

Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu

165 170 175

Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser Gly

180 185 190

Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr

195 200 205

Asp His Ser Ser Ser Ser Asp Asn Ile Ala Leu Leu Val Gln

210 215 220

<210> 10

<211> 669

<212> DNA

<213> SARS-COV-2

<400> 10

atggcagatt ccaacggtac tattaccgtt gaagagctta aaaagctcct tgaacaatgg 60

aacctagtaa taggtttcct attccttaca tggatttgtc ttctacaatt tgcctatgcc 120

aacaggaata ggtttttgta tataattaag ttaattttcc tctggctgtt atggccagta 180

actttagctt gttttgtgct tgctgctgtt tacagaataa attggatcac cggtggaatt 240

gctatcgcaa tggcttgtct tgtaggcttg atgtggctca gctacttcat tgcttctttc 300

agactgtttg cgcgtacgcg ttccatgtgg tcattcaatc cagaaactaa cattcttctc 360

aacgtgccac tccatggcac tattctgacc agaccgcttc tagaaagtga actcgtaatc 420

ggagctgtga tccttcgtgg acatcttcgt attgctggac accatctagg acgctgtgac 480

atcaaggacc tgcctaaaga aatcactgtt gctacatcac gaacgctttc ttattacaaa 540

ttgggagctt cgcagcgtgt agcaggtgac tcaggttttg ctgcatacag tcgctacagg 600

attggcaact ataaattaaa cacagaccat tccagtagca gtgacaatat tgctttgctt 660

gtacagtaa 669

<210> 11

<211> 669

<212> DNA

<213> Artificial Sequence

<220>

<223> M蛋白基因优化序列MBL

<400> 11

atggcagatt ccaacggtac aattaccgtc gaagagctga aaaagctcct tgagcagtgg 60

aacctggtca tagggttcct attcctgaca tggatttgcc tgctgcaatt tgcctatgcc 120

aacaggaata ggtttttgta tataatcaag ctgattttcc tctggctgtt atggccagtg 180

accctggcct gttttgtgct tgccgctgtt tacagaataa attggatcac cggcggaatc 240

gccatcgcaa tggcttgcct tgtaggcttg atgtggctca gctacttcat tgcttctttc 300

cggctgtttg cgcgaacgcg gtccatgtgg tctttcaatc cggagactaa catactcctc 360

aatgtgcccc tccatggcac tattctgacc agacccctgc tagagagtga actcgtcatc 420

ggagctgtga tcctgcgggg gcacctgaga atcgccggac accacttagg ccgctgtgac 480

atcaaggatc tgcctaaaga aatcactgtt gccacatcac gaaccctttc ttattacaag 540

ttgggggcct cgcagcgtgt ggcaggagac tcaggttttg cggcatacag tcgctacagg 600

attggcaact ataaattaaa cacagaccat tccagcagca gcgataatat tgctttgctt 660

gtgcagtga 669

<210> 12

<211> 419

<212> PRT

<213> SARS-COV-2

<400> 12

Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr

1 5 10 15

Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg

20 25 30

Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn

35 40 45

Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu

50 55 60

Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro

65 70 75 80

Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly

85 90 95

Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr

100 105 110

Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp

115 120 125

Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp

130 135 140

His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln

145 150 155 160

Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser

165 170 175

Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn

180 185 190

Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala

195 200 205

Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu

210 215 220

Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln

225 230 235 240

Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys

245 250 255

Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln

260 265 270

Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp

275 280 285

Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile

290 295 300

Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg Ile

305 310 315 320

Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Gly Ala

325 330 335

Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile Leu

340 345 350

Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu Pro

355 360 365

Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro Gln

370 375 380

Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp Leu

385 390 395 400

Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp Ser

405 410 415

Thr Gln Ala

<210> 13

<211> 1257

<212> DNA

<213> Artificial Sequence

<220>

<223> N蛋白基因优化序列NBL

<400> 13

atgtcagata acggaccgca gaaccaaagg aacgcccctc ggatcacttt cgggggtcct 60

agcgacagca ctgggtctaa ccaaaatgga gaacgttccg gcgcaagatc caaacagagg 120

aggcctcagg ggcttcctaa caatacagcc tcctggttca cagctctcac acagcatggc 180

aaggaagacc tgaagtttcc tagaggccag ggggttccca tcaatactaa ctcctcccca 240

gacgatcaga ttggttatta tcggcgggct accaggcgga tccggggcgg agacggtaag 300

atgaaggacc tctctccccg ttggtacttt tactacctcg gtacaggccc cgaggctggg 360

cttccgtatg gcgccaataa ggatggaata atttgggtgg ctacggaagg ggccctcaac 420

acaccgaagg atcacattgg cacccgtaat cccgcgaata atgccgccat tgtcctgcag 480

ttgccccagg ggacgacgtt gcccaaaggc ttttacgcag aaggatcgcg cggaggatcc 540

caagcctcca gccgatcaag ctctcgatct cggaactcaa gtcgcaatag cacaccaggg 600

tcttctcgcg ggaccagccc tgcaaggatg gccggaaacg gcggtgatgc tgctttagcg 660

ctgctgctgc tggatagact gaaccaatta gagagtaaaa tgtcaggtaa aggccagcaa 720

cagcaggggc agacagtgac caaaaaaagt gcggccgagg ccagcaagaa accccgccag 780

aaacgaacag ccactaaagc ctacaacgta acccaagcat tcggaaggag aggaccagag 840

cagacccaag gcaattttgg cgatcaagag ctgatccgcc aggggacgga ctataagcat 900

tggccacaga tcgcccagtt cgcacccagt gcttcagcct tcttcggaat gtcgagaatc 960

ggtatggagg tcactccttc tggcacttgg ctgacttata ccggcgcaat aaagctagac 1020

gacaaagacc ctaactttaa ggatcaggtg atcctgctaa ataaacacat tgatgcgtac 1080

aaaacattcc caccaactga gccaaagaag gacaagaaga agaaggcaga tgaaacccag 1140

gctttgcccc agagacagaa aaagcagcag accgtgacct tgctgccagc agccgacctc 1200

gacgattttt caaagcaact tcagcagtcc atgagtagcg ctgacagcac ccaggct 1257

<210> 14

<211> 1257

<212> DNA

<213> SARS-COV-2

<400> 14

atgtctgata atggacccca aaatcagcga aatgcacccc gcattacgtt tggtggaccc 60

tcagattcaa ctggcagtaa ccagaatgga gaacgcagtg gggcgcgatc aaaacaacgt 120

cggccccaag gtttacccaa taatactgcg tcttggttca ccgctctcac tcaacatggc 180

aaggaagacc ttaaattccc tcgaggacaa ggcgttccaa ttaacaccaa tagcagtcca 240

gatgaccaaa ttggctacta ccgaagagct accagacgaa ttcgtggtgg tgacggtaaa 300

atgaaagatc tcagtccaag atggtatttc tactacctag gaactgggcc agaagctgga 360

cttccctatg gtgctaacaa agacggcatc atatgggttg caactgaggg agccttgaat 420

acaccaaaag atcacattgg cacccgcaat cctgctaaca atgctgcaat cgtgctacaa 480

cttcctcaag gaacaacatt gccaaaaggc ttctacgcag aagggagcag aggcggcagt 540

caagcctctt ctcgttcctc atcacgtagt cgcaacagtt caagaaattc aactccaggc 600

agcagtaggg gaacttctcc tgctagaatg gctggcaatg gcggtgatgc tgctcttgct 660

ttgctgctgc ttgacagatt gaaccagctt gagagcaaaa tgtctggtaa aggccaacaa 720

caacaaggcc aaactgtcac taagaaatct gctgctgagg cttctaagaa gcctcggcaa 780

aaacgtactg ccactaaagc atacaatgta acacaagctt tcggcagacg tggtccagaa 840

caaacccaag gaaattttgg ggaccaggaa ctaatcagac aaggaactga ttacaaacat 900

tggccgcaaa ttgcacaatt tgcccccagc gcttcagcgt tcttcggaat gtcgcgcatt 960

ggcatggaag tcacaccttc gggaacgtgg ttgacctaca caggtgccat caaattggat 1020

gacaaagatc caaatttcaa agatcaagtc attttgctga ataagcatat tgacgcatac 1080

aaaacattcc caccaacaga gcctaaaaag gacaaaaaga agaaggctga tgaaactcaa 1140

gccttaccgc agagacagaa gaaacagcaa actgtgactc ttcttcctgc tgcagatttg 1200

gatgatttct ccaaacaatt gcaacaatcc atgagcagtg ctgactcaac tcaggcc 1257

<210> 15

<211> 46

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'UTR

<400> 15

ggaaataaga gagaaaagaa gagtaagaag aaatataaga gccacc 46

<210> 16

<211> 110

<212> DNA

<213> Artificial Sequence

<220>

<223> 3'UTR-1

<400> 16

gctggagcct cggtggccat gcttcttgcc ccttgggcct ccccccagcc cctcctcccc 60

ttcctgcacc cgtacccccg tggtctttga ataaagtctg agtgggcggc 110

<210> 17

<211> 109

<212> DNA

<213> Artificial Sequence

<220>

<223> 3'UTR-2

<400> 17

gcggccgctt aattaagctg ccttctgcgg ggcttgcctt ctggccatgc ccttcttctc 60

tcccttgcac ctgtacctct tggtctttga ataaagcctg agtaggaag 109

<210> 18

<211> 3819

<212> RNA

<213> Artificial Sequence

<220>

<223> S蛋白基因优化后的mRNA序列1(S-GS mRNA)

<400> 18

uacaagcaga aggaccagga cgacggagac cagaggagug ucacgcaguu agacuguuga 60

gccugagucg acgguggacg aauaugauua ucgaaguggu cuccgcacau gauaggacug 120

uuccacaaau cuucgaggca cgacgugaga uguguccuag acaaagacgg uaagaaaucg 180

uugcacugga ccaaggugcg guaggugcac ucgccguggu uaccguguuu cgccaagcug 240

uuagggcacg acggaaaauu gcuaccgcac augaagcgga gauggcucuu cucguuguag 300

uagucuccga ccuagaaacc guggugugac cugagguucu gugucagaga cgacuagcac 360

uuguuacggu gguugcacca guaguuccac acgcucaagg ucaaaacauu acuagggaag 420

gacccgcaca ugauaguguu cuuguuauuc ucgaccuacc ucaggcucaa aucucacaua 480

agaucgcggu uguugacgug uaaacucaug cacucggucg gaaaggacua ccuggaccuc 540

ccguucgucc cguuaaaguu cuuggacucc cucaagcaca aauucuuaua gcugccgaug 600

aaguuuuaga ugagauucgu guggggguag uuggaccacg cgcuggacgg agucccgaag 660

ucgcgggacc ucggggacca ccuagacgga uagccguagu uguagugggc caaagucugu 720

gacgaccggg acgugucuuc gauggacugu gggccgcuga ggagaucgcc uaccuggcgg 780

ccgcgacggc ggaugauaca cccgauggag gucggggccu ggaaggacga cuucauguug 840

cucuuaccgu gguagugucu gcgucaccua acgcgggacc ugggggacuc gcucuguuuc 900

acaugugacu ucaggaaaug gcaccucuuc ccguagauag ucuguagguu aaagucccac 960

gucgguuggc ucagauagca cgcgaaagga uuauaguguu uggacacggg uaaaccgcuc 1020

cacaaguugc guugggcgaa gcggucgcac augcggaccu uauccuucgc cuagucguug 1080

acgcaccggc ugauaucgca cgacauguug aggcggagaa agucguggaa auucacgaua 1140

ccgcacaggg gguguuucga cuuacuggac acgaaauggu ugcagaugcg gcuaagaaag 1200

cacuaguccc cgcugcucca cgcggucuag cgggggccgg ucuguccguu cuagcgucug 1260

auguuaauau ucgacggucu gcuaaagugg ccgacgcacu agcggaccuu gucguuguua 1320

gaccuaaggu uucacccgcc guugauguua auagacaugg ccgacaaauc uuucucguua 1380

gacuucggga agcucucccu guagagaugu cuuuagaugg uccggccguc guggggaacg 1440

uuaccgcacc ucccgaaauu gacaauaaag ggugagguca ggaugccgaa ggucgggugu 1500

uugccgcacc cgauagucgg aauggcgcac caccacgacu cgaaacucga cgacgugcgg 1560

ggucguuguc acacgccggg guucuucagg ugguuagacc acuucuuguu cacgcacuug 1620

aaguugaagu ugccggacug gccguguccg cacgacuggc ucagguuguu cuucaaggac 1680

gguaaagucg ucaagccguc ccuguagcgu cuaugguguc ugcggcacgc gcuggguguc 1740

ugggaccucu aggaccugua gugugggacg agaaagccgc cgcacucgca cuaguguggg 1800

ccgugguuau guucguuggu ccaccggcac gacauagucc ugcacuuaac auggcuccac 1860

gggcaccgau aggugcggcu agucgacugg gguuguaccg cccacauguc guggccgagg 1920

uugcagaagg ucuguucucg gccuacggac uagccucguc ucgugcacuu guuaaggaua 1980

cucacgcugu aggguuagcc gcggccguag acacggagaa uggucugggu cuguuugaga 2040

gggucuucuc gggccucgca ccggaggguc agauaguagc ggauauggua cagggacccg 2100

cggcucuugu cgcaccggau gagauuguua ucguagcggu aggguugguu gaaguguuag 2160

agacacuggu gucucuagga cgggcacagg uacugguucu guagacaccu gacguguuac 2220

auauagacac cgcuaagaug gcucacgucg uuggacgacg aggucaugcc gucgaaaaca 2280

ugggucgacu uaucucggga cuguccguag cggcaccucg uccuauucuu guguguccuc 2340

cacaagcggg uccacuucgu uuagauguuc ugggggggau aguuccugaa accgccgaag 2400

uuaaaaaggg ucuaggacgg acuagguagg uucggaagau ucgccucgaa auagcuccug 2460

gacgacaagu uguuccacug ggaccggcua cggccgaagu aguucgucau accgcuaacg 2520

gacccgcugu agcgucgguc ccuggacuag acgcgggucu ucaaauuacc ggacuggcac 2580

gacggugggg acgacugucu acucuacuag cgugucaugu guucgcggga cgaccggccg 2640

ugguagugua ggccuaccug gaagccgcgu ccucggcggg aggucuaggg gaaacgguac 2700

gucuaccgga uauccaaguu gccguagccg cacugggucu uacacgacau gcucuugguc 2760

uucgacuagc gguuagucaa auugaggcgg uagccguucu agguccuguc ggacaggaga 2820

ugucggucgc gggacccguu cgagguccua caccacuuag ucuugcgggu ccgggacuua 2880

ugggaccacu ucgucgacuc gucguugaag ccgcgguaga gaucgcacga cuuacuguag 2940

gacucggccg accuguucca ccuccgucuc cacgucuagc uggccgacua guggccggcc 3000

gaggucucgg aggucuggau acacuguguc gucgacuagu cccggcggcu cuagucccgg 3060

ucgcgguuag accgucguug guucuacagg cucacgcacg acccggucag auucucucac 3120

cugaaaacac cguucccgau aguggacuac aggaagggag ucagacgggg ugugccgcac 3180

cacaaagacg ugcacuggau gcacgggcgg guccucuucu ugaaguggug ucggggacgg 3240

uagacggugc uaccguuccg ggugaaaggu ucccucccgc acaagcacag guugccgugg 3300

gugaccaaac acugugucgc guuaaagaug cucggggucu aguaguggug ucuguugugg 3360

aagcacucgc cguugacacu gcaccaguag ccguagcacu uguuauggca cauacuaggu 3420

gaggucgggc ucgaccuguc gaaauuccuc cucgaccuau ucauaaaguu cuuagugugg 3480

aggggacugc accuagaccc gcuguagucg ccguaguuac ggaggcacca cuuguagguc 3540

uuccucuagc uggcggacuu gcuccaccga uucuuagacu ugcucucgga cuagcuggag 3600

guccucgacc cguucauacu cgucauguag uucaccggga ccauguagac cgacccgaag 3660

uagcggccgg acuagcggua gcacuaccac ugguaguacg acacgacaua cuguaggacg 3720

acaagaacgg acuucccgac gacaucgaca ccgaggacga cauucaaacu gcuccuacug 3780

agacuuggac acgacuuccc gcacuucgac guaaugugg 3819

<210> 19

<211> 3819

<212> RNA

<213> Artificial Sequence

<220>

<223> S蛋白基因优化后的mRNA序列(SBLmRNA)

<400> 19

uacaagcaaa aggagcaaga cgacggagaa cagucgagag ucacacacuu ggacuguuga 60

ucuuguguug auggagggcg gauguguuug agaaaguggg ccccgcacau gaugggucug 120

uuucacaagu ccucgagaca caacgugucg uggguucugg acaaaaacgg uaagaaauca 180

uuacacugga ccaaagugcg auagguacac agcccguggu ugcccugguu uucuaagcug 240

uuggggcaag acggcaaguu gcugccgcac augaagcgau cgugacucuu cagguuguaa 300

uaagcgccca ccuagaagcc uugauggaac cugagguuuu gugucagaga ugaguagcac 360

uuguugcgcu gauugcacca cuaauuccac acacuuaaag ucaagacguu acuagguaaa 420

aauccucaca ugaugguguu uuuauuauuu aguaccuacc ucagacuuaa agcgcauaug 480

ucaucgcgau uauugacaug uaagcuuaua caaucgguug ggaaaaacua ccugaaucuc 540

cccuucgucc cuuuaaaauu cuuaaacgcu cuuaaacaca aguuuuuaua gcuacccaua 600

aaguucuaga ugagguucgu augaggguau uuagaccacg cgcugaaugg aguucccaag 660

ucgcgugacc ucggugacca ucuggacggu uagccguagu uguagugggc uaaggucugg 720

gacgaacgag acguggcaag uauagacugu gguccucuaa gcagaaggcc uaccugucgu 780

ccccggcgac gaaugauaca accaauagaa gucggagccu ggaaagacga guucauauua 840

cucuuacccu gguaauggcu gcgacaacua acacgagagc uaggggacag ucuuugguuc 900

acgugugauu uuagaaagug ucagcuuuuc cccuagaugg ucugaagauu gaaagcacau 960

gucggguggc ucucguagca guccaagggu uuauagugau uggacacagg aaaaccgcuc 1020

cacaaguugc gauguucuaa acgaucgcac augcggaccu ugucuuuuuc uuauaguuua 1080

acgcaucggc uaaugucgca gaauauauug agacguagga agaguugaaa guucacaaua 1140

ccucacucgg gcugauucga cuuacuaaac acgaaauguu uacaaauacg gcuaaguaag 1200

cacuaggccc cgcugcucca gucugucuag cggggaccgg uuuguccauu cuaacgacua 1260

auguugaugu uuaauggacu gcuaaaaugu ccuacgcaau agcgaaccuu gagauuguua 1320

gagcuaagau uccagccgcc guuaauguua auagaaauag cggaaaaguc cuucaguuua 1380

gaauucggua agcucgcucu guagucaugg cucuauaugg uccgccccag guggggcaca 1440

uugccacagc ucccaaaguu gacgaugaaa ggugacguca ggauacccaa ggucgggugg 1500

uuaccgcacc caauggucgg gauggcucau cagcauaaca gaaaacucga gaacgugcgg 1560

gggcggugcc acacgccagg uuucuuuagu ugauugaauc aauucuuauu uacacacuua 1620

aaauugaaau ugccggacug ucccuguccu caggacuguc uuagguuauu cuucaaggaa 1680

gggaaagucg ucaaaccugc gcuguagcgu cugugguguc ugcggcacgc acuggggguu 1740

ugagagcuuu aagagcuaua gugugggacg ucaaaaccgc cccagucaca guaaugggga 1800

cccugguuau gaucauuggu ccagcgucac gaaaugguuc uacaguugac auggcuccaa 1860

ggacaccgau aagugcgucu gguugacuga ggcugaaccg cccacauauc auguccgagg 1920

uuacacaaag ucugggcccg uccgacggac uaaccccggc ucguacauuu auugaggaug 1980

cucacgcuau agggguaucc acgaccuuau acacggucaa uagucugggu cugcuugagc 2040

gguucugcuc gauccaggca ucggagaguc ucguauuagc gcaugugaua cucggacccc 2100

cggcuuuuaa ggcaccguau aucguuguug ucguaacgau aaggaugauu gaaauguuaa 2160

agucagugcu gccucuagga cggucagagg uacugauuuu ggaggcaccu gacaugcuac 2220

auguaaacac cgcuaaguug acuuacgaga uuggacgaga augucaugcc aagaaaaaca 2280

ugggucgacu uggcccguaa cugcccguag cgucaacucg uccuguucuu augaguccuc 2340

cacaaacgcg uucacuucgu uuaaauauuu ugaggagggu aauuccugaa accgccaaag 2400

uugaagagcg ucuaggaugg acuggguagu uuuggaucgu ucuccagaaa guaacuucug 2460

gaagacaagu uguuccagug ugaccgacug cggccgaagu aauuugucau gccucuaaca 2520

gauccacuau aacgucgcgc gcuagacuaa acgcgugucu ucaaauugcc ggacugccag 2580

aaugggggag aggaauggcu gcuuuacuaa cgggucaugu ggucgcggga cgagcgaccg 2640

ugcuaaugau cgccuaccug uaaaccccgg ccgcgacggg aggucuaugg uaaacgguac 2700

gucuaccgca uauccaaauu gccguauccu cauugggucu ugcacgacau gcucuugguu 2760

uuugacuauc gguuaguuaa guuaucacgg uauccuuucu auguccuguc agagucgucg 2820

uggcgcaggc gagagccuuu cgauguucua caccaguugg ucuugcgcgu ccguaacuua 2880

ugugaccacu ucgucgagag gagcuuaaaa ccucguuagu cgucgcacga cuuacuauag 2940

gacagagccg accuguucca acuucggcuu caggucuagc uguccaauua guggccagcc 3000

gacgucucag aggucuguau acaauggguc guugaguagu cucgacggcu uuaugcgcgg 3060

ucacgguuag aacgucggug auucuacagg cucacgcaca accccguuuc auuuucccaa 3120

cuaaagacac cuuuuccuau aguagaauac ucaaagggag uuaggcgggg agugccucaa 3180

cagaaggacg uacacuggau gcacggucgc guccucuucu ugaagugcug gcgggggcgg 3240

uagacgguac uaccguuccg gguaaaaggg gcgcuuccuc acaagcauag guuaccgugg 3300

gugaccaagc acugcgucuc uuuaaaaaua cucggcguuu aauagugaug gcuguugugu 3360

aagcaaaggc cguuaacgcu acagcauuag cccuagcacu uauuauguca gauacuagga 3420

gaagucgguc uugagcuaag uaaguuucuc cucgaccuau uuauaaaguu cuuggugugg 3480

agggggcuac accuagaccc acuguauagu ccuuaauugc guucgcagca cuuguaaguc 3540

uuccuuuagc uguccgaguu acuucaucgu uucuugaacu uacucagaga guagcugaac 3600

guccuugagc cguuuauacu cgucauguaa uuuaccggca ccauauagac cgauccgaaa 3660

uagcggccag acuaacguua acacuaccaa ugauaguaca acacgacgua cuguucaacg 3720

acaaguacgg aauuuccgac gacgaggacg cccaguacaa cauuuaagcu acuccugcug 3780

agacucgggc acgacuuucc ccacuuugac gugaugugc 3819

<210> 20

<211> 3819

<212> RNA

<213> Artificial Sequence

<220>

<223> S蛋白基因优化mRNA序列3(STF mRNA)

<400> 20

uacaagcaca aggaccacga cgacggagac cacaggucgg ucacacacuu ggacuggugg 60

ucuugugucg acggaggucg gaugugguug ucgaaauggu cuccgcacau gauggggcug 120

uuccacaagu cuaggucgca cgacgugaga uggguccugg acaaggacgg aaagaagucg 180

uugcacugga ccaaggugcg guaggugcac aggccguggu uaccgugguu cucuaagcug 240

uuggggcacg acgggaaguu gcugccccac augaaacggu cguggcucuu cagguuguag 300

uagucuccga ccuagaagcc guggugugac cugucguucu gggucucgga cgacuagcac 360

uuguugcggu gguugcacca guaguuucac acgcucaagg ucaagacguu gcuggggaag 420

gacccgcaga ugaugguguu cuuguuguuc ucgaccuacc uuucgcucaa ggcccacaug 480

ucgucgcggu uguugacgug gaagcucaug cacagggucg gaaaggacua ccuggaccuu 540

ccguucgucc cguugaaguu cuuggacgcg cucaagcaca aauucuugua gcugccgaug 600

aaguucuaga ugucguucgu guggggauag uuggagcacg cccuagacgg agucccgaag 660

agacgagacc uuggggacca ccuagacggg uagccguagu uguagugggc caaagucugu 720

gacgaccggg acgugucuuc gauggacugu ggaccgcuau cgucgucgcc uaccugucga 780

ccacggcggc gaaugauaca cccgauggac gucggaucuu ggaaggacga cuucauguug 840

cucuugccgu gguaguggcu gcggcaccua acacgagacc uaggagacuc gcucuguuuc 900

acgugggacu ucaggaagug gcaccuuuuc ccguagaugg ucuggucguu gaaggcccac 960

gucggguggc uuagguagca cgccaagggg uuauaguggu uagacacggg gaagccgcuc 1020

cacaaguuac gguggucuaa gcggagacac augcggaccu uggccuucgc cuagucguua 1080

acgcaccggc ugaugaggca cgacauguug aggcggucga agucguggaa guucacgaug 1140

ccgcacaggg gaugguucga cuugcuggac acgaaguguu ugcacaugcg gcugucgaag 1200

cacuaggccc cucuacuuca cgccgucuaa cggggaccug ucuguccguu cuagcggcug 1260

auguugaugu ucgacgggcu gcugaagugg ccgacacacu aacggaccuu gucguuguug 1320

gaccugaggu uucagccgcc guugauguua auggacaugg ccgacaaggc cuucagguua 1380

gacuucggga agcucgcccu guagaggugg cucuagauag uccggccguc guggggaaca 1440

uugccgcacc uuccgaaguu gacgaugaag ggugacguca ggaugccgaa agucgggugu 1500

uuaccgcacc cgauagucgg gaugucucac caccacgacu cgaagcuuga cgacguacgg 1560

ggacgguguc acacgccggg auucuuuucg ugguuagagc acuucuuguu uacgcacuug 1620

aaguugaagu ugccggacug gccguggccg cacgacuguc ucucguuguu cuucaaggac 1680

gguaaggucg ucaaaccggc ccuauagcgg cuaugguguc ugcggcaauc ucuagggguc 1740

ugugaccuuu aggaccugua guggggaacg ucgaagccgc cucacagaca cuagugggga 1800

ccgugguugu ggucguuagu ccaccgucac gacauggucc ugcacuugac auggcuucac 1860

gggcaccggu aagugcggcu agucgacugu ggauguaccg cccacaugag guggccgucg 1920

uuacacaaag ucuggucucg gccgacagac uagccucggc ucgugcacuu guuaucgaug 1980

cucacgcugu agggguagcc gcgaccguag acacggucga uggucugugu cuguuugucg 2040

gggucugccc ggucuagaca ccggucgguc ucguaguaac ggauguguua cagagacccg 2100

cggcucuugu cgcaccggau gagguuguug agauagcgau aggggugguu gaagugguag 2160

ucgcacuggu gucucuagga cggacacagg uacugguucu ggucgcaccu gacgugguac 2220

auguagacgc cgcuaaggug gcucacgagg uuggacgacg acgucaugcc gucgaagacg 2280

ugggucgacu uaucucggga cugucccuag cggcaccuug uccuguucuu guggguucuc 2340

cacaagcggg uucacuucgu cuagauguuc uggggaggau aguuccugaa gccgccgaag 2400

uuaaagucgg ucuaagacgg gcuaggaucg uucgggucgu ucgccucgaa guagcuccug 2460

gacgacaagu uguuucacug ugaccggcug cggccgaagu aguucgucau accgcuaaca 2520

gacccgcugu aacggcgguc ccuagacuaa acgcgggucu ucaaauugcc ugacugucac 2580

gacggaggag acgacuggcu acucuacuag cgggucaugu guagacggga cgaccggccg 2640

uguuaguguu cgccgaccug uaaaccucga ccgcggcgag acgucuaggg gaaacgauac 2700

gucuaccgga uggccaaguu gccguagccu cacugggucu uacacgacau gcucuugguc 2760

uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuguc ggacucgucg 2820

ugucguucgc gggacccuuu cgacguccug caccaguugg ucuuacgggu ccgugacuug 2880

ugggaccagu ucgucgacag gagguugaag ccgcgguagu cgagacacga cuugcuauag 2940

gacucgucug accuguucca ccuucggcuc cacgucuagc ugucugacua guggccuucc 3000

gacgucaggg acgucuggau gcaauggguc gucgacuagu cucggcggcu cuaaucucgg 3060

agacgguuag accggcggug guucuacaga cucacacacg acccggucuc guucucucac 3120

cugaaaacgc cguucccgau gguggacuac ucgaagggag ucagacgggg agugccgcac 3180

cacaaagacg ugcacuguau gcacgggcga guucucuucu uaaaguggug gcgaggucgg 3240

uagacggugc ugccguuucg ggugaaagga ucucuuccgc acaagcacag guugccgugg 3300

guaaccaagc acugggucgc cuugaagaug cucggggucu aguaguggug gcuguugugg 3360

aagcacagac cguugacgcu gcagcacuag ccguaacacu uguuauggca caugcuggga 3420

gacgucgggc ucgaccuguc gaaguuucuc cuugaccuau ucaugaaauu cuuggugugu 3480

ucggggcugc accuggaccc gcuauagucg ccuuaguuac ggucgcagca cuuguagguc 3540

uuucucuagc uggccgacuu gcuccaccgg uucuuagacu ugcucucgga cuagcuggac 3600

guucuugacc ccuucaugcu cgucauguag uucaccggga ccauguagac cgacccgaaa 3660

uagcggccug acuaacggua gcacuaccag uguuaguacg acacaacgua cuggucgacg 3720

acaucgacgg acuucccgac aacaucgaca ccgucgacga cguucaagcu gcuccugcua 3780

agacucgggc acgacuuccc gcacuuugac gugaugugu 3819

<210> 21

<211> 225

<212> RNA

<213> Artificial Sequence

<220>

<223> E蛋白基因优化mRNA序列(EBL mRNA)

<400> 21

uacaugucga aacagagucu ccuuuggccg ugcgacuaac auuugucgca caaugauaag 60

gagcggaagc aacacaaaga ggaacaaugu gaccguuaug acugacggga cgccaacacg 120

cgaaugacga cauuauagca cuugcacaga aaccacuucg ggucaaagau acauauaagg 180

ucucaguuuu uagaguugag gagaucccac ggacuggacg aacag 225

<210> 22

<211> 669

<212> RNA

<213> Artificial Sequence

<220>

<223> M蛋白基因优化mRNA序列(MBL mRNA)

<400> 22

uaccgucuaa gguugccaug uuaauggcag cuucucgacu uuuucgagga acucgucacc 60

uuggaccagu aucccaagga uaaggacugu accuaaacgg acgacguuaa acggauacgg 120

uuguccuuau ccaaaaacau auauuaguuc gacuaaaagg agaccgacaa uaccggucac 180

ugggaccgga caaaacacga acggcgacaa augucuuauu uaaccuagug gccgccuuag 240

cgguagcguu accgaacgga acauccgaac uacaccgagu cgaugaagua acgaagaaag 300

gccgacaaac gcgcuugcgc cagguacacc agaaaguuag gccucugauu guaugaggag 360

uuacacgggg agguaccgug auaagacugg ucuggggacg aucucucacu ugagcaguag 420

ccucgacacu aggacgcccc cguggacucu uagcggccug uggugaaucc ggcgacacug 480

uaguuccuag acggauuucu uuagugacaa cgguguagug cuugggaaag aauaauguuc 540

aacccccgga gcgucgcaca ccguccucug aguccaaaac gccguauguc agcgaugucc 600

uaaccguuga uauuuaauuu gugucuggua aggucgucgu cgcuauuaua acgaaacgaa 660

cacgucacu 669

<210> 23

<211> 1257

<212> RNA

<213> Artificial Sequence

<220>

<223> N蛋白基因优化mRNA序列(NBL mRNA)

<400> 23

uacagucuau ugccuggcgu cuugguuucc uugcggggag ccuagugaaa gcccccagga 60

ucgcugucgu gacccagauu gguuuuaccu cuugcaaggc cgcguucuag guuugucucc 120

uccggagucc ccgaaggauu guuaugucgg aggaccaagu gucgagagug ugucguaccg 180

uuccuucugg acuucaaagg aucuccgguc ccccaagggu aguuaugauu gaggaggggu 240

cugcuagucu aaccaauaau agccgcccga ugguccgccu aggccccgcc ucugccauuc 300

uacuuccugg agagaggggc aaccaugaaa augauggagc cauguccggg gcuccgaccc 360

gaaggcauac cgcgguuauu ccuaccuuau uaaacccacc gaugccuucc ccgggaguug 420

uguggcuucc uaguguaacc gugggcauua gggcgcuuau uacggcggua acaggacguc 480

aacggggucc ccugcugcaa cggguuuccg aaaaugcguc uuccuagcgc gccuccuagg 540

guucggaggu cggcuaguuc gagagcuaga gccuugaguu cagcguuauc gugugguccc 600

agaagagcgc ccuggucggg acguuccuac cggccuuugc cgccacuacg acgaaaucgc 660

gacgacgacg accuaucuga cuugguuaau cucucauuuu acaguccauu uccggucguu 720

gucguccccg ucugucacug guuuuuuuca cgccggcucc ggucguucuu uggggcgguc 780

uuugcuuguc ggugauuucg gauguugcau uggguucgua agccuuccuc uccuggucuc 840

gucuggguuc cguuaaaacc gcuaguucuc gacuaggcgg uccccugccu gauauucgua 900

accggugucu agcgggucaa gcguggguca cgaagucgga agaagccuua cagcucuuag 960

ccauaccucc agugaggaag accgugaacc gacugaauau ggccgcguua uuucgaucug 1020

cuguuucugg gauugaaauu ccuaguccac uaggacgauu uauuugugua acuacgcaug 1080

uuuuguaagg gugguugacu cgguuucuuc cuguucuucu ucuuccgucu acuuuggguc 1140

cgaaacgggg ucucugucuu uuucgucguc uggcacugga acgacggucg ucggcuggag 1200

cugcuaaaaa guuucguuga agucgucagg uacucaucgc gacugucgug gguccga 1257

<210> 24

<211> 3819

<212> DNA

<213> Artificial Sequence

<220>

<223> SDC50

<400> 24

atgttcgtgt ttctggtgct gctgcctctg gtgtcttctc agtgtgtgaa tctgacaaca 60

agaacacagc tgcctcctgc ctacaccaac agctttacaa gaggagtgta ctaccctgac 120

aaggtgttca gaagcagcgt gctgcattct acacaggacc tgtttctgcc tttcttcagc 180

aacgtgacct ggtttcacgc cattcacgtg tctggcacaa atggaaccaa gaggttcgac 240

aatcctgtgc tgcctttcaa cgatggcgtg tactttgcct ctaccgagaa gagcaacatc 300

atcagaggct ggatctttgg caccacactg gatagcaaga cacagtctct gctgatcgtg 360

aacaatgcca ccaacgtggt gatcaaggtg tgtgagttcc agttctgcaa cgaccctttt 420

ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt cagagtgtac 480

agctctgcca acaattgcac ctttgagtac gtgagccagc ctttcctgat ggatctggaa 540

ggaaagcagg gcaatttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600

ttcaagatct acagcaagca cacccccatc aatctggtga gagatctgcc tcagggattt 660

tctgctctgg aacctctggt ggatctgcct attggcatca acatcaccag attccagaca 720

ctgctggctc tgcacagatc ttacctgaca cctggagatt cttcttctgg atggacagct 780

ggagctgctg cttattacgt gggctatctg cagcctagaa ccttcctgct gaagtacaac 840

gagaatggca ccatcacaga tgctgtggat tgtgctctgg atcctctgtc tgagaccaag 900

tgtacactga agagcttcac agtggagaag ggcatctacc agaccagcaa tttcagagtg 960

cagcctacag agagcatcgt gagattcccc aacatcacca atctgtgccc ttttggagag 1020

gtgttcaatg ccaccagatt tgcctctgtg tacgcctgga acagaaagag gatcagcaac 1080

tgtgtggccg attactctgt gctgtacaac tctgccagct ttagcacctt caagtgctac 1140

ggagtgtctc ctacaaagct gaacgacctg tgtttcacca acgtgtacgc cgatagcttc 1200

gtgattagag gcgatgaagt gagacagatt gctcctggcc agacaggaaa gatcgccgat 1260

tacaactaca agctgcctga tgacttcacc ggctgtgtga ttgcctggaa tagcaataac 1320

ctggacagca aagtgggcgg caactacaac tacctgtaca gactgttcag gaagagcaac 1380

ctgaagccct tcgagagaga catctctacc gagatttatc aggctggaag caccccttgt 1440

aatggcgtgg aaggcttcaa ctgttacttt cctctgcaga gctacggctt tcagcctacc 1500

aatggagtgg gatatcagcc ttatagagtg gtggtgctga gctttgaact gctgcatgct 1560

cctgctacag tgtgtggacc taagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620

ttcaacttca acggcctgac aggaacagga gtgctgacag agagcaataa gaagttcctg 1680

cccttccagc agtttggcag agacattgcc gatacaacag atgccgtgag agatcctcag 1740

acactggaga tcctggatat cacaccttgt agctttggcg gcgtgtctgt gattacacct 1800

ggaaccaata ccagcaatca ggtggctgtg ctgtaccagg atgtgaattg cacagaagtg 1860

cctgtggcca ttcatgctga tcagctgaca cctacatgga gagtgtacag caccggctct 1920

aatgtgtttc agaccagagc tggatgtctg attggagccg agcacgtgaa taacagctac 1980

gagtgtgaca tccctattgg agccggaatc tgtgcctctt atcagacaca gaccaactct 2040

cctagaagag ccagatctgt ggcctctcag tctatcatcg cctataccat gtctctggga 2100

gctgagaata gcgtggccta tagcaacaac agcattgcca tccctaccaa cttcaccatc 2160

agcgtgacaa cagagattct gcctgtgagc atgaccaaga catctgtgga ctgcaccatg 2220

tacatctgtg gcgattctac cgagtgtagc aatctgctgc tgcagtacgg ctctttttgt 2280

acccagctga atagagccct gacaggaatt gccgtggaac aggacaagaa tacccaggaa 2340

gtgtttgccc aggtgaagca gatctacaag acccctccta tcaaggactt tggcggcttc 2400

aacttctctc agattctgcc tgatcctagc aagcccagca agagaagttt catcgaggat 2460

ctgctgttca acaaggtgac actggccgat gccggattta tcaagcagta tggagattgt 2520

ctgggcgata tcgccgccag agatctgatt tgtgcccaga agtttaatgg actgaccgtg 2580

ctgcctcctc tgctgacaga tgagatgatt gctcagtata catctgccct gctggccgga 2640

acaatcacat ctggatggac atttggagct ggagctgctc tgcagattcc ttttgccatg 2700

cagatggcct acagattcaa tggcatcggc gtgacacaga atgtgctgta cgagaaccag 2760

aagctgattg ccaaccagtt caacagcgcc attggcaaga tccaggattc tctgtcttct 2820

acagcctctg ctctgggaaa actgcaggat gtggtgaatc agaatgccca ggccctgaat 2880

acactggtga agcagctgtc tagcaatttt ggcgccatct ctagcgtgct gaatgacatc 2940

ctgagcagac tggataaagt ggaggccgaa gtgcagatcg atagactgat cacaggcaga 3000

ctgcagtctc tgcagacata tgtgacacag cagctgatta gagctgccga gatcagagct 3060

tctgctaatc tggctgccac aaagatgtct gagtgtgtgc tgggacagtc taagagagtg 3120

gacttctgtg gcaaaggcta tcacctgatg agctttcctc agtctgctcc tcatggagtg 3180

gtgtttctgc atgtgacata tgtgcctgcc caggagaaga acttcacaac agctcctgcc 3240

atttgtcatg atggcaaggc ccactttcct agagaaggag tgttcgtgtc taatggcaca 3300

cactggttcg tgacacagag gaacttctac gagcctcaga tcatcaccac cgataacacc 3360

ttcgtgtctg gcaattgcga tgtggtgatc ggcatcgtga acaataccgt gtatgatcct 3420

ctgcagcctg agctggatag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480

tctcctgatg tggatctggg cgatatctct ggcatcaatg cctctgtggt gaacatccag 3540

aaggagatcg acagactgaa tgaggtggcc aagaacctga atgagagcct gatcgatctg 3600

caggaactgg gaaagtacga gcagtacatc aagtggcctt ggtacatctg gctgggattt 3660

attgccggac tgattgccat cgtgatggtg accatcatgc tgtgctgtat gaccagctgt 3720

tgtagctgtc tgaaaggctg ctgtagctgt ggcagctgtt gcaagtttga tgaggatgat 3780

tctgagcctg tgctgaaggg cgtgaagctg cactacacc 3819

<210> 25

<211> 3819

<212> DNA

<213> Artificial Sequence

<220>

<223> SDC54

<400> 25

atgttcgtgt tcctggtgct gctgcctctg gtgagctctc agtgtgtgaa tctgaccaca 60

agaacccagc tgcctcctgc ctacaccaac agctttacca gaggagtgta ctaccccgac 120

aaggtgttca gaagcagcgt gctgcatagc acacaggatc tgttcctgcc cttcttcagc 180

aacgtgacct ggtttcacgc catccatgtg tctggcacca atggcaccaa gagattcgac 240

aaccctgtgc tgcctttcaa cgatggcgtg tacttcgcct ctaccgagaa gagcaacatc 300

atcagaggct ggatcttcgg caccacactg gatagcaaga cccagtctct gctgatcgtg 360

aacaacgcca ccaacgtggt gatcaaggtg tgcgagttcc agttctgcaa cgaccccttc 420

ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt cagggtgtac 480

agcagcgcca acaattgcac cttcgagtac gtgagccagc ctttcctgat ggatctggag 540

ggaaagcagg gcaacttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600

ttcaagatct acagcaagca cacccccatc aacctggtga gagatctgcc tcagggattt 660

tctgctctgg agcctctggt ggatctgcct atcggcatca acatcaccag attccagaca 720

ctgctggccc tgcacagaag ctacctgaca cctggagatt cttcttctgg ctggacagct 780

ggagctgctg cctattacgt gggctatctg cagcccagaa ccttcctgct gaagtacaac 840

gagaacggca ccatcacaga tgccgtggat tgtgccctgg atcctctgtc tgagaccaag 900

tgtaccctga agagcttcac cgtggagaag ggcatctacc agaccagcaa cttcagagtg 960

cagcctaccg agagcatcgt gagattcccc aacatcacca acctgtgccc ttttggcgag 1020

gtgttcaatg ccaccagatt tgccagcgtg tacgcctgga acaggaagag gatcagcaac 1080

tgtgtggccg attacagcgt gctgtacaac tctgccagct tcagcacctt caagtgctac 1140

ggcgtgtctc ctacaaagct gaacgacctg tgcttcacca acgtgtacgc cgacagcttc 1200

gtgattagag gcgatgaggt gagacagatt gctcctggcc agacaggcaa gattgccgac 1260

tacaactaca agctgcctga cgacttcacc ggctgtgtga ttgcctggaa cagcaacaat 1320

ctggacagca aggtgggcgg caactacaac tacctgtaca ggctgttcag gaagagcaac 1380

ctgaagccct tcgagagaga catcagcacc gagatctatc aggctggaag caccccttgt 1440

aatggcgtgg agggcttcaa ctgttacttc cctctgcaga gctacggctt tcagcctacc 1500

aatggagtgg gctatcagcc ttacagagtg gtggtgctga gctttgaact gctgcatgct 1560

cctgctacag tgtgtggccc caagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620

ttcaacttca acggcctgac cggaacagga gtgctgacag agagcaacaa gaagttcctg 1680

cccttccagc agttcggcag agatatcgcc gataccacag atgccgtgag agatcctcag 1740

acactggaga tcctggacat cacaccttgc agctttggcg gagtgtctgt gatcacacct 1800

ggcaccaata ccagcaatca ggtggctgtg ctgtaccagg acgtgaattg caccgaagtg 1860

cctgtggcca ttcatgctga tcagctgacc cctacatgga gagtgtacag caccggctct 1920

aatgtgttcc agaccagagc cggatgtctg attggagccg agcacgtgaa taacagctac 1980

gagtgcgaca tccctattgg agccggcatc tgtgcctctt atcagaccca gaccaactct 2040

cctagaagag ccagaagcgt ggcctctcag agcatcattg cctacaccat gtctctggga 2100

gccgagaata gcgtggccta cagcaataac agcatcgcca tccccaccaa cttcaccatc 2160

agcgtgacca cagagattct gcctgtgagc atgaccaaga cctctgtgga ctgcaccatg 2220

tacatctgtg gcgactctac cgagtgcagc aatctgctgc tgcagtatgg cagcttttgt 2280

acccagctga acagagccct gacaggcatt gctgtggagc aggataagaa cacccaggag 2340

gtgtttgccc aggtgaagca gatctacaag acccctccca tcaaggactt cggcggcttt 2400

aacttcagcc agatcctgcc tgatcctagc aagcccagca agaggagctt tatcgaggac 2460

ctgctgttca acaaggtgac cctggccgat gctggcttta tcaagcagta cggagattgt 2520

ctgggcgata tcgccgccag agacctgatt tgtgcccaga agttcaatgg actgaccgtg 2580

ctgcctcctc tgctgacaga tgagatgatt gcccagtaca catctgccct gctggctggc 2640

acaatcacat ctggatggac atttggagct ggagctgccc tgcagatccc ttttgccatg 2700

cagatggcct acagattcaa cggcatcggc gtgacccaga atgtgctgta cgagaaccag 2760

aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggattc tctgtctagc 2820

acagcctctg ctctgggaaa gctgcaggat gtggtgaatc agaatgccca ggccctgaat 2880

acactggtga agcagctgag cagcaacttt ggcgccatca gctctgtgct gaatgacatc 2940

ctgagcagac tggacaaggt ggaggctgaa gtgcagatcg acagactgat cacaggcaga 3000

ctgcagtctc tgcagaccta cgtgacacag cagctgatta gagctgccga gatcagagct 3060

tctgccaatc tggctgccac caagatgtct gagtgtgtgc tgggacagag caagagagtg 3120

gacttctgtg gcaaaggcta ccacctgatg agcttccctc agtctgctcc tcatggagtg 3180

gtgtttctgc acgtgaccta tgtgcctgcc caggagaaga acttcaccac agctcctgcc 3240

atttgtcacg atggcaaggc ccactttcct agagaaggcg tgttcgtgag caatggcaca 3300

cactggttcg tgacccagag gaacttctac gagccccaga tcatcaccac cgataacacc 3360

ttcgtgagcg gcaattgcga cgtggtgatc ggcatcgtga acaataccgt gtacgatcct 3420

ctgcagcctg agctggacag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480

agccctgatg tggatctggg cgacatctct ggcatcaatg ccagcgtggt gaacatccag 3540

aaggagatcg acaggctgaa cgaggtggcc aagaacctga atgagagcct gatcgatctg 3600

caggagctgg gcaagtacga gcagtacatc aagtggcctt ggtacatctg gctgggcttt 3660

atcgccggac tgattgccat cgtgatggtg accatcatgc tgtgctgcat gaccagctgc 3720

tgtagctgtc tgaagggctg ttgtagctgt ggcagctgtt gcaagttcga cgaggatgat 3780

agcgagcctg tgctgaaagg cgtgaagctg cactacacc 3819

<210> 26

<211> 3819

<212> DNA

<213> Artificial Sequence

<220>

<223> SDC58

<400> 26

atgttcgtgt tcctggtgct gctgcccctg gtgagctctc agtgtgtgaa cctgaccacc 60

agaacccagc tgcctcctgc ctacaccaac agcttcacca gaggcgtgta ctaccccgac 120

aaggtgttca gaagcagcgt gctgcacagc acccaggacc tgttcctgcc cttcttcagc 180

aacgtgacct ggttccacgc catccacgtg tctggcacca atggcaccaa gaggttcgac 240

aaccctgtgc tgcccttcaa cgacggcgtg tacttcgcca gcaccgagaa gagcaacatc 300

atcaggggct ggatcttcgg caccaccctg gacagcaaga cccagagcct gctgatcgtg 360

aacaacgcca ccaacgtggt gatcaaggtg tgcgagttcc agttctgcaa cgaccccttc 420

ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt ccgggtgtac 480

agcagcgcca acaactgcac cttcgagtac gtgagccagc ccttcctgat ggacctggag 540

ggcaagcagg gcaacttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600

ttcaagatct acagcaagca cacccccatc aacctggtga gagacctgcc tcagggcttt 660

tctgccctgg agcctctggt ggacctgcct atcggcatca acatcaccag gttccagacc 720

ctgctggccc tgcacagaag ctacctgaca cctggcgata gctcttctgg ctggacagct 780

ggagctgctg cctattacgt gggctacctg cagcccagga ccttcctgct gaagtacaac 840

gagaacggca ccatcaccga cgccgtggat tgtgccctgg atcctctgag cgagaccaag 900

tgcaccctga agagcttcac cgtggagaag ggcatctacc agaccagcaa cttccgggtg 960

cagcctaccg agagcatcgt gaggttcccc aacatcacca acctgtgccc tttcggcgag 1020

gtgttcaacg ccaccagatt cgcctctgtg tacgcctgga acaggaagcg gatcagcaac 1080

tgcgtggccg actacagcgt gctgtacaac agcgccagct tcagcacctt caagtgctac 1140

ggcgtgagcc ctaccaagct gaacgacctg tgcttcacca acgtgtacgc cgacagcttc 1200

gtgatcagag gcgatgaggt gagacagatc gcccctggac agaccggcaa gatcgccgac 1260

tacaactaca agctgcccga cgacttcacc ggctgtgtga tcgcctggaa cagcaacaac 1320

ctggacagca aggtgggcgg caactacaac tacctgtacc ggctgttccg gaagagcaac 1380

ctgaagccct tcgagaggga catcagcacc gagatctacc aggccggaag cacaccttgc 1440

aatggcgtgg agggcttcaa ctgctacttc cccctgcaga gctacggctt tcagcctacc 1500

aatggcgtgg gctaccagcc ctacagagtg gtggtgctga gctttgaact gctgcatgcc 1560

cctgccacag tgtgtggccc caagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620

ttcaacttca acggcctgac cggcacaggc gtgctgaccg agagcaacaa gaagttcctg 1680

cccttccagc agttcggcag agacatcgcc gataccaccg atgccgtgag agatcctcag 1740

accctggaga tcctggacat caccccttgc agctttggcg gagtgagcgt gatcacacct 1800

ggcaccaaca ccagcaatca ggtggccgtg ctgtaccagg acgtgaactg cacagaggtg 1860

cctgtggcca ttcatgccga tcagctgacc cctacctgga gagtgtacag caccggcagc 1920

aatgtgttcc agaccagagc cggctgtctg atcggagccg agcacgtgaa caacagctac 1980

gagtgcgaca tccctatcgg agccggcatc tgcgcctctt accagacaca gaccaacagc 2040

cccagaagag ccagaagcgt ggccagccag tctatcatcg cctacaccat gagcctggga 2100

gccgagaaca gcgtggccta cagcaacaac agcatcgcca tccccaccaa cttcaccatc 2160

agcgtgacca ccgagatcct gcccgtgagc atgaccaaga ccagcgtgga ctgcaccatg 2220

tacatctgcg gcgacagcac agagtgcagc aacctgctgc tgcagtacgg cagcttttgc 2280

acccagctga acagagccct gacaggcatt gccgtggagc aggacaagaa cacccaggag 2340

gtgttcgccc aggtgaagca gatctacaag acccccccca tcaaggactt cggcggcttc 2400

aacttcagcc agatcctgcc tgaccctagc aagcccagca agcggagctt catcgaggac 2460

ctgctgttca acaaggtgac cctggccgat gccggcttca tcaagcagta cggcgattgt 2520

ctgggcgata tcgccgccag agacctgatc tgtgcccaga agttcaacgg cctgaccgtg 2580

ctgcctcctc tgctgacaga tgagatgatc gcccagtaca cctctgccct gctggccgga 2640

accatcacat ctggctggac atttggagct ggagccgccc tgcagatccc tttcgccatg 2700

cagatggcct acaggttcaa cggcatcggc gtgacccaga acgtgctgta cgagaaccag 2760

aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggacag cctgtctagc 2820

acagcctctg ctctgggcaa gctgcaggat gtggtgaacc agaatgccca ggccctgaac 2880

accctggtga agcagctgag cagcaatttc ggcgccatca gcagcgtgct gaacgacatc 2940

ctgagcagac tggacaaggt ggaggccgag gtgcagatcg acagactgat caccggcaga 3000

ctgcagagcc tgcagaccta cgtgacacag cagctgatca gagccgccga gatcagagcc 3060

tctgccaatc tggctgccac caagatgagc gagtgtgtgc tgggccagag caagagagtg 3120

gacttctgcg gcaaaggcta ccacctgatg agcttccccc agtctgctcc tcatggcgtg 3180

gtgtttctgc acgtgaccta cgtgcctgcc caggagaaga acttcaccac agcccctgcc 3240

atctgtcacg atggcaaggc ccacttccct agagagggcg tgttcgtgag caatggcacc 3300

cactggttcg tgacccagcg gaacttctac gagccccaga tcatcaccac cgacaacacc 3360

ttcgtgagcg gcaactgcga cgtggtgatc ggcatcgtga acaacaccgt gtacgaccct 3420

ctgcagcccg agctggacag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480

agccccgacg tggatctggg cgacatcagc ggcatcaacg ccagcgtggt gaacatccag 3540

aaggagatcg accggctgaa cgaggtggcc aagaacctga acgagagcct gatcgacctg 3600

caggagctgg gcaagtacga gcagtacatc aagtggccct ggtacatctg gctgggcttt 3660

atcgccggcc tgatcgccat cgtgatggtg accatcatgc tgtgctgcat gaccagctgc 3720

tgcagctgcc tgaagggctg ttgtagctgt ggcagctgct gcaagttcga cgaggacgat 3780

agcgagcctg tgctgaaggg cgtgaagctg cactacacc 3819

<210> 27

<211> 3819

<212> DNA

<213> Artificial Sequence

<220>

<223> SDC60

<400> 27

atgttcgtgt tcctggtgct gctgcccctg gtgagcagcc agtgtgtgaa cctgaccacc 60

agaacccagc tgcctcccgc ctacaccaac agcttcacca ggggcgtgta ctaccccgac 120

aaggtgttca ggagcagcgt gctgcacagc acccaggacc tgttcctgcc cttcttcagc 180

aacgtgacct ggttccacgc catccacgtg agcggcacca atggcaccaa gcggttcgac 240

aaccctgtgc tgcccttcaa cgacggcgtg tacttcgcca gcaccgagaa gagcaacatc 300

atccggggct ggatcttcgg caccaccctg gacagcaaga cccagagcct gctgatcgtg 360

aacaacgcca ccaacgtggt gatcaaggtg tgcgagttcc agttctgcaa cgaccccttc 420

ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt ccgggtgtac 480

agcagcgcca acaactgcac cttcgagtac gtgagccagc ccttcctgat ggacctggag 540

ggcaagcagg gcaacttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600

ttcaagatct acagcaagca cacccccatc aacctggtga gggacctgcc tcagggcttt 660

tctgccctgg agcctctggt ggacctgccc atcggcatca acatcaccag gttccagacc 720

ctgctggccc tgcacaggag ctacctgaca cctggcgata gctcttctgg ctggacagcc 780

ggagctgctg cctactacgt gggctacctg cagccccgga ccttcctgct gaagtacaac 840

gagaacggca ccatcaccga cgccgtggat tgcgccctgg atcctctgag cgagaccaag 900

tgcaccctga agagcttcac cgtggagaag ggcatctacc agaccagcaa cttccgggtg 960

cagcccaccg agagcatcgt gaggttcccc aacatcacca acctgtgccc cttcggcgag 1020

gtgttcaacg ccaccagatt cgccagcgtg tacgcctgga accggaagcg gatcagcaac 1080

tgcgtggccg actacagcgt gctgtacaac agcgccagct tcagcacctt caagtgctac 1140

ggcgtgagcc ccaccaagct gaacgacctg tgcttcacca acgtgtacgc cgacagcttc 1200

gtgatcaggg gcgatgaggt gagacagatc gcccctggcc agaccggcaa gatcgccgac 1260

tacaactaca agctgcccga cgacttcacc ggctgcgtga tcgcctggaa cagcaacaac 1320

ctggacagca aggtgggcgg caactacaac tacctgtacc ggctgttccg gaagagcaac 1380

ctgaagccct tcgagcggga catcagcacc gagatctacc aggccggaag caccccttgc 1440

aacggcgtgg agggcttcaa ctgctacttc cccctgcaga gctacggctt ccagcctacc 1500

aatggcgtgg gctaccagcc ctacagggtg gtggtgctga gctttgagct gctgcatgct 1560

cctgccaccg tgtgcggccc caagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620

ttcaacttca acggcctgac cggcaccggc gtgctgaccg agagcaacaa gaagttcctg 1680

cccttccagc agttcggcag ggacatcgcc gataccaccg atgccgtgag agaccctcag 1740

accctggaga tcctggacat caccccttgc agcttcggcg gagtgagcgt gatcacacct 1800

ggcaccaaca ccagcaacca ggtggccgtg ctgtaccagg acgtgaactg caccgaggtg 1860

cctgtggcca ttcacgccga tcagctgacc cccacctgga gagtgtacag caccggcagc 1920

aacgtgttcc agaccagagc cggctgtctg atcggcgccg agcacgtgaa caacagctac 1980

gagtgcgaca tccccatcgg cgccggcatc tgtgccagct atcagaccca gaccaacagc 2040

cctaggaggg ccagaagcgt ggccagccag tctatcatcg cctacaccat gagcctgggc 2100

gccgagaaca gcgtggccta cagcaacaac agcatcgcca tccccaccaa cttcaccatc 2160

agcgtgacca ccgagatcct gcccgtgagc atgaccaaga ccagcgtgga ctgcaccatg 2220

tacatctgcg gcgacagcac cgagtgcagc aacctgctgc tgcagtacgg cagcttctgc 2280

acccagctga acagagccct gacaggcatc gccgtggagc aggacaagaa cacccaggag 2340

gtgttcgccc aggtgaagca gatctacaag acccccccca tcaaggactt cggcggcttc 2400

aacttcagcc agatcctgcc tgaccccagc aagcccagca agcggagctt catcgaggac 2460

ctgctgttca acaaggtgac cctggccgac gccggcttca tcaagcagta cggcgactgt 2520

ctgggcgaca tcgccgccag agacctgatc tgtgcccaga agttcaacgg cctgaccgtg 2580

ctgccccctc tgctgaccga tgagatgatc gcccagtaca cctctgccct gctggccggc 2640

accatcacat ctggctggac ctttggagct ggagccgccc tgcagatccc tttcgccatg 2700

cagatggcct accggttcaa cggcatcggc gtgacccaga acgtgctgta cgagaaccag 2760

aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggacag cctgagcagc 2820

accgcctctg ctctgggcaa actgcaggac gtggtgaacc agaacgccca ggccctgaac 2880

accctggtga agcagctgag cagcaacttc ggcgccatca gcagcgtgct gaacgacatc 2940

ctgagcaggc tggacaaggt ggaggccgag gtgcagatcg acaggctgat caccggcaga 3000

ctgcagagcc tgcagaccta cgtgacccag cagctgatca gagccgccga gatcagagcc 3060

tctgccaatc tggccgccac caagatgagc gagtgtgtgc tgggccagag caagagggtg 3120

gacttctgcg gcaagggcta ccacctgatg agcttccccc agtctgcccc tcatggcgtg 3180

gtgttcctgc acgtgaccta cgtgcctgcc caggagaaga acttcaccac cgcccctgcc 3240

atctgccacg atggcaaggc ccacttccct agagagggcg tgttcgtgag caacggcacc 3300

cactggttcg tgacccagcg gaacttctac gagccccaga tcatcaccac cgacaacacc 3360

ttcgtgagcg gcaactgcga cgtggtgatc ggcatcgtga acaacaccgt gtacgacccc 3420

ctgcagcccg agctggacag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480

agccccgacg tggacctggg cgacatcagc ggcatcaacg ccagcgtggt gaacatccag 3540

aaggagatcg accggctgaa cgaggtggcc aagaacctga acgagagcct gatcgacctg 3600

caggagctgg gcaagtacga gcagtacatc aagtggccct ggtacatctg gctgggcttc 3660

atcgccggcc tgatcgccat cgtgatggtg accatcatgc tgtgctgcat gaccagctgc 3720

tgcagctgcc tgaagggctg ctgcagctgt ggcagctgtt gcaagttcga cgaggacgac 3780

agcgagcccg tgctgaaggg cgtgaagctg cactacacc 3819

<210> 28

<211> 957

<212> DNA

<213> Artificial Sequence

<220>

<223> MT2AE

<400> 28

atggccgatt ctaatggcac catcaccgtg gaagagctga agaagctgct cgagcaatgg 60

aacctggtga tcggatttct gttcctgacc tggatctgtc tgttgcagtt cgcctacgcc 120

aaccggaaca gattcctgta catcatcaaa ctgatcttcc tgtggctgct gtggcctgtg 180

accctggcct gcttcgtgct ggccgccgtg taccggatta actggatcac cggaggcatc 240

gctatcgcca tggcatgcct ggtcggactt atgtggctgt cttatttcat cgccagcttc 300

agactgttcg ctagaaccag aagcatgtgg tcctttaacc ctgagacaaa catcctgctg 360

aacgtgcctc tgcacggcac aatcctgaca cggccactgc tggaaagcga gctggtcatc 420

ggcgccgtga tcctgcgggg ccatctgcgc attgccggac accacctggg cagatgcgac 480

atcaaggacc tgcccaagga aatcaccgtg gccaccagca gaacactgtc ctactacaaa 540

ctgggcgcta gtcagagagt ggccggcgac agcggcttcg ccgcttattc tagatacaga 600

atcggcaact acaagctgaa taccgatcac agcagcagca gcgacaacat cgccctgctg 660

gtgcagggca gcggcgaggg cagaggaagc ctgctgacat gtggcgatgt ggaagagaac 720

cccggccctg ccatgtacag ctttgtgtct gaggaaaccg gcaccctgat cgtgaacagc 780

gtgctgctgt ttctggcctt cgtcgtgttc ctgctggtga cactggctat cctgaccgcc 840

ctgaggctgt gcgcctactg ctgcaacatc gtgaatgtat ccctggtgaa gccttccttc 900

tacgtgtaca gccgggtgaa gaaccttaat agctctagag tgcccgacct gctcgtt 957

<210> 29

<211> 960

<212> DNA

<213> Artificial Sequence

<220>

<223> MP2AE

<400> 29

atggccgaca gcaacggcac aatcacagtg gaagagctga agaagctgct ggagcagtgg 60

aacctggtga ttggatttct tttcctcacc tggatctgcc tgctgcagtt cgcctatgcc 120

aaccggaaca gattcctgta catcatcaag ctgatcttcc tgtggctgct gtggcccgtg 180

accctggcct gttttgtgct ggccgccgtg taccggatca actggatcac cggcggaatc 240

gctatcgcca tggcctgcct ggtgggcctg atgtggctga gctacttcat cgcctccttt 300

agactgttcg ccagaaccag aagcatgtgg tccttcaacc ctgagacaaa tatcctgctc 360

aacgtgcccc tgcacggcac catcctgacc cggcctctgc tcgagagcga gctggtgatc 420

ggcgccgtga tcctgagagg ccacctgaga atcgccggac accacctggg cagatgcgac 480

atcaaggacc tgccaaagga aatcaccgtt gctacaagca gaacactgtc ctactacaag 540

ctgggcgctt ctcaaagagt cgccggcgac agcggcttcg ctgcttatag ccgctacagg 600

attggaaatt acaagctgaa caccgatcat tcttctagca gcgacaacat cgccctgctg 660

gtccagggca gcggcgccac aaacttcagc ctgcttaaac aggccggcga tgtggaagag 720

aaccccggcc ctgccatgta cagcttcgtg tccgaggaaa ccggcaccct gatcgtgaac 780

agcgtgctgc tgttccttgc ttttgtggtg ttcctgctgg tcaccctggc catcctgacc 840

gccctgagac tgtgtgccta ctgctgcaac atcgtgaatg tgtctctggt gaagcctagc 900

ttctacgtgt acagccgggt gaaaaacctg aactctagcc gggtgcctga tctgctggtg 960

<210> 30

<211> 798

<212> DNA

<213> Artificial Sequence

<220>

<223> SGS-RBD

<400> 30

atggagacag acacactcct gctatgggta ctgctgctct gggttccagg ttccaccgga 60

gactgcccat ttggcgaggt gttcaacgca acccgcttcg ccagcgtgta cgcctggaat 120

aggaagcgga tcagcaactg cgtggccgac tatagcgtgc tgtacaactc cgcctctttc 180

agcaccttta agtgctatgg cgtgtccccc acaaagctga atgacctgtg ctttaccaac 240

gtctacgccg attctttcgt gatcaggggc gacgaggtgc gccagatcgc ccccggccag 300

acaggcaaga tcgcagacta caattataag ctgccagacg atttcaccgg ctgcgtgatc 360

gcctggaaca gcaacaatct ggattccaaa gtgggcggca actacaatta tctgtaccgg 420

ctgtttagaa agagcaatct gaagcccttc gagagggaca tctctacaga aatctaccag 480

gccggcagca ccccttgcaa tggcgtggag ggctttaact gttatttccc actccagtcc 540

tacggcttcc agcccacaaa cggcgtgggc tatcagcctt accgcgtggt ggtgctgagc 600

tttgagctgc tgcacgccta cccgtacgac gtgccggact acgccaatgc tgtgggccag 660

gacacgcagg aggtcatcgt ggtgccacac tccttgccct ttaaggtggt ggtgatctca 720

gccatcctgg ccctggtggt gctcaccatc atctccctta tcatcctcat catgctttgg 780

cagaagaagc cacgttag 798

<210> 31

<211> 3819

<212> RNA

<213> Artificial Sequence

<220>

<223> SDC-50 mRNA

<400> 31

uacaagcaca aggaccacga cgacggagac cacagaagag ucacacacuu agacuguugg 60

ucuugggucg acggaggacg gauaugguug ucgaaguguu cuccgcacau gaugggacug 120

uuccacaagu ccagaagaca cgacgugaga uggguccuag acaaggacgg aaagaagucg 180

uugcacugga ccaaagugcg guagguacac agaccguggu uaccgugguu cucuaagcug 240

uuaggacacg acggaaaguu gcuaccgcac augaagcgga gauggcucuu cucguuguag 300

uagucuccga ccuagaaacc guguugggac cuaucguucu gggucagaga cgacuagcac 360

uuguuacggu gguugcacca cuaguuccac acgcucaagg ucaagacguu acugggaaag 420

gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa gucccacaug 480

ucgagacggu uguuaacgug gaagcucaug cacucggucg gaaaggacua ccuagaccuu 540

ccuuucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600

aaguucuaga ugucguucgu guggggguag uuagaccacu cucuagacgg agucccuaaa 660

agacgagacc uuggagacca ccuagacgga uagccguagu uguagugguc uaaggucugu 720

gacgaccgag acgugucuuc gauagacugu ggaccgcuaa gaagaagacc uaccugucga 780

ccucgacgac gaauaaugca cccgauggac gucggaucuu ggaaggacga cuucauguug 840

cucuuaccgu gguaguggcu acgacaccua acacgggacc uaggagacag acucuguuuc 900

acaugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu aaagucucac 960

gucggauggc ucucguagca cucuaagggg uuguaguggu uagacacggg aaaaccgcuc 1020

cacaaguuac gguggucuaa acggucgcac auacggaccu uguccuucuc uuagucguug 1080

acacaccggc ugaugucgca cgacauguua agacggucga aaucguggaa guucacgaug 1140

ccgcacagag gaugguucga cuuacuggac acaaaguggu ugcacaugcg gcugucgaag 1200

cacuagucuc cucuacuuca cucugucuaa cgaggaccgg ucuguccguu cuagcggcua 1260

auguugaugu ucgacggacu acugaagugg ccgacacacu agcggaccuu aucguuguua 1320

gaccugucgu uucacccgcc guugauguug auggacaugu ccgacaaguc cuucucguug 1380

gacuucggga agcucucucu guagagaugg cucuagauag uccgaccuuc guggggaaca 1440

uuaccgcacc uuccgaaguu gacaaugaag ggagacgucu cgaugccgaa agucggaugg 1500

uuaccucacc cuauagucgg aaugucucac caccacgacu cgaaacuuga cgacguacga 1560

ggacgauguc acacaccggg auucuucucg ugguuggacc acuucuuguu cacgcacuug 1620

aaguugaagu ugccggacug gccuuguccu cacgacuguc ucucguuguu cuucaaggac 1680

gggaaggucg ucaaaccguc ucuguaacgg cuaugguguc uacggcacuc ucuaggaguc 1740

ugugaccucu aggaccuaua guguggaacg ucgaaaccgc cucacagaca cuagugugga 1800

ccuugguuau ggucguuagu ccaccgacac gacauggucc ugcacuuaac gugucuucac 1860

ggacaccggu aaguacgacu agucgacugg ggauguaccu cucacauguc guguccgucg 1920

uuacacaaag ucuggucucg gccuacagac uaaccucgac ucgugcacuu guugucgaug 1980

cucacacugu agggauaacc ucggccuuag acacggucga uagucugugu cugguugaga 2040

ggaucuucuc ggucuagaca ccggucgguc agauaguagc ggauauggua cagagacccu 2100

cgacucuuau cgcaccggau gucguuguug ucguagcggu agggaugguu gaagugguag 2160

ucgcacuguu gucucuagga cggacacucg uacugguucu guagacaccu gacgugguac 2220

auguagacac cgcugucgug ucucacaucg uuagacgacg acgucaugcc gucgaaaaca 2280

ugggucgacu uaucucggga cuguccuuaa cggcaccucg uccuauucuu auggguccuc 2340

cacaaacggg uccacuucgu cuagauguuc uggggaggau aguuccugaa gccgccgaag 2400

uugaagucgg ucuaagacgg acuaggaucg uucgggucgu ucucuucaaa guagcuccua 2460

gacgacaagu uguuccacug ggaccggcua cggccuaaau aguucgucau accgcuaaca 2520

gacccgcuau agcggcgguc ucuagacuaa acacgggucu ucaaguuacc ugacuggcac 2580

gacggaggag acgacugucu acucuacuaa cgagucaugu guagacggga cgaccgaccg 2640

uguuagugua gaccuaccug uaaaccucga ccucgacgag acgucuaggg aaaacgguac 2700

gucuaccgga ugucuaaguu gccguagccu cacugggucu uacacgacau gcucuugguc 2760

uucgacuagc gguuggucaa guugucgcgg uaaccguucu agguccuaag agacagaucg 2820

ugucgaagac gagacccguu ugacguccua caccacuuag ucuuacgagu ccgggacuua 2880

ugggaccacu ucgucgacag aucguuaaaa ccgcgguagu cgucgcacga cuuacuguag 2940

gacucgucug accuauuuca ccuccggcuu cacgucuagc ugucugacua guguccuucu 3000

gacgucagag acgucuggau gcacuguguc gucgacuaau cucgacggcu cuaaucucgg 3060

agacgauuag accgacggug guucuacaga cucacacacg acccugucag auucucucac 3120

cugaagacac cguuuccgau gguggacuac ucgaaaggag ucagacgagg aguaccucac 3180

cacaaagacg ugcacuguau acacggacgg guccucuucu ugaaguggug ucgaggacgg 3240

uaaacagugc uaccguuucg ggugaaagga ucucuuccgc acaagcacuc guuaccuugg 3300

gugaccaaac acugggucuc uuugaagaug cucggggucu aguaguggug gcuguuaugg 3360

aagcacagac cguuaacgcu gcaccacuag ccguagcacu uguuauggca cauacuagga 3420

gacgucggac ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480

ucgggacuac accuagaccc gcuauagaga ccguaguuac ggagacacca cuuguagguc 3540

uuccucuagc uguccgacuu acuccaccgg uucuuggacu uacucucgga cuagcuagac 3600

guccucgacc cuuucaugcu cgucauguag uucaccggaa ccauguagac cgacccgaaa 3660

uaacggccug acuaacggua gcacuaccac ugguaguacg acacgacgua cuguucgaca 3720

acaucgacag acuucccgac gacaagaaca ccgucgacaa cguucaagcu acuccuacua 3780

ucgcucggac acgacuuucc gcacuucgac gugaugugg 3819

<210> 32

<211> 3819

<212> RNA

<213> Artificial Sequence

<220>

<223> SDC-54 mRNA

<400> 32

uacaagcaca aggaccacga cgacggagac cacucgagag ucacacacuu agacuggugu 60

ucuugggucg acggaggacg gaugugguug ucgaaauggu cuccucacau gauggggcug 120

uuccacaagu cuucgucgca cgacguaucg uguguccuag acaaggacgg gaagaagucg 180

uugcacugga ccaaagugcg guagguacac agaccguggu uaccgugguu cucuaagcug 240

uugggacacg acggaaaguu gcuaccgcac augaagcgga gauggcucuu cucguuguag 300

uagucuccga ccuagaagcc guggugugac cuaucguucu gggucagaga cgacuagcac 360

uuguugcggu gguugcacca cuaguuccac acgcucaagg ucaagacguu gcuggggaag 420

gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa gucccacaug 480

ucgucgcggu uguuaacgug gaagcucaug cacucggucg gaaaggacua ccuagaccuc 540

ccuuucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600

aaguucuaga ugucguucgu guggggguag uuggaccacu cucuagacgg agucccuaaa 660

agacgagacc ucggagacca ccuagacgga uagccguagu uguagugguc uaaggucugu 720

gacgaccggg acgugucuuc gauggacugu ggaccucuaa gaagaagacc gaccugucga 780

ccucgacgac ggauaaugca cccgauagac gucgggucuu ggaaggacga cuucauguug 840

cucuugccgu gguagugucu acggcaccua acacgggacc uaggagacag acucugguuc 900

acaugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu gaagucucac 960

gucggauggc ucucguagca cucuaagggg uuguaguggu uggacacggg aaaaccgcuc 1020

cacaaguuac gguggucuaa acggucgcac augcggaccu uguccuucuc cuagucguug 1080

acacaccggc uaaugucgca cgacauguug agacggucga agucguggaa guucacgaug 1140

ccgcacagag gauguuucga cuugcuggac acgaaguggu ugcacaugcg gcugucgaag 1200

cacuaaucuc cgcuacucca cucugucuaa cgaggaccgg ucuguccguu cuaacggcug 1260

auguugaugu ucgacggacu gcugaagugg ccgacacacu aacggaccuu gucguuguua 1320

gaccugucgu uccacccgcc guugauguug auggacaugu ccgacaaguc cuucucguug 1380

gacuucggga agcucucucu guagucgugg cucuagauag uccgaccuuc guggggaaca 1440

uuaccgcacc ucccgaaguu gacaaugaag ggagacgucu cgaugccgaa agucggaugg 1500

uuaccucacc cgauagucgg aaugucucac caccacgacu cgaaacuuga cgacguacga 1560

ggacgauguc acacaccggg guucuucucg ugguuggacc acuucuuguu cacgcacuug 1620

aaguugaagu ugccggacug gccuuguccu cacgacuguc ucucguuguu cuucaaggac 1680

gggaaggucg ucaagccguc ucuauagcgg cuaugguguc uacggcacuc ucuaggaguc 1740

ugugaccucu aggaccugua guguggaacg ucgaaaccgc cucacagaca cuagugugga 1800

ccgugguuau ggucguuagu ccaccgacac gacauggucc ugcacuuaac guggcuucac 1860

ggacaccggu aaguacgacu agucgacugg ggauguaccu cucacauguc guggccgaga 1920

uuacacaagg ucuggucucg gccuacagac uaaccucggc ucgugcacuu auugucgaug 1980

cucacgcugu agggauaacc ucggccguag acacggagaa uagucugggu cugguugaga 2040

ggaucuucuc ggucuucgca ccggagaguc ucguaguaac ggauguggua cagagacccu 2100

cggcucuuau cgcaccggau gucguuauug ucguagcggu aggggugguu gaagugguag 2160

ucgcacuggu gucucuaaga cggacacucg uacugguucu ggagacaccu gacgugguac 2220

auguagacac cgcugagaug gcucacgucg uuagacgacg acgucauacc gucgaaaaca 2280

ugggucgacu ugucucggga cuguccguaa cgacaccucg uccuauucuu guggguccuc 2340

cacaaacggg uccacuucgu cuagauguuc uggggagggu aguuccugaa gccgccgaaa 2400

uugaagucgg ucuaggacgg acuaggaucg uucgggucgu ucuccucgaa auagcuccug 2460

gacgacaagu uguuccacug ggaccggcua cgaccgaaau aguucgucau gccucuaaca 2520

gacccgcuau agcggcgguc ucuggacuaa acacgggucu ucaaguuacc ugacuggcac 2580

gacggaggag acgacugucu acucuacuaa cgggucaugu guagacggga cgaccgaccg 2640

uguuagugua gaccuaccug uaaaccucga ccucgacggg acgucuaggg aaaacgguac 2700

gucuaccgga ugucuaaguu gccguagccg cacugggucu uacacgacau gcucuugguc 2760

uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuaag agacagaucg 2820

ugucggagac gagacccuuu cgacguccua caccacuuag ucuuacgggu ccgggacuua 2880

ugugaccacu ucgucgacuc gucguugaaa ccgcgguagu cgagacacga cuuacuguag 2940

gacucgucug accuguucca ccuccgacuu cacgucuagc ugucugacua guguccgucu 3000

gacgucagag acgucuggau gcacuguguc gucgacuaau cucgacggcu cuagucucga 3060

agacgguuag accgacggug guucuacaga cucacacacg acccugucuc guucucucac 3120

cugaagacac cguuuccgau gguggacuac ucgaagggag ucagacgagg aguaccucac 3180

cacaaagacg ugcacuggau acacggacgg guccucuucu ugaaguggug ucgaggacgg 3240

uaaacagugc uaccguuccg ggugaaagga ucucuuccgc acaagcacuc guuaccgugu 3300

gugaccaagc acugggucuc cuugaagaug cucggggucu aguaguggug gcuauugugg 3360

aagcacucgc cguuaacgcu gcaccacuag ccguagcacu uguuauggca caugcuagga 3420

gacgucggac ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480

ucgggacuac accuagaccc gcuguagaga ccguaguuac ggucgcacca cuuguagguc 3540

uuccucuagc uguccgacuu gcuccaccgg uucuuggacu uacucucgga cuagcuagac 3600

guccucgacc cguucaugcu cgucauguag uucaccggaa ccauguagac cgacccgaaa 3660

uagcggccug acuaacggua gcacuaccac ugguaguacg acacgacgua cuggucgacg 3720

acaucgacag acuucccgac aacaucgaca ccgucgacaa cguucaagcu gcuccuacua 3780

ucgcucggac acgacuuucc gcacuucgac gugaugugg 3819

<210> 33

<211> 3819

<212> RNA

<213> Artificial Sequence

<220>

<223> SDC-58 mRNA

<400> 33

uacaagcaca aggaccacga cgacggggac cacucgagag ucacacacuu ggacuggugg 60

ucuugggucg acggaggacg gaugugguug ucgaaguggu cuccgcacau gauggggcug 120

uuccacaagu cuucgucgca cgacgugucg uggguccugg acaaggacgg gaagaagucg 180

uugcacugga ccaaggugcg guaggugcac agaccguggu uaccgugguu cuccaagcug 240

uugggacacg acgggaaguu gcugccgcac augaagcggu cguggcucuu cucguuguag 300

uaguccccga ccuagaagcc guggugggac cugucguucu gggucucgga cgacuagcac 360

uuguugcggu gguugcacca cuaguuccac acgcucaagg ucaagacguu gcuggggaag 420

gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa ggcccacaug 480

ucgucgcggu uguugacgug gaagcucaug cacucggucg ggaaggacua ccuggaccuc 540

ccguucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600

aaguucuaga ugucguucgu guggggguag uuggaccacu cucuggacgg agucccgaaa 660

agacgggacc ucggagacca ccuggacgga uagccguagu uguagugguc caaggucugg 720

gacgaccggg acgugucuuc gauggacugu ggaccgcuau cgagaagacc gaccugucga 780

ccucgacgac ggauaaugca cccgauggac gucggguccu ggaaggacga cuucauguug 840

cucuugccgu gguaguggcu gcggcaccua acacgggacc uaggagacuc gcucugguuc 900

acgugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu gaaggcccac 960

gucggauggc ucucguagca cuccaagggg uuguaguggu uggacacggg aaagccgcuc 1020

cacaaguugc gguggucuaa gcggagacac augcggaccu uguccuucgc cuagucguug 1080

acgcaccggc ugaugucgca cgacauguug ucgcggucga agucguggaa guucacgaug 1140

ccgcacucgg gaugguucga cuugcuggac acgaaguggu ugcacaugcg gcugucgaag 1200

cacuagucuc cgcuacucca cucugucuag cggggaccug ucuggccguu cuagcggcug 1260

auguugaugu ucgacgggcu gcugaagugg ccgacacacu agcggaccuu gucguuguug 1320

gaccugucgu uccacccgcc guugauguug auggacaugg ccgacaaggc cuucucguug 1380

gacuucggga agcucucccu guagucgugg cucuagaugg uccggccuuc guguggaacg 1440

uuaccgcacc ucccgaaguu gacgaugaag ggggacgucu cgaugccgaa agucggaugg 1500

uuaccgcacc cgauggucgg gaugucucac caccacgacu cgaaacuuga cgacguacgg 1560

ggacgguguc acacaccggg guucuucucg ugguuggacc acuucuuguu cacgcacuug 1620

aaguugaagu ugccggacug gccguguccg cacgacuggc ucucguuguu cuucaaggac 1680

gggaaggucg ucaagccguc ucuguagcgg cuaugguggc uacggcacuc ucuaggaguc 1740

ugggaccucu aggaccugua guggggaacg ucgaaaccgc cucacucgca cuagugugga 1800

ccgugguugu ggucguuagu ccaccggcac gacauggucc ugcacuugac gugucuccac 1860

ggacaccggu aaguacggcu agucgacugg ggauggaccu cucacauguc guggccgucg 1920

uuacacaagg ucuggucucg gccgacagac uagccucggc ucgugcacuu guugucgaug 1980

cucacgcugu agggauagcc ucggccguag acgcggagaa uggucugugu cugguugucg 2040

gggucuucuc ggucuucgca ccggucgguc agauaguagc ggauguggua cucggacccu 2100

cggcucuugu cgcaccggau gucguuguug ucguagcggu aggggugguu gaagugguag 2160

ucgcacuggu ggcucuagga cgggcacucg uacugguucu ggucgcaccu gacgugguac 2220

auguagacgc cgcugucgug ucucacgucg uuggacgacg acgucaugcc gucgaaaacg 2280

ugggucgacu ugucucggga cuguccguaa cggcaccucg uccuguucuu guggguccuc 2340

cacaagcggg uccacuucgu cuagauguuc uggggggggu aguuccugaa gccgccgaag 2400

uugaagucgg ucuaggacgg acugggaucg uucgggucgu ucgccucgaa guagcuccug 2460

gacgacaagu uguuccacug ggaccggcua cggccgaagu aguucgucau gccgcuaaca 2520

gacccgcuau agcggcgguc ucuggacuag acacgggucu ucaaguugcc ggacuggcac 2580

gacggaggag acgacugucu acucuacuag cgggucaugu ggagacggga cgaccggccu 2640

ugguagugua gaccgaccug uaaaccucga ccucggcggg acgucuaggg aaagcgguac 2700

gucuaccgga uguccaaguu gccguagccg cacugggucu ugcacgacau gcucuugguc 2760

uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuguc ggacagaucg 2820

ugucggagac gagacccguu cgacguccua caccacuugg ucuuacgggu ccgggacuug 2880

ugggaccacu ucgucgacuc gucguuaaag ccgcgguagu cgucgcacga cuugcuguag 2940

gacucgucug accuguucca ccuccggcuc cacgucuagc ugucugacua guggccgucu 3000

gacgucucgg acgucuggau gcacuguguc gucgacuagu cucggcggcu cuagucucgg 3060

agacgguuag accgacggug guucuacucg cucacacacg acccggucuc guucucucac 3120

cugaagacgc cguuuccgau gguggacuac ucgaaggggg ucagacgagg aguaccgcac 3180

cacaaagacg ugcacuggau gcacggacgg guccucuucu ugaaguggug ucggggacgg 3240

uagacagugc uaccguuccg ggugaaggga ucucucccgc acaagcacuc guuaccgugg 3300

gugaccaagc acugggucgc cuugaagaug cucggggucu aguaguggug gcuguugugg 3360

aagcacucgc cguugacgcu gcaccacuag ccguagcacu uguuguggca caugcuggga 3420

gacgucgggc ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480

ucggggcugc accuagaccc gcuguagucg ccguaguugc ggucgcacca cuuguagguc 3540

uuccucuagc uggccgacuu gcuccaccgg uucuuggacu ugcucucgga cuagcuggac 3600

guccucgacc cguucaugcu cgucauguag uucaccggga ccauguagac cgacccgaaa 3660

uagcggccgg acuagcggua gcacuaccac ugguaguacg acacgacgua cuggucgacg 3720

acgucgacgg acuucccgac aacaucgaca ccgucgacga cguucaagcu gcuccugcua 3780

ucgcucggac acgacuuccc gcacuucgac gugaugugg 3819

<210> 34

<211> 3819

<212> RNA

<213> Artificial Sequence

<220>

<223> SDC-60 mRNA

<400> 34

uacaagcaca aggaccacga cgacggggac cacucgucgg ucacacacuu ggacuggugg 60

ucuugggucg acggagggcg gaugugguug ucgaaguggu ccccgcacau gauggggcug 120

uuccacaagu ccucgucgca cgacgugucg uggguccugg acaaggacgg gaagaagucg 180

uugcacugga ccaaggugcg guaggugcac ucgccguggu uaccgugguu cgccaagcug 240

uugggacacg acgggaaguu gcugccgcac augaagcggu cguggcucuu cucguuguag 300

uaggccccga ccuagaagcc guggugggac cugucguucu gggucucgga cgacuagcac 360

uuguugcggu gguugcacca cuaguuccac acgcucaagg ucaagacguu gcuggggaag 420

gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa ggcccacaug 480

ucgucgcggu uguugacgug gaagcucaug cacucggucg ggaaggacua ccuggaccuc 540

ccguucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600

aaguucuaga ugucguucgu guggggguag uuggaccacu cccuggacgg agucccgaaa 660

agacgggacc ucggagacca ccuggacggg uagccguagu uguagugguc caaggucugg 720

gacgaccggg acguguccuc gauggacugu ggaccgcuau cgagaagacc gaccugucgg 780

ccucgacgac ggaugaugca cccgauggac gucggggccu ggaaggacga cuucauguug 840

cucuugccgu gguaguggcu gcggcaccua acgcgggacc uaggagacuc gcucugguuc 900

acgugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu gaaggcccac 960

gucggguggc ucucguagca cuccaagggg uuguaguggu uggacacggg gaagccgcuc 1020

cacaaguugc gguggucuaa gcggucgcac augcggaccu uggccuucgc cuagucguug 1080

acgcaccggc ugaugucgca cgacauguug ucgcggucga agucguggaa guucacgaug 1140

ccgcacucgg ggugguucga cuugcuggac acgaaguggu ugcacaugcg gcugucgaag 1200

cacuaguccc cgcuacucca cucugucuag cggggaccgg ucuggccguu cuagcggcug 1260

auguugaugu ucgacgggcu gcugaagugg ccgacgcacu agcggaccuu gucguuguug 1320

gaccugucgu uccacccgcc guugauguug auggacaugg ccgacaaggc cuucucguug 1380

gacuucggga agcucgcccu guagucgugg cucuagaugg uccggccuuc guggggaacg 1440

uugccgcacc ucccgaaguu gacgaugaag ggggacgucu cgaugccgaa ggucggaugg 1500

uuaccgcacc cgauggucgg gaugucccac caccacgacu cgaaacucga cgacguacga 1560

ggacgguggc acacgccggg guucuucucg ugguuggacc acuucuuguu cacgcacuug 1620

aaguugaagu ugccggacug gccguggccg cacgacuggc ucucguuguu cuucaaggac 1680

gggaaggucg ucaagccguc ccuguagcgg cuaugguggc uacggcacuc ucugggaguc 1740

ugggaccucu aggaccugua guggggaacg ucgaagccgc cucacucgca cuagugugga 1800

ccgugguugu ggucguuggu ccaccggcac gacauggucc ugcacuugac guggcuccac 1860

ggacaccggu aagugcggcu agucgacugg ggguggaccu cucacauguc guggccgucg 1920

uugcacaagg ucuggucucg gccgacagac uagccgcggc ucgugcacuu guugucgaug 1980

cucacgcugu agggguagcc gcggccguag acacggucga uagucugggu cugguugucg 2040

ggauccuccc ggucuucgca ccggucgguc agauaguagc ggauguggua cucggacccg 2100

cggcucuugu cgcaccggau gucguuguug ucguagcggu aggggugguu gaagugguag 2160

ucgcacuggu ggcucuagga cgggcacucg uacugguucu ggucgcaccu gacgugguac 2220

auguagacgc cgcugucgug gcucacgucg uuggacgacg acgucaugcc gucgaagacg 2280

ugggucgacu ugucucggga cuguccguag cggcaccucg uccuguucuu guggguccuc 2340

cacaagcggg uccacuucgu cuagauguuc uggggggggu aguuccugaa gccgccgaag 2400

uugaagucgg ucuaggacgg acuggggucg uucgggucgu ucgccucgaa guagcuccug 2460

gacgacaagu uguuccacug ggaccggcug cggccgaagu aguucgucau gccgcugaca 2520

gacccgcugu agcggcgguc ucuggacuag acacgggucu ucaaguugcc ggacuggcac 2580

gacgggggag acgacuggcu acucuacuag cgggucaugu ggagacggga cgaccggccg 2640

ugguagugua gaccgaccug gaaaccucga ccucggcggg acgucuaggg aaagcgguac 2700

gucuaccgga uggccaaguu gccguagccg cacugggucu ugcacgacau gcucuugguc 2760

uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuguc ggacucgucg 2820

uggcggagac gagacccguu ugacguccug caccacuugg ucuugcgggu ccgggacuug 2880

ugggaccacu ucgucgacuc gucguugaag ccgcgguagu cgucgcacga cuugcuguag 2940

gacucguccg accuguucca ccuccggcuc cacgucuagc uguccgacua guggccgucu 3000

gacgucucgg acgucuggau gcacuggguc gucgacuagu cucggcggcu cuagucucgg 3060

agacgguuag accggcggug guucuacucg cucacacacg acccggucuc guucucccac 3120

cugaagacgc cguucccgau gguggacuac ucgaaggggg ucagacgggg aguaccgcac 3180

cacaaggacg ugcacuggau gcacggacgg guccucuucu ugaaguggug gcggggacgg 3240

uagacggugc uaccguuccg ggugaaggga ucucucccgc acaagcacuc guugccgugg 3300

gugaccaagc acugggucgc cuugaagaug cucggggucu aguaguggug gcuguugugg 3360

aagcacucgc cguugacgcu gcaccacuag ccguagcacu uguuguggca caugcugggg 3420

gacgucgggc ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480

ucggggcugc accuggaccc gcuguagucg ccguaguugc ggucgcacca cuuguagguc 3540

uuccucuagc uggccgacuu gcuccaccgg uucuuggacu ugcucucgga cuagcuggac 3600

guccucgacc cguucaugcu cgucauguag uucaccggga ccauguagac cgacccgaag 3660

uagcggccgg acuagcggua gcacuaccac ugguaguacg acacgacgua cuggucgacg 3720

acgucgacgg acuucccgac gacgucgaca ccgucgacaa cguucaagcu gcuccugcug 3780

ucgcucgggc acgacuuccc gcacuucgac gugaugugg 3819

<210> 35

<211> 957

<212> RNA

<213> Artificial Sequence

<220>

<223> MT2AE mRNA

<400> 35

uaccggcuaa gauuaccgug guaguggcac cuucucgacu ucuucgacga gcucguuacc 60

uuggaccacu agccuaaaga caaggacugg accuagacag acaacgucaa gcggaugcgg 120

uuggccuugu cuaaggacau guaguaguuu gacuagaagg acaccgacga caccggacac 180

ugggaccgga cgaagcacga ccggcggcac auggccuaau ugaccuagug gccuccguag 240

cgauagcggu accguacgga ccagccugaa uacaccgaca gaauaaagua gcggucgaag 300

ucugacaagc gaucuugguc uucguacacc aggaaauugg gacucuguuu guaggacgac 360

uugcacggag acgugccgug uuaggacugu gccggugacg accuuucgcu cgaccaguag 420

ccgcggcacu aggacgcccc gguagacgcg uaacggccug ugguggaccc gucuacgcug 480

uaguuccugg acggguuccu uuaguggcac cgguggucgu cuugugacag gaugauguuu 540

gacccgcgau cagucucuca ccggccgcug ucgccgaagc ggcgaauaag aucuaugucu 600

uagccguuga uguucgacuu auggcuagug ucgucgucgu cgcuguugua gcgggacgac 660

cacgucccgu cgccgcuccc gucuccuucg gacgacugua caccgcuaca ccuucucuug 720

gggccgggac gguacauguc gaaacacaga cuccuuuggc cgugggacua gcacuugucg 780

cacgacgaca aagaccggaa gcagcacaag gacgaccacu gugaccgaua ggacuggcgg 840

gacuccgaca cgcggaugac gacguuguag cacuuacaua gggaccacuu cggaaggaag 900

augcacaugu cggcccacuu cuuggaauua ucgagaucuc acgggcugga cgagcaa 957

<210> 36

<211> 960

<212> RNA

<213> Artificial Sequence

<220>

<223> MP2AE mRNA

<400> 36

uaccggcugu cguugccgug uuagugucac cuucucgacu ucuucgacga ccucgucacc 60

uuggaccacu aaccuaaaga aaaggagugg accuagacgg acgacgucaa gcggauacgg 120

uuggccuugu cuaaggacau guaguaguuc gacuagaagg acaccgacga caccgggcac 180

ugggaccgga caaaacacga ccggcggcac auggccuagu ugaccuagug gccgccuuag 240

cgauagcggu accggacgga ccacccggac uacaccgacu cgaugaagua gcggaggaaa 300

ucugacaagc ggucuugguc uucguacacc aggaaguugg gacucuguuu auaggacgag 360

uugcacgggg acgugccgug guaggacugg gccggagacg agcucucgcu cgaccacuag 420

ccgcggcacu aggacucucc gguggacucu uagcggccug ugguggaccc gucuacgcug 480

uaguuccugg acgguuuccu uuaguggcaa cgauguucgu cuugugacag gaugauguuc 540

gacccgcgaa gaguuucuca gcggccgcug ucgccgaagc gacgaauauc ggcgaugucc 600

uaaccuuuaa uguucgacuu guggcuagua agaagaucgu cgcuguugua gcgggacgac 660

caggucccgu cgccgcggug uuugaagucg gacgaauuug uccggccgcu acaccuucuc 720

uuggggccgg gacgguacau gucgaagcac aggcuccuuu ggccguggga cuagcacuug 780

ucgcacgacg acaaggaacg aaaacaccac aaggacgacc agugggaccg guaggacugg 840

cgggacucug acacacggau gacgacguug uagcacuuac acagagacca cuucggaucg 900

aagaugcaca ugucggccca cuuuuuggac uugagaucgg cccacggacu agacgaccac 960

<210> 37

<211> 798

<212> RNA

<213> Artificial Sequence

<220>

<223> SGS-RBD mRNA

<400> 37

uaccucuguc ugugugagga cgauacccau gacgacgaga cccaaggucc aagguggccu 60

cugacgggua aaccgcucca caaguugcgu ugggcgaagc ggucgcacau gcggaccuua 120

uccuucgccu agucguugac gcaccggcug auaucgcacg acauguugag gcggagaaag 180

ucguggaaau ucacgauacc gcacaggggg uguuucgacu uacuggacac gaaaugguug 240

cagaugcggc uaagaaagca cuaguccccg cugcuccacg cggucuagcg ggggccgguc 300

uguccguucu agcgucugau guuaauauuc gacggucugc uaaaguggcc gacgcacuag 360

cggaccuugu cguuguuaga ccuaagguuu cacccgccgu ugauguuaau agacauggcc 420

gacaaaucuu ucucguuaga cuucgggaag cucucccugu agagaugucu uuagaugguc 480

cggccgucgu ggggaacguu accgcaccuc ccgaaauuga caauaaaggg ugaggucagg 540

augccgaagg ucggguguuu gccgcacccg auagucggaa uggcgcacca ccacgacucg 600

aaacucgacg acgugcggau gggcaugcug cacggccuga ugcgguuacg acacccgguc 660

cugugcgucc uccaguagca ccacggugug aggaacggga aauuccacca ccacuagagu 720

cgguaggacc gggaccacca cgagugguag uagagggaau aguaggagua guacgaaacc 780

gucuucuucg gugcaauc 798

<210> 38

<211> 66

<212> DNA

<213> Artificial Sequence

<220>

<223> T2A DNA

<400> 38

ggcagcggcg agggcagagg aagcctgctg acatgtggcg atgtggaaga gaaccccggc 60

cctgcc 66

<210> 39

<211> 69

<212> DNA

<213> Artificial Sequence

<220>

<223> P2A DNA

<400> 39

ggcagcggcg ccacaaactt cagcctgctt aaacaggccg gcgatgtgga agagaacccc 60

ggccctgcc 69

<210> 40

<211> 66

<212> RNA

<213> Artificial Sequence

<220>

<223> T2A mRNA

<400> 40

ccgucgccgc ucccgucucc uucggacgac uguacaccgc uacaccuucu cuuggggccg 60

ggacgg 66

<210> 41

<211> 69

<212> RNA

<213> Artificial Sequence

<220>

<223> P2A mRNA

<400> 41

ccgucgccgc gguguuugaa gucggacgaa uuuguccggc cgcuacaccu ucucuugggg 60

ccgggacgg 69

<210> 42

<211> 22

<212> PRT

<213> thosea asigna virus 2A

<400> 42

Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu

1 5 10 15

Glu Asn Pro Gly Pro Ala

20

<210> 43

<211> 23

<212> PRT

<213> porcine teschovirus-1 2A

<400> 43

Gly Ser Gly Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val

1 5 10 15

Glu Glu Asn Pro Gly Pro Ala

20

73页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种提高杨树木材产量的方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!