用于改善生长的光合生物基因调节

文档序号:1366881 发布日期:2020-08-11 浏览:37次 >En<

阅读说明:本技术 用于改善生长的光合生物基因调节 (Photosynthetic organism gene regulation for improved growth ) 是由 I·阿加维 F·I·库兹米诺夫 R·R·拉达科维茨 J·H·维卢托 S·波茨 R·斯普雷菲 于 2018-12-27 设计创作,主要内容包括:提供了具有减少的叶绿素和增加的光合效率的突变体光合生物。所述突变体菌株具有突变或弱化的:叶绿体SRP54基因和SGI1基因;叶绿体SRP54基因和SGI2基因;公开了叶绿体SRP54基因、SGI1基因和SGI2基因。相对于野生型菌株,所述突变体光合生物展现出增加的生产力。还提供了具有突变或弱化的胞质SRP54基因的突变体光合生物。本文提供了使用在SRP54基因、SGI1基因、SGI2基因、SGI1/SRP54的组合以及SGI2基因和SRP54基因的组合中具有突变的菌株生产生物质和如脂质等其它产物的方法。还包含用于弱化或破坏SRP54基因、SGI1基因和SGI2基因的构建体和方法。(Mutant photosynthetic organisms having reduced chlorophyll and increased photosynthetic efficiency are provided. The mutant strain has a mutation or attenuation of: chloroplast SRP54 gene and SGI1 gene; chloroplast SRP54 gene and SGI2 gene; discloses chloroplast SRP54 gene, SGI1 gene and SGI2 gene. The mutant photosynthetic organisms exhibit increased productivity relative to the wild type strain. Also provided are mutant photosynthetic organisms having a mutated or attenuated cytoplasmic SRP54 gene. Provided herein are methods of producing biomass and other products such as lipids using strains having mutations in the SRP54 gene, the SGI1 gene, the SGI2 gene, the combination of SGI1/SRP54, and the combination of SGI2 gene and SRP54 gene. Also included are constructs and methods for attenuating or disrupting the SRP54 gene, SGI1 gene and SGI2 gene.)

用于改善生长的光合生物基因调节

相关申请的交叉引用

本申请根据35 U.S.C.§119(e)要求于2017年12月29日提交的美国序列号62/612,251和于2018年6月26日提交的美国序列号62/690,205的优先权的权益,所述申请的全部内容通过引用整体并入本文。

序列表的并入

所附序列表中的材料特此通过引用并入到本申请中。名称为SGI2140_2WO_Sequence_Listing.txt的所附序列表文本文件于2018年12月18日创建并且大小为419kb。此文件可以在使用Windows OS的计算机上使用Microsoft Word来访问。

背景技术

光合生物的生物质生产力的提高与从生物燃料到高价值产物的各种商业应用相关。用于增加生物质的总蛋白质含量的基因操作是非常令人期望的,但是这样做的策略在本领域中并不明显。

对光合生物进行工程化以增加光合效率,从而获得更高的生产力是植物和藻类生物学家的长期目标。US 2014/0220638和US 2016/030489(两者均通过引用并入本文)描述了用于获得叶绿素减少的藻类突变体的突变体筛选,所述藻类突变体在低光适应能力方面受损,也就是说,其即使在低光下仍能保持高光适应细胞的低叶绿素状态。US 2014/0220638描述了在光适应调控子LAR1、LAR2和LAR3基因中具有突变的藻类突变体,并且US2016/0304896公开了在叶绿体SRP54基因中具有突变的藻类突变体。

发明内容

本文公开了包括光合效率和生产力增加的调节基因的光合生物,其在光能自养条件下生产产物的用途,和生产此类光合生物的方法以及用于调节此类基因的核酸分子和构建体。

一方面,提供了突变体光合生物,其包括突变或弱化的对显著生长改善基因2(SGI2)进行编码的基因。

一方面,提供了突变体光合生物,其包括突变或弱化的对叶绿体信号识别蛋白54(cpSRP54)进行编码的基因和突变或弱化的显著生长改善基因2(SGI2)。

一方面,提供了突变体光合生物,其包括突变或弱化的对叶绿体信号识别蛋白54(cpSRP54)进行编码的基因和突变或弱化的显著生长改善基因1(SGI1)。

一方面,提供了突变体光合生物,其包括突变或弱化的对叶绿体信号识别蛋白54(cpSRP54)进行编码的基因、突变或弱化的显著生长改善基因1(SGI1)和突变或弱化的显著生长改善基因2(SGI2)。

一方面,提供了突变体光合生物,其包括突变或弱化的对胞质信号识别蛋白54(cytoSRP54)进行编码的基因和突变或弱化的显著生长改善基因2(SGI2)。

一方面,提供了突变体光合生物,其包括突变或弱化的对胞质信号识别蛋白54(cytoSRP54)进行编码的基因和突变或弱化的显著生长改善基因1(SGI1)。

一方面,提供了突变体光合生物,其包括突变或弱化的对胞质信号识别蛋白54(cytoSRP54)进行编码的基因、突变或弱化的显著生长改善基因1(SGI1)和突变或弱化的显著生长改善基因2(SGI2)。

一方面,提供了包括突变体光合生物的生物质,其中所述突变体光合生物包括突变或弱化的对叶绿体信号识别蛋白54(cpSRP54)进行编码的基因和突变或弱化的显著生长改善基因1(SGI1)和/或突变或弱化的显著生长改善基因2(SGI2)。

一方面,提供了生产生物产物的方法。所述方法包含培养突变体光合生物,其中所述突变体光合生物包括突变或弱化的对叶绿体信号识别蛋白54(cpSRP54)进行编码的基因和突变或弱化的显著生长改善基因1(SGI1)和/或突变或弱化的显著生长改善基因2(SGI2);以及从所述培养物中分离出至少一种产物。

一方面,提供了将CRISPR基因的单拷贝插入到微生物的所选基因座中的方法。在一些实施例中,所述CRISPR基因被密码子优化以用于在微生物中表达。在一些实施例中,所述所插入的CRISPR基因包括多个异源内含子。在一些实施例中,所述异源内含子的数量可以是至少5个、6个、7个、8个、9个、10个、11个、12个、13个、14个、15个、16个、17个、18个、19个、20个、21个、22个、23个、24个、25个、26个、27个、28个、29个、30个、35个、40个或更多。所述CRISPR基因的非限制性实例包含Cas9和Cpf1。在一些实施例中,所述CRISPR基因可以可操作地连接到微生物的天然启动子。在一些实施例中,所述启动子是诱导型的。在一些实施例中,所述CRISPR基因可以可操作地连接到微生物的异源启动子。

在一些实施例中,生物产物是脂质、蛋白质、肽、一个或多个氨基酸、氨基酸、一个或多个核苷酸、维生素、辅因子、激素、抗氧化剂或颜料或着色剂。在一些实施例中,所述生物产物是生物质。在一些实施例中,突变体光合生物是藻类,并且所述生物质是藻类生物质。

在一些实施例中,所述突变体光合生物被工程化成包含对参与所述脂质的生产的多肽进行编码的至少一种外源基因。在一些实施例中,所述突变体光合生物是光养培养的。在一些实施例中,所述突变体光合生物是藻类,并且所述藻类是在池塘或水沟中进行培养。

一方面,提供了用于同源重组的核酸分子构建体,所述核酸分子构建体包括来自或邻近对SGI2蛋白进行编码的天然存在的光合生物基因的核苷酸序列,其中在所述基因的突变或弱化之前,所述SGI2蛋白包括与选自由以下组成的组的氨基酸序列具有至少55%同一性的氨基酸序列:SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ IDNO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ IDNO:49、SEQ ID NO:50、SEQ ID NO:51、SEQ ID NO:52、SEQ ID NO:53、SEQ ID NO:54、SEQ IDNO:55和SEQ ID NO:56。

一方面,提供了用于同源重组的多个核酸分子构建体,所述核酸分子构建体包括来自或邻近对cpSRP54蛋白进行编码的天然存在的光合生物基因和对SGI1蛋白进行编码的光合生物基因的核苷酸序列,其中在所述基因的突变或弱化之前,所述cpSRP54蛋白包括与SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ ID NO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:84和SEQ IDNO:85具有至少55%同一性的氨基酸序列,并且其中在所述SGI1基因的突变或弱化之前,所述SGI1基因对具有以下氨基酸序列的多肽进行编码,所述氨基酸祖烈包括与选自由以下组成的组的氨基酸序列具有至少55%同一性的氨基酸序列:SEQ ID NO:3、SEQ ID NO:9、SEQID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ ID NO:21、SEQID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQID NO:28、SEQ ID NO:29、SEQ ID NO:30、SEQ ID NO:31、SEQ ID NO:34、SEQ ID NO:35、SEQID NO:36、SEQ ID NO:37、SEQ ID NO:38和SEQ ID NO:39。

一方面,提供了用于同源重组的多个核酸分子构建体,所述核酸分子构建体包括来自或邻近对cpSRP54蛋白进行编码的天然存在的光合生物基因和对SGI2蛋白进行编码的光合生物基因的核苷酸序列,其中在所述基因的突变或弱化之前,所述cpSRP54蛋白包括与SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ ID NO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:84或SEQ IDNo:85具有至少55%同一性的氨基酸序列,并且其中在所述基因的突变或弱化之前,所述SGI2蛋白包括与SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ IDNO:49、SEQ ID NO:50、SEQ ID NO:51、SEQ ID NO:52、SEQ ID NO:53、SEQ ID NO:54、SEQ IDNO:55或SEQ ID NO:56具有至少55%同一性的氨基酸序列。

一方面,提供了用于表达反义RNA、shRNA、微RNA或核酶的核酸分子构建体,所述核酸分子构建体包括与对SGI2蛋白进行编码的天然存在的光合生物基因的至少一部分互补的核苷酸序列,其中在所述基因的突变或弱化之前,所述SGI2蛋白包括与SEQ ID NO:5、SEQID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQ ID NO:50、SEQ ID NO:51、SEQID NO:52、SEQ ID NO:53、SEQ ID NO:54、SEQ ID NO:55或SEQ ID NO:56具有至少55%同一性的氨基酸序列。

一方面,提供了用于表达反义RNA、shRNA、微RNA或核酶的多个核酸分子构建体,所述核酸分子构建体包括与对cpSRP54蛋白进行编码的天然存在的光合生物基因的至少一部分和对SGI1蛋白进行编码的光合生物基因互补的核苷酸序列,其中在所述基因的突变或弱化之前,所述cpSRP54蛋白包括与SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ ID NO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ IDNO:83、SEQ ID NO:84或SEQ ID NO:85具有至少55%同一性的氨基酸序列,并且其中在所述SGI1基因的突变或弱化之前,所述SGI1蛋白包括与SEQ ID NO:3、SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ IDNO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ ID NO:21、SEQ IDNO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQ IDNO:28、SEQ ID NO:29、SEQ ID NO:30、SEQ ID NO:31、SEQ ID NO:34、SEQ ID NO:35、SEQ IDNO:36、SEQ ID NO:37、SEQ ID NO:38或SEQ ID NO:39具有至少55%同一性的氨基酸序列。

在一些实施例中,所述构建体包括cpSRP54、SGI1、SGI2或两个或更多个基因的组合的5'UTR的至少一部分、cpSRP54、SGI1、SGI2或两个或更多个基因的组合的启动子区的至少一部分和/或cpSRP54、SGI1、SGI2或两个或更多个基因的组合的3'UTR的至少一部分。在一些实例中,所述构建体可以是RNAi、核酶或反义构建体,并且可以包含在有义或反义朝向上从cpSRP54、SGI1、SGI2或所述基因中的两个或更多个的组合的转录区开始的序列。在进一步的实例中,一种构建体可以被设计用于引导RNA的体外或体内表达,所述引导RNA被设计成靶向cpSRP54、SGI1、SGI2或两个或更多个基因的组合,并且可以包含与任何基因的一部分同源的序列,包含例如基因的内含子、5'UTR、启动子区和/或3'UTR。在又进一步的实例中,用于弱化对cpSRP54、SGI1或SGI2多肽进行编码的基因的表达的构建体可以是引导RNA或反义寡核苷酸,其中所述序列与在反义朝向上的cpSRP54、SGI1、SGI2或两个或更多个基因的组合的转录区同源。

一方面,提供了用于表达反义RNA、shRNA、微RNA或核酶的多个核酸分子构建体,所述核酸分子构建体包括与对cpSRP54蛋白进行编码的天然存在的光合生物基因的至少一部分和对SGI2蛋白进行编码的光合生物基因互补的核苷酸序列,其中在所述基因的突变或弱化之前,所述cpSRP54蛋白编码包括与SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ IDNO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ IDNO:83、SEQ ID NO:84或SEQ ID No:85具有至少55%同一性的氨基酸序列的蛋白质,并且其中在所述基因的突变或弱化之前,所述SGI2蛋白编码包括与SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQ ID NO:50、SEQ ID NO:51、SEQ IDNO:52、SEQ ID NO:53、SEQ ID NO:54、SEQ ID NO:55或SEQ ID NO:56具有至少55%同一性的氨基酸序列的蛋白质。

一方面,提供了对引导RNA进行编码的多个核酸分子,其中所述引导RNA包括天然存在的光合生物基因SGI2的至少一部分,其中在所述基因的突变或弱化之前,SGI2基因编码包括与SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQID NO:50、SEQ ID NO:51、SEQ ID NO:52、SEQ ID NO:53、SEQ ID NO:54、SEQ ID NO:55或SEQ ID NO:56具有至少55%同一性的氨基酸序列的蛋白质。

一方面,提供了对至少两个引导RNA进行编码的多个核酸分子,其中所述引导RNA包括对cpSRP54进行编码的天然存在的光合生物基因的至少一部分和对SGI1进行编码的光合生物基因,其中在所述基因的突变或弱化之前,所述cpSRP54编码包括与SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ ID NO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:84或SEQ ID NO:85具有至少55%同一性的氨基酸序列的蛋白质,并且其中在所述SGI1基因的突变或弱化之前,所述SGI1基因包括与SEQ ID NO:3、SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ ID NO:21、SEQ ID NO:22、SEQ ID NO:23、SEQ IDNO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQ ID NO:28、SEQ ID NO:29、SEQ IDNO:30、SEQ ID NO:31、SEQ ID NO:34、SEQ ID NO:35、SEQ ID NO:36、SEQ ID NO:37、SEQ IDNO:38或SEQ ID NO:39具有至少55%同一性的氨基酸序列。

一方面,提供了对至少两个引导RNA进行编码的多个核酸分子,其中所述引导RNA包括天然存在的光合生物cpSRP54基因的至少一部分和光合生物基因SGI2基因,其中在所述基因的突变或弱化之前,所述cpSRP54基因编码包括与SEQ ID NO:68、SEQ ID NO:75、SEQID NO:76、SEQ ID NO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQID NO:82、SEQ ID NO:83、SEQ ID NO:84或SEQ ID No:85具有至少55%同一性的氨基酸序列的蛋白质,并且其中在所述基因的突变或弱化之前,所述SGI2基因包括与SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQ ID NO:50、SEQ IDNO:51、SEQ ID NO:52、SEQ ID NO:53、SEQ ID NO:54、SEQ ID NO:55或SEQ ID NO:56具有至少的氨基酸序列。

一方面,提供了增加光合生物的生物质的方法,所述方法包括调节SGI2基因。

一方面,提供了一种增加光合生物的生物质的方法,所述方法包括调节叶绿体信号识别蛋白54(cpSRP54)和显著生长改善基因1(SGI1),其中在所述基因的突变或弱化之前,所述cpSRP54基因编码包括与SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ ID NO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ IDNO:83、SEQ ID NO:84或SEQ ID NO:85具有至少55%同一性的氨基酸序列的蛋白质,并且其中在所述SGI1基因的突变或弱化之前,所述SGI1基因包括与SEQ ID NO:3、SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ IDNO:21、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ IDNO:27、SEQ ID NO:28、SEQ ID NO:29、SEQ ID NO:30、SEQ ID NO:31、SEQ ID NO:34、SEQ IDNO:35、SEQ ID NO:36、SEQ ID NO:37、SEQ ID NO:38或SEQ ID NO:39具有至少的氨基酸序列。

一方面,提供了一种增加光合生物的生物质的方法,所述方法包括调节叶绿体信号识别蛋白54基因(cpSRP54)和显著生长改善基因2(SGI2),其中在所述基因的突变或弱化之前,所述cpSRP54基因编码包括与SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ IDNO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ IDNO:83、SEQ ID NO:84或SEQ ID No:85具有至少55%同一性的氨基酸序列的蛋白质,并且其中在所述基因的突变或弱化之前,所述SGI2基因包括与SEQ ID NO:5、SEQ ID NO:40、SEQID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQ ID NO:50、SEQ ID NO:51、SEQ ID NO:52、SEQID NO:53、SEQ ID NO:54、SEQ ID NO:55或SEQ ID NO:56具有至少的氨基酸序列。

一方面,提供了一种增加光合生物的生物质的方法,所述方法包括调节胞质信号识别蛋白54(cytoSRP54)和显著生长改善基因2(SGI2),其中在所述基因的突变或弱化之前,所述SGI2基因编码包括与SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQ ID NO:50、SEQ ID NO:51、SEQ ID NO:52、SEQ ID NO:53、SEQ IDNO:54、SEQ ID NO:55或SEQ ID NO:56具有至少的氨基酸序列的蛋白质。

在一些实施例中,所述突变体光合生物的培养物展现出比同一物种的对照光合生物的培养物更大的生物质生产力。在一些实施例中,所述突变体光合生物在光能自养培养物中显示出更大的生物质生产力。在一些实施例中,在连续光条件下,所述突变体光合生物展现出比同一物种的对照光合生物的培养物更大的生物质生产力。在一些实施例中,在昼夜循环条件下,所述突变体光合生物展现出比同一物种的对照光合生物的培养物更大的生物质生产力。在一些实施例中,在昼夜循环条件下,所述突变体光合生物展现了比同一物种的对照光合生物的培养物更大的生物质生产力,其中光概况模拟自然日光概况。

在一些实施例中,增加光合生物的生物质包括总有机碳的增加。在一些实施例中,增加光合生物的生物质包括总脂质含量的增加。在一些实施例中,增加光合生物的生物质包括总氮含量的增加。

在一些实施例中,相对于同一物种的对照光合生物,所述突变体光合生物展现出叶绿素在低光条件下的减少和在高于100、125、150、200或250μE m-2s-1的所有生理相关辐照度下,光系统II中的光化学最大量子产率(Fv/FM)更高。在一些实施例中,相对于同一物种的对照光合生物,叶绿素的减少为至少减少20%、30%、40%、50%、60%或70%。在一些实施例中,相对于同一物种的对照光合生物,所述突变体光合生物展现出在高于125、150、200或250μE m-2s-1的所有生理相关辐照度下,非光化学淬灭(NPQ)更低。

在一些实施例中,所述突变体光合生物在每个叶绿素的基础上展现出同一物种的对照光合生物更高的固碳率。在一些实施例中,所述固碳率比同一物种的对照光合生物高至少50%、60%、70%、80%、90%或100%。

在一些实施例中,所述突变体光合生物展现出比同一物种的对照光合生物至少100%、150%、200%、300%、400%或更高的析氧率每毫克叶绿素。在一些实施例中,所述突变体光合生物展现出总有机碳(TOC)的μ至少100%、150%、200%、300%、400%或更高的析氧率每克。

在一些实施例中,所述突变体光合生物的培养物展现出比同一物种的对照光合生物的培养物更大的脂质生产力。在一些实施例中,所述突变体光合生物在光能自养培养物中展现出更大的脂质生产力。在一些实施例中,所述突变体光合生物是藻类。

在一些实施例中,所述突变体光合生物通过调节生物的SGI2基因生成。在一些实施例中,所述突变体光合生物通过调节生物的cpSRP54基因和SGI1或SGI2基因生成。在一些实施例中,调节所述基因包括UV辐照、γ辐照或化学诱变。在一些实施例中,调节所述基因包括所述cpSRP54基因、SGI1基因、SGI2基因或所述基因的组合中的碱基取代突变、插入诱变、基因置换、RNAi、反义RNA、大范围核酸酶基因组工程化、一种或多种核酶和/或CRISPR/Cas系统。

在一些实施例中,在所述基因的突变或弱化之前,所述突变体光合生物包括对具有以下氨基酸序列的蛋白质进行编码的cpSRP54基因,所述氨基酸序列与选自由以下组成的组的氨基酸序列具有至少50%、55%、60%、65%、70%、75%、80%、85%、90%或95%同一性的氨基酸序列:SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ ID NO:77、SEQ IDNO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ IDNO:84或SEQ ID NO:85。在一些实施例中,在所述基因的突变或弱化之前,所述突变体光合生物包括对具有以下氨基酸序列的蛋白质进行编码的cpSRP54基因,所述氨基酸序列与至少30个、35个、40个、45个、50个、60个、70个、80个、100个、150个、200个、250个、300个氨基酸或与选自由以下组成的组的氨基酸序列的全长具有至少50%、55%、60%、65%、70%、75%、80%、85%、90%或95%同一性:SEQ ID NO:68、SEQ ID NO:75、SEQ ID NO:76、SEQ IDNO:77、SEQ ID NO:78、SEQ ID NO:79、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ IDNO:83、SEQ ID NO:84或SEQ ID NO:85。

在一些实施例中,所述突变体光合生物包括在对cpSRP54 GTP酶结构域的前169个氨基酸进行编码的序列之外发生的cpSRP54基因中的突变。在一些实施例中,所述对SRP54蛋白进行编码的cpSRP54基因中的所述突变发生在对所述cpSRP54 GTP酶结构域进行编码的所述序列之外。在一些实施例中,所述cpSRP54基因中的突变不包含所述cpSRP54 GTP酶结构域中的基因破坏性突变。

在一些实施例中,在所述基因的突变或弱化之前,所述突变体光合生物的所述SGI2基因编码具有以下氨基酸序列的蛋白质,所述氨基酸序列与SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ IDNO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQ ID NO:50、SEQ ID NO:51、SEQ IDNO:52、SEQ ID NO:53、SEQ ID NO:54、SEQ ID NO:55或SEQ ID NO:56的氨基酸序列至少50%、65%、70%、75%、80%、85%、90%、95%同一性。在一些实施例中,在所述基因的突变或弱化之前,所述突变体光合生物的所述SGI2基因编码具有以下氨基酸序列的蛋白质,所述氨基酸序列与至少30个、35个、40个、45个、50个、60个、70个、80个、100个、150个、200个、250个、300个氨基酸或与SEQ ID NO:5、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQID NO:49、SEQ ID NO:50、SEQ ID NO:51、SEQ ID NO:52、SEQ ID NO:53、SEQ ID NO:54、SEQID NO:55或SEQ ID NO:56的氨基酸序列的全长至少50%、65%、70%、75%、80%、85%、90%、95%同一性。

在一些实施例中,在所述SGI1基因的突变或弱化之前,所述突变体光合生物的所述SGI1基因编码具有以下氨基酸序列的蛋白质,所述氨基酸序列与SEQ ID NO:3、SEQ IDNO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ IDNO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ IDNO:21、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ IDNO:27、SEQ ID NO:28、SEQ ID NO:29、SEQ ID NO:30、SEQ ID NO:31、SEQ ID NO:34、SEQ IDNO:35、SEQ ID NO:36、SEQ ID NO:37、SEQ ID NO:38或SEQ ID NO:39的氨基酸序列至少50%、65%、70%、75%、80%、85%、90%、95%同一性。在一些实施例中,在所述SGI1基因的突变或弱化之前,所述突变体光合生物的所述SGI1基因编码具有以下氨基酸序列的蛋白质,所述氨基酸序列与至少30个、35个、40个、45个、50个、60个、70个、80个、100个、150个、200个、250个、300个氨基酸或与SEQ ID NO:3、SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ ID NO:21、SEQ ID NO:22、SEQ IDNO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQ ID NO:28、SEQ IDNO:29、SEQ ID NO:30、SEQ ID NO:31、SEQ ID NO:34、SEQ ID NO:35、SEQ ID NO:36、SEQ IDNO:37、SEQ ID NO:38或SEQ ID NO:39的氨基酸序列的全长至少50%、65%、70%、75%、80%、85%、90%、95%同一性。

在上述方面的一些实施例中,所述光合生物是多倍体的,例如,二倍体、三倍体、四倍体。在一些实施例中,基因:cpSRP54、SGI1或SGI2的一个或多个拷贝是突变或弱化的,留下所述基因的其它拷贝是未改变的或未弱化的,以生成突变体光合生物。在一些实施例中,因此所述突变体光合生物生成,相对于同一物种的对照光合生物,所述突变体光合生物展现出叶绿素在低光条件下的减少和在高于100、125、150、200或250μE m-2s-1的所有生理相关辐照度下,光系统II中的光化学最大量子产率(Fv/FM)更高。在一些实施例中,因此所述突变体光合生物生成,所述突变体光合生物展现出比同一物种的对照光合生物更大的生物质生产力。在一些实施例中,因此所述突变体光合生物生成,所述突变体光合生物展现出比同一物种的对照光合生物更大的脂质生产力。

在上述方面的一些实施例中,所述突变体光合生物是藻类。在一些实施例中,所述藻类属于属:曲壳藻属(Achnanthes)、茧形藻属(Amphiprora)、双眉藻属(Amphora)、纤维藻属(Ankistrodesmus)、星胞藻属(Asteromonas)、黄金色藻属(Boekelovia)、波利氏藻属(Bolidomonas)、包特氏菌属(Borodinella)、气球藻属(Botrydium)、葡萄藻属(Botryococcus)、荀球藻属(Bracteococcus)、角毛藻属(Chaetoceros)、四鞭藻属(Carteria)、衣藻属(Chlamydomonas)、绿球藻属(Chlorococcum)、绿梭藻属(Chlorogonium)、小球藻属(Chlorella)、蓝隐藻属(Chroomonas)、金球藻属(Chrysosphaera)、球钙板藻属(Cricosphaera)、隐甲藻属(Crypthecodinium)、隐藻属(Cryptomonas)、小环藻属(Cyclotella)、杜氏藻属(Dunaliella)、椭球藻属(Ellipsoidon)、球石藻属(Emiliania)、独球藻属(Eremosphaera)、衣迪斯藻属(Ernodesmius)、裸藻属(Euglena)、真眼点藻属(Eustigmatos)、伏氏藻属(Franceia)、脆杆藻属(Fragilaria)、丽丝藻属(Gloeothamnion)、红球藻属(Haematococcus)、嗜盐古菌(Halocafeteria)、异弯藻属(Heterosigma)、膜胞藻属(Hymenomonas)、等鞭金藻属(Isochrysis)、鳞孔藻属(Lepocinclis)、微芒藻属(Micractinium)、蒜头藻属(Monodus)、单针藻属(Monoraphidium)、微球藻属(Nannochloris)、微拟球藻属(Nannochloropsis)、舟形藻属(Navicula)、新绿藻属(Neochloris)、肾鞭藻属(Nephrochloris)、肾爿藻属(Nephroselmis)、菱形藻属(Nitzschia)、棕鞭藻属(Ochromonas)、鞘藻属(Oedogonium)、卵囊藻属(Oocystis)、蚝球藻属(Ostreococcus)、巴夫藻属(Pavlova)、拟小球藻属(Parachlorella)、帕氏藻属(Pascheria)、普莱格门(Pelagomonas)、褐指藻属(Phaeodactylum)、噬菌体属(Phagus)、微绿藻属(Picochlorum)、扁藻属(Platymonas)、颗石藻属(Pleurochrysis)、宽球藻属(Pleurococcus)、原壁菌属(Prototheca)、假小球藻属(Pseudochlorella)、拟新绿藻属(Pseudoneochloris)、假十字趾藻属(Pseudostaurastrum)、塔胞藻属(Pyramimonas)、葡串藻属(Pyrobotrys)、栅列藻属(Scenedesmus)、骨条藻属(Skeletonema)、螺旋藻属(Spyrogyra)、裂丝藻属(Stichococcus)、四爿藻属(Tetraselmis)、海链藻属(Thalassiosira)、黄丝藻属(Tribonema)、无隔藻属(Vaucheria)、鲜绿球藻属(Viridiella)魏氏藻属(Vischeria)和团藻属(Volvox)。在一些实施例中,所述突变体光合生物是绿藻门或轮藻门的成员,并且可以是例如,绿藻门类中任何一个的成员:绿藻纲(Chlorophyceae)、共球藻纲(Trebouxiophyceae)、四爿藻纲(Chlorodendrophyceae)、石莼纲(Ulvophyceae)、平藻纲(Pedinophyceae)或绿枝藻纲(Prasinophyceae)。例如,所述藻类突变体可以是属于以下的物种:绿藻纲、共球藻纲或四爿藻纲。在一些实施例中,所述突变体藻类细胞是绿藻门藻类细胞,并且可以是共球藻纲类的绿藻门藻类细胞,例如,基因的物种的藻类细胞,如葡萄藻属、小球藻属、产油微藻属(Auxenochlorella)、海南橡藻属(Heveochlorella)、海水小球藻属(Marinichlorella)、拟小球藻属、假小球藻属、四球藻属(Tetrachlorella)、独球藻属、伏氏藻属、微芒藻属、微球藻属、卵囊藻属、微绿藻属、或原壁菌属。在一些实施例中,所述突变体藻类可以是属于以下的物种:产油微藻属、小球藻属、海南橡藻属、海水小球藻属、拟小球藻属、假小球藻属或四球藻属。

在一些实施例中,所述突变体光合微生物是蓝藻菌。在一些实施例中,所述蓝藻菌是蓝细菌、阿格门氏藻(Agmenellum)、项圈藻属、拟项圈藻属、组囊藻属、丝囊藻属、节旋藻、球藻属、博氏藻属、眉藻属、管孢藻属、拟绿胶蓝细菌属、拟甲色球藻属、色球藻属、发毛针藻属、蓝藻菌、蓝菌属、蓝囊胞菌属、螺菌属、蓝丝菌属、拟柱胞藻属、筒孢藻属、蓝纤维藻属、小皮果蓝细菌属、侧生藻属、Fremyella、蓝细菌属、线状蓝细菌属、粘杆菌属、蓝绿藻属、粘杆藻属、盐螺旋藻属、形态属、鞘丝藻属、丝藻属、林氏藻属、鞘藻属、微胞藻属、粘囊藻属、节球藻属、念珠藻属、拟珠藻属、颤藻属、席藻属、浮丝藻属、厚皮藻属、原绿球藻、原绿藻属、原绿发藻属、伪项圈藻属、胶须藻属、裂须藻属、伪枝藻属、螺旋藻属、斯塔尼尔氏菌属、斯塔尔氏蓝细菌属、真枝藻属、束藻属、聚球藻属、集胞藻属、热集胞藻属(thermosynechocystis)、单歧藻属、束毛藻属、常丝藻属或异球藻属物种。

在一些实施例中,所述突变体光合微生物是植物。植物的非限制性实例包含单子叶植物和双子叶植物,如包含谷类作物(例如,小麦、玉米、水稻、小米、大麦)的作物、水果作物(例如,番茄、苹果、梨、草莓、橙子)、饲料作物(例如,苜蓿)、根蔬菜作物(例如,胡萝卜、马铃薯、甜菜、山药)、多叶蔬菜作物(例如,莴苣、菠菜);开花植物(例如,矮牵牛、玫瑰菊花)、针叶树和松树(例如,松杉、云杉)、用于植物修复的植物(例如,积聚重金属的植物);油料作物(例如,向日葵、油菜籽)和用于实验目的的植物(例如,拟南芥)。

突变的双子叶植物的非限制性实例包含属于以下目的植物:木兰目、Miciales、樟目、胡椒目、马兜铃目、睡莲目、毛茛目、罂粟目、瓶子草科、昆栏树目、金缕梅目、杜仲目、莱脱纳目、杨梅目、壳斗目、木麻黄目、石竹目、肉穗果目、蓼目、蓝雪目、五桠果目、山茶目、锦葵目、荨麻目、玉蕊目、紫堇目、杨柳目、白花菜目、欧石楠目、岩梅目、柿树目、报春花目、蔷薇目、豆目、河苔草目、小二仙草目、桃金娘目、山茱萸目、山龙眼目、San tales、大花草目、卫矛目、大戟目、鼠李目、无患子目、胡桃目、牻牛儿苗目、远志目、伞形目、龙胆目、花葱目、唇形目、车前草目、玄参目、桔梗目、茜草目、川绿断目和菊目。

突变的单子叶植物的非限制性实例包含属于以下目的植物:泽泻目、水鳖目、茨藻目、霉草目、鸭跖草目、谷精草目、帚灯草目、禾本目、灯芯草目、莎草科、香蒲目、凤梨目、姜目、槟榔目、环花目、露兜树目、天南星目、百合目、和兰目,或属于裸子植物的植物,例如,属于以下顺序的那些:松目、银杏目、苏铁目、南洋杉目、柏目和麻黄目。

在一些实施例中,所述突变的植物可以是沙生芥(Arabidopsis arenicola)、沙芥(Arabidopsis arenosa)、Arabidopsis cebennensis、Arabidopsis croatica、圆叶拟南芥(Arabidopsis halleri)、琴叶拟南芥(Arabidopsis lyrata)、Arabidopsis neglecta、Arabidopsis pedemontana、Arabidopsis suecica)、拟南芥(Arabidopsis thaliana)、玉米(Zea mays)、稻(Oryza sativa)、小麦(Triticum aestivum)、马铃薯(Solanumtuberosum)、洋葱(Allium cepa)、大蒜(Allium sativum)、大豆(Glycine max)、番茄(Solanum lycopersicum)、陆地棉(Gossypium hirsutum)、草本棉(Gossypiumherbaceum)、木本棉(Gossypium arboreum)、毛棉(Gossypium tomentosum)、黑芥(Brassica nigra)或芸薹属(Brassica sp.)。

在一些实施例中,植物中的SRP54、SGI1、SGI2或一种或多种基因的组合的调节可以是组织特异性的。在一些实施例中,所述植物组织可以是叶、茎或根。在一些实施例中,可以通过调节基因的组织特异性非编码区,例如,启动子、增强子、内含子、3'-或5'-非翻译区来实现对组织特异性基因的调节。在一些实施例中,所述植物中的SRP54、SGI1、SGI2或一种或多种基因的组合的调节可以在植物的不同发育阶段进行。

当结合附图阅读本发明的以下详细描述时,本发明的这些和其它目的和特征将变得更充分明显。

附图说明

图1A-1B.图1A示出了SGI1基因的示意图。指示了被设计成破坏SGI1基因(CRISPR靶)的gRNA的推定位置。图1B示出了SPR54基因的示意图。指示了被设计成破坏SPR54基因(CRISPR靶)的gRNA的推定位置。

图2A-2C.图2A示出了SGI1基因的示意图。指示了被设计成破坏SGI1基因(CRISPR靶)的gRNA的推定位置。图2B示出了SGI1蛋白的示意图。图2C示出了SPR54基因的示意图。指示了被设计成破坏SPR54基因(CRISPR靶)的gRNA的推定位置。

图3.图3示出了拟小球藻属(Parachorella sp.)SGI2蛋白的示例性结构域架构分析。

图4.图4示出了卵囊藻属(Oocystis sp.)SGI2蛋白的示例性结构域架构分析。

图5.图5示出了四爿藻属(Tetraselmis sp)SGI2蛋白的示例性结构域架构分析。

图6.图6示出了拟南芥(Arabidopsis thaliana)SGI2蛋白的示例性结构域架构分析。

图7.图7示出了拟南芥SGI2蛋白的示例性结构域架构分析。

图8.图8示出了拟南芥SGI2蛋白的示例性结构域架构分析。

图9.图9示出了拟南芥SGI2蛋白的示例性结构域架构分析。

图10A-10B.图10A示出了含有密码子优化的Cre基因的DNA盒的示意图,所述Cre基因侧翼为亚硝酸盐还原酶启动子和终止子。图10B示出了包括bleR和GFP序列的DNA盒的示意图。

图11.图11示出了拟小球藻属野生型菌株、SRP54敲除菌株、SGI2敲除菌株以及SGI2和SRP54的双敲除菌株的生产力测定的结果。

图12A-12B.图12A示出了拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR24183)、SGI1/SRP54双敲除突变体(STR24538和STR24550)的半连续区域TOC生产力测定的结果。图12B示出了拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR24183)、SGI1/SRP54双敲除突变体(STR24538和STR24550)的分批TOC生产力测定的结果。

图13A-13B.图13A示出了指示拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR00012)、SGI2/SRP54双敲除突变体(STR25761)和SGI1/SGI2/SRP54三重敲除突变体(STR25761和STR25762)的半连续区域TOC生产力的测定的结果。图13B示出了指示拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR00012)、SGI2/SRP54双敲除突变体(STR25761)和SGI1/SGI2/SRP54三重敲除突变体(STR25761和STR25762)的分批TOC生产力的测定的结果。

图14.图14示出了拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR24183)、SGI1/SRP54双敲除突变体(STR24538和STR24540)的分批FAME生产力测定的结果。

图15.图15示出了拟小球藻属野生型菌株(STR00010)、SGI1敲除突变体(STR00012)、SGI2/SRP54双敲除突变体(STR00516)和SGI1/SGI2/SRP54三重敲除突变体(STR25761和STR25762)的分批FAME生产力测定的结果。

图16A-16B.图16A示出了用于敲除拟小球藻属SPR54的选择盒的示意图。图16B示出了用于敲除拟小球藻属SGI2的选择盒的示意图。

图17.图17示出了包括Cas9、GFP、BleR、Cre基因和lox位点的重组pCC1BAC载体的示意图。

具体实施方式

本申请的发明人令人惊讶且出乎意料地发现,调节光合生物中的SGI1和SGI2基因会导致叶绿素在低光条件下的减少,以及在所有生理相关的辐照度下,光系统II中的光化学最大量子产率(Fv/FM)更高。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物展现出在所有生理相关的辐照度下,非光化学淬灭(NPQ)较低。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物展现出比同一物种的对照光合生物增加的生物质。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个叶绿素的基础上展现出更高的固碳率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个TOC的基础上展现出比同一物种的对照光合生物更高的固碳率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个mg叶绿素上展现出比同一物种的对照光合生物更高的析氧率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个TOC的基础上展现出比同一物种的对照光合生物更高的析氧率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物展现出比同一物种的对照光合生物更高的脂质生产力。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在光能自养培养物中展现出更大的脂质生产力。

本申请的发明人还令人惊讶地发现,在光合生物中调节SGI1或SGI2基因以及调节SRP54基因的协同效应(synergistic effect)。在一些实施例中,与仅调节SGI1或SGI2基因的突变体光合生物相比,在调节SRP54和SGI1或SGI2基因的突变体光合生物中,叶绿素进一步减少、生物质增加更多、在每个叶绿素的基础上的更高碳固定、在每个TOC的基础上的更高碳固定、更高脂质生产力。

SGI1基因

如本文所描述的,显著生长改善基因1(SGI1)多肽是包含两个结构域的多肽:响应接收或“RR”结构域(Pfam PF00072)和Myb结构域(Pfam PF00249),其中RR结构域定位于Myb结构域的N末端。RR结构域和Myb结构域被氨基酸序列分开,发现所述氨基酸序列在SGI1多肽中是不良保守的或不保守的,有时在本文中被称为两个结构域之间的接头,其中所述接头的长度范围可以为例如一到300个氨基酸,或十到200个氨基酸。接头区可以任选地包含核定位序列(NLS)。

响应接收“RR”结构域(Pfam PF00072)的存在负责其作为CheY样多肽的生物信息注释。RR结构域大约从拟小球藻属SGI1多肽(SEQ ID NO:3)的氨基酸36延伸到氨基酸148,并且在保守结构域数据库(CDD)中还被表征为“信号接收结构域”,cd00156,大约从氨基酸37延伸到氨基酸154。所述RR结构域在蛋白质直系同源簇数据库中也被表征为“CheY样接收(REC)结构域”,COG0784,以及被表征为Interpro“CheY样超家族”结构域,IPR011006,其中这两个表征的结构域均从SEQ ID NO:3的拟小球藻属SGI1多肽的大约氨基酸33延伸到大约氨基酸161。RR结构域发现于细菌双组分调节系统(如包含称为CheY的多肽的细菌趋化性双组分系统)中,其中,其从传感器配偶体接收信号。此类系统的RR结构域通常发现于DNA结合结构域的N末端,并且包含可以被磷酸化的磷酸受体位点,所述位点可能负责其激活或去激活。

SGI1蛋白内的RR结构域可以表征为例如Pfam PF00072,或表征为“信号接收结构域”或简单地“接收结构域”,和/或可以分类为保守结构域数据库(CDD)中的cd00156、蛋白质直系同源簇数据库中的COG0784或Interpro“CheY样超家族”结构域的IPR011006。RR结构域发现于细菌双组分调节系统(如包含称为CheY的多肽的细菌趋化性双组分系统)中,其中,其从传感器配偶体接收信号。此类系统的RR结构域通常发现于DNA结合结构域的N末端,并且包含可以被磷酸化的磷酸受体位点,所述位点可能负责其激活或去激活。

SGI1蛋白内的myb结构域可以表征为例如pfamPF00249:“Myb样DNA结合结构域”,和/或可以鉴定为保守结构域TIGR01557“myb样DNA结合结构域,SHAQKYF类(作为SEQ IDNO:102公开的“SHAQKYF”)”,或鉴定为Interpro同源异型样结构域超家族结构域(IPR009057)和/或Interpro Myb结构域(IPR017930)。

除了在myb结构域的N末端具有RR结构域之外,当使用隐马尔可夫模型(HMM)进行扫描时,所述隐马尔可夫模型被设计成在查询蛋白质氨基酸序列与藻类中的SGI1同源物区域的保守氨基酸匹配程度基础上对蛋白质进行评分,本文所提供的SGI1蛋白的评分可以为300或更高、320或更高、340或更高、350或更高、360或更高或370或更高,其中在多肽的比较区域内高度保守的氨基酸位置比不良保守的氨基酸位置权重更大,以达到评分。当基于包含包括使用开发的RR结构域、接头和myb结构域的单个连续序列的藻类SGI1多肽的蛋白质序列用HMM模型进行扫描时,评分为350或更高,如370或更高的多肽包含但不限于:藻类和植物物种的多肽,拟小球藻属1185(SEQ ID NO:3)、胶球藻(SEQ ID NO:9)、海洋发光蚝球藻(SEQ ID NO:10)、莱茵衣藻(SEQ ID NO:11)、卡特氏团藻(SEQ ID NO:13)、四爿藻属105(SEQ ID No:14、15和16)、卵囊藻属(SEQ ID NO:17)、微胞藻属RCC299(SEQ ID NO:18)、细小微胞藻(SEQ ID NO:19)、伪泥炭藓(SEQ ID NO:20)、小立碗藓(SEQ ID NO:21)、拟南芥(SEQ ID NO:22)、圆叶拟南芥(SEQ ID NO:23)、琴叶拟南芥(SEQ ID NO:24)、向日葵(SEQID NO:25)、葡萄(SEQ ID NO:26)、无油樟(SEQ ID NO:27)、蓖麻(SEQ ID NO:28)、番茄(SEQID NO:29)、马铃薯(SEQ ID NO:30)、陆地棉(SEQ ID NO:31)、可可(SEQ ID NO:32)、菜豆(Phaeolis vulgaris)(SEQ ID NO:33)、大豆(SEQ ID NO:34)、藜麦(SEQ ID NO:35)、苹果(Domestica)(SEQ ID NO:36)、玉米(SEQ ID NO:37)、芜菁(SEQ ID NO:38)和稻(SEQ IDNO:39)以及与前述任一项具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性的多肽,其中所述多肽具有RR结构域和myb结构域,并且RR结构域是myb结构域的N末端。在各个实施例中,SGI1多肽来自植物或藻类物种(algal species)。对如本文所提供的SGI1多肽进行编码的基因,例如如本文所提供的,在突变体中被破坏或其表达被弱化的基因可以是各个实施例中编码如本文所公开的多肽的植物或藻类物种的天然存在的基因。

在一些实施例中,如本文所提供的SGI1多肽是藻类SGI1多肽,例如,具有天然存在的藻类SGI1多肽的序列,其中所述藻类多肽包含RR结构域和myb结构域,并且RR结构域是myb结构域的N末端。藻类多肽可以任选地与本文所公开的任何藻类SGI1多肽具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性。在一些实施例中,SGI1基因可以是对藻类SGI1多肽进行编码的基因,例如具有天然存在的藻类SGI1多肽的序列的多肽。编码具有天然存在的藻类SGI多肽的序列的多肽的SGI1基因可以是具有基因编码序列的天然存在的基因序列的基因,或可以具有与天然存在的基因序列不同的序列。在各个实施例中,如本文所公开的,在突变体光合生物中被弱化、突变或破坏的SGI1基因可以是通过BLAST进行鉴定的基因,例如使用本文所公开的序列和/或通过HMM扫描,其中HMM基于连续氨基酸序列,例如通过比较至少六个SGI多肽得到,其中所述连续氨基酸序列包含RR结构域和myb结构域,其中RR结构域是myb结构域的N末端,并且其中RR与myb结构域之间存在不属于任一个结构域的接头序列。

在一些实施例中,SGI多肽具有藻类SGI1多肽的序列或是与天然存在的藻类SGI1多肽具有至少85%、至少90%或至少95%同一性的天然存在的藻类SGI1多肽的变体,和/或与SEQ ID NO:3、SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18或SEQ IDNO:19中的任一项具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性。

在一些实施例中,SGI多肽具有植物SGI1多肽的序列或是与天然存在的藻类SGI多肽具有至少85%、至少90%或至少95%同一性的天然存在的植物SGI1多肽的变体,和/或与SEQ ID NO:20、SEQ ID NO:21、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQ ID NO:28、SEQ ID NO:29、SEQ ID NO:30、SEQ IDNO:31、SEQ ID NO:32、SEQ ID NO:33、SEQ ID NO:34、SEQ ID NO:35、SEQ ID NO:36、SEQ IDNO:37、SEQ ID NO:38或SEQ ID NO:39中的任一项具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性。

发现作为SEQ ID NO:1提供的拟小球藻属SGI1基因序列对包含两个主要功能结构域的多肽(SEQ ID NO:3)进行编码,所述两个功能结构域均出现在具有619个氨基酸的蛋白质的一半的N末端中。示例性拟小球藻属SGI1 cDNA序列被提供为SEQ ID NO:2。

在SGI1多肽的myb结构域的C末端的区域中,即在蛋白质的(大约)的C末端一半中,没有发现保守的蛋白质结构域。另一方面,RR结构域和Myb结构域(其中myb结构域定位于RR结构域的C末端)可以于在绿色植物(Viridiplantae)(绿色植物(green plant),涵盖藻类)基因组中编码的许多蛋白质中发现。生物信息学分析用于鉴定另外的植物和藻类物种中的SGI1的可能直向同源物。

为了鉴别另外的光合生物中的一类SGI1蛋白,构建隐马尔可夫模型(HMM)以-用于在拟小球藻属SGI1中发现RR结构域myb结构域架构。作为第一步,拟小球藻属SGI1多肽序列(SEQ ID NO:3)用于BLAST搜索包含植物和藻类基因组的JGI Phytozome数据库v.12。还将四种专有藻类基因组(来自拟小球藻属、拟南芥、四爿藻属和卵囊藻属)添加到所搜索的数据库中。当搜索达到大约2,000次命中时,停止搜索。然后通过InterProScan(可从EMBL-EBI[欧洲分子生物学实验室-欧洲生物信息学研究所(European Molecular BiologyLaboratories-European Bioinformatics Institute),例如,在ebi.ac.uk]获得)来分析这些结果,以确保所选结果具有Interpro CheY样超家族结构域(IPR011006)和InterproHomeobox样或Myb结构域(IPR009057或IPR017930)两者。此步骤将所选命中的数量减少到介于900与1,000之间,其中多肽在藻类和高等植物两者的多肽中清楚地鉴别出了双结构域架构(RR结构域N端-myb结构域)。所得序列用于基于序列同源性来组装系统发育树(phylogenetic tree)。系统发育树示出了来自藻类物种的相关多肽的清晰分组,包含拟小球藻属、四爿藻属、卵囊藻属、衣藻属、团藻属、蚝球藻属、微胞藻属和胶球藻属的SGI1同源物。

表:藻类物种中的SGI1直向同源物

生物 多肽序列 HMM评分
拟小球藻属1185 SEQ ID NO:3 400.20
胶球藻 SEQ ID NO:9 403.0
海洋发光蚝球藻 SEQ ID NO:10 425.8
莱茵衣藻 SEQ ID NO:11 413.3
佐夫色绿藻 SEQ ID NO:12 292.6
卡特氏团藻 SEQ ID NO:13 441.4
四爿藻属105 SEQ ID NO:14 403.6
四爿藻属105 SEQ ID NO:15 403.0
四爿藻属105 SEQ ID NO:16 402.9
卵囊藻属 SEQ ID NO:17 426.9
微胞藻属RC299 SEQ ID NO:18 418.4
细小微胞藻 SEQ ID NO:19 405.9

为了建立其它光合生物中可能的SGI1直向同源物的标准,然后基于SGI1多肽序列的藻类簇开发隐马尔可夫模型(HMM)。HMM是基于SGI1多肽的N末端部分开发的,所述SGI1多肽涵盖RR和myb结构域两者,包含两个保守结构域之间的接头区。不包含任何可识别的保守结构的myb结构域的多肽C末端的序列从模型构建中排除。HMMER 3.1b2用于使用来自以下的多序列比对(MSA)来构建HMM:拟小球藻属、卵囊藻属和四爿藻属多肽的专有序列以及莱茵衣藻、卡特氏团藻、佐夫色绿藻、胶球藻、微胞藻属RCC299和海洋发光蚝球藻的多肽的公共数据库的序列。使用ETE3 toolkit和eggnog41工作流程来生成蛋白质的N末端一半的多序列比对(MSA)。此程序在内部使用程序Muscle、MAFFT、ClustalΩ,以及用于比对的M-coffee、用于比对修剪的trimAI和用于系统发生干扰的PhyML。与用于同源性比较的单个蛋白质序列不同,HMM例如从多个蛋白质序列中捕获信息,并且因此能够区分高度保守与高度趋异的残基,并在确定序列的相关性时考虑到这一点。当HMM用于对序列进行评分时,高度保守的残基比高度趋异的残基接收更多的权重,由此提供比更简单的PSA更优越的灵敏度和准确度。

SGI1 HHM用于将得分分配给在BLAST搜索中所鉴定的多肽,所述多肽也被验证为具有两个保守结构域(RR和myb)。在生物信息学搜索中,几乎在藻类物种和单一植物多肽中发现的最高评分允许鉴定其它藻类物种中的兴趣蛋白质(表1)。这些代表可能的直向同源物,其基因可以被弱化或敲除以在其它生物中提供高生产力突变体。

表2:植物物种中的SGI1直向同源物

生物 多肽序列 HMM评分
伪泥炭藓 SEQ ID NO:20 397.3
小立碗藓 SEQ ID NO:21 372.3
拟南芥 SEQ ID NO:22 371.1
圆叶拟南芥 SEQ ID NO:23 475.9
琴叶拟南芥 SEQ ID NO:24 395.5
向日葵 SEQ ID NO:25 391.2
葡萄 SEQ ID NO:26 390.6
无油樟 SEQ ID NO:27 390.1
蓖麻 SEQ ID NO:28 390.1
番茄 SEQ ID NO:29 388.4
马铃薯 SEQ ID NO:30 387.2
陆地棉 SEQ ID NO:31 385.8
可可 SEQ ID NO:32 383.0
菜豆 SEQ ID NO:33 381.6
大豆 SEQ ID NO:34 381.4
藜麦 SEQ ID NO:35 373.7
苹果 SEQ ID NO:36 372.6
玉米 SEQ ID NO:37 371.5
芜菁 SEQ ID NO:38 370.5
SEQ ID NO:39 369.6

图1A中示出了SGI1基因的示意图。

在一些实施例中,调节SGI1基因,如藻类物种中的SGI1基因的突变、弱化或敲除,例如,增加光系统II中的光化学最大量子产率(Fv/FM)(约10%-14%),同时与其所源自的野生型菌株相比,展现出减小的触角尺寸(即,功能吸收横截面)。

在一些实施例中,调节SGI1基因也可能引起光系统II(PSII)和光系统I(PSI)的天线尺寸减小(即,功能吸收横截面)(相对于野生型下降40%-50%),在饱和光下关于PSII(1/τ’Qa)受体侧的高电子传递速率(增加约35%到约130%,并且相对于野生型在这些工程化的突变体中至少大约100%)和高固碳率(Pmax)(相对于野生型,高达至少30%-40%),同时,如通过多反应监测蛋白质测定所确定的,保持在每个TOC的基础上的光系统的数量。

SGI2基因

本申请的发明人已经鉴定显著生长改善基因2(SGI2)作为存在于光合生物(例如,藻类)中的直向同源物,这一类调节基因的植物称为双组分系统(TCS),因为已知所述植物调节重要的细胞过程,包含细菌细胞周期进展和发育(Skerker等人,2015;“调控细菌中生长和细胞周期进展的双组分信号转导途径:系统-水平分析(Two-component signaltransduction pathways regulating growth and cell cycle progression in abacterium:a system-level analysis)”,《PLoS生物学(PLoS Biology.)》3(10):e334),氮感测(Sanders等人,1992;“其共价中间体激活转录的蛋白磷酸酶NtrC的磷酸化位点(Phosphorylation site of NtrC,a protein phosphatase whose covalentintermediate activates transcription)”《细菌学期刊(Journal of Bacteriology)》174(15):5117-22)以及细菌趋化性(Sanders等人,1989;“趋化性响应调控蛋白CheY的磷酸化位点的鉴定(Identification of the site of phosphorylation of the chemotaxisresponse regulator protein,CheY)”;《生物化学期刊(The Journal of BiologicalChemistry)》264(36):21770-8)。在细菌中,这些蛋白质通常由检测特定环境刺激的组氨酸激酶和介导细胞应答的对应的响应调控结构域(PF00072)组成,主要通过靶基因的差异表达。然而,在光合生物中,SGI2基因包括对应的响应调控结构域(PF00072),并且缺少双组分系统的另一个结构域。

图2A中示出了SGI1基因的示意图,并且图2B中示出了对应的蛋白质的示意图。

发现一种示例性拟小球藻属SGI2基因序列被提供为SEQ ID NO:4,其编码包括响应调控结构域(SEQ ID NO:6)的多肽(SEQ ID NO:5)。

各种光合生物中的示例性直向同源多肽序列在下表3中示出。

表3:各种光合生物中的直向同源SGI2序列

光合生物 多肽序列
卵囊藻属 SEQ ID NO:40
四爿藻属 SEQ ID NO:41
拟南芥 SEQ ID NO:42
拟南芥 SEQ ID NO:43
拟南芥 SEQ ID NO:44
拟南芥 SEQ ID NO:45
拟南芥 SEQ ID NO:46
大豆 SEQ ID NO:47
葡萄 SEQ ID NO:48
可可 SEQ ID NO:49
SEQ ID NO:50
玉米 SEQ ID NO:51
小立碗藓 SEQ ID NO:52
卡特氏团藻 SEQ ID NO:53
莱茵衣藻 SEQ ID NO:54
佐夫小球藻 SEQ ID NO:55
胶球藻C-169 SEQ ID NO:56

示例性拟小球藻属SGI2 cDNA序列被提供为SEQ ID NO:7。其它光合生物中的SGI2基因的直向同源cDNA序列在下表4中示出。

表4:其它光合生物中的SGI2基因的直向同源cDNA序列

光合生物 cDNA序列
卵囊藻属 SEQ ID NO:57
四爿藻属 SEQ ID NO:58
大豆 SEQ ID NO:59
葡萄 SEQ ID NO:60
可可 SEQ ID NO:61
SEQ ID NO:62
玉米 SEQ ID NO:63
小立碗藓 SEQ ID NO:64
卡特氏团藻 SEQ ID NO:65
莱茵衣藻 SEQ ID NO:66
胶球藻 SEQ ID NO:67

在一些实施例中,光合生物的SGI2多肽包括与SEQ ID NO:6至少35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%相同的氨基酸序列。在一些实施例中,光合生物的SGI2多肽包括与至少100个、150个、200个、250个氨基酸或SEQ ID NO:5、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55或56的全长至少35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%相同的氨基酸序列。

在一些实施例中,光合生物包括对SGI2多肽进行编码的多核苷酸,其中所述多核苷酸的核酸序列与至少100个、150个、200个、250个核苷酸或SEQ ID NO:4、7、57、58、59、60、61、62、63、64、65、66或67的全长至少35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%相同。

在一些实施例中,调节SGI2基因,如光合生物(如藻类物种)中的SGI2基因的突变、弱化或敲除,增加了光系统II中的光化学最大量子产率(Fv/FM)(约10%-14%),减少的叶绿素/每总有机碳(TOC),增加的生物质。

SPR54基因

美国专利申请公开2016/0304896中已经描述了SPR54基因的调节,所述美国专利申请通过引用整体并入本文。示例性拟小球藻属叶绿体SRP54(cpSRP54)cDNA序列被提供为SEQ ID NO:8提供,其编码具有SEQ ID NO:68的多肽。

其它非限制性的示例性cpSRP54直向同源多肽包含GenBank登录号:EDP00260,用于莱茵衣藻(SEQ ID NO:75);EEH59526,用于细小微胞藻(SEQ ID NO:76);EEH59526,用于微胞藻属(SEQ ID NO:77);ACB42577,用于载色宝琳虫(Paulinella chromatophora)(SEQID NO:78);ABO94038,用于海洋发光蚝球藻(SEQ ID NO:79);Q01H03,用于金牛蚝球藻(SEQID NO:80);EFJ41797,用于卡特氏团藻(SEQ ID NO:81);EEC48599,用于三角褐指藻(Phaeodactylum tricornutum)(SEQ ID NO:82);EED94755,用于假微型海链藻(Thalassiosira pseudonana)(SEQ ID NO:83);EGB12501,用于抑食金球藻(Aureococcusanophagefferens)(SEQ ID NO:84);CBN76263,用于长囊水云(Ectocarpus siliculosus)(SEQ ID NO:85)。

在一些实施例中,光合生物的cpSRP54基因编码与上文所公开的cpSRP54至少55%、至少60%、至少65%、至少70%、至少75%、至少80%或至少85%、至少90%或至少95%序列同一性的多肽。

光合生物的SGI2、SGI1和SRP54的组合、SGI2和SRP54基因的组合或SGI1、SGI2和SRP54基因的组合的调节

光合生物的SGI2、SGI1和SRP54的组合、SGI2和SRP54基因的组合或SGI1、SGI2和SRP54基因的组合的调节生产突变体光合生物。SGI1、SGI2、SRP54基因可以通过UV诱变、γ射线或基因工程技术来调节。可以改变基因序列,可以部分或完全缺失基因序列,可以改变基因的表达。

在一些实施例中,SGI1、SGI2和/或SRP54基因可以可操作地连接到如美国申请公开2017/0058303中所描述的藻类启动子和终止子序列,所述美国申请通过引用整体并入本文。

在一些实施例中,相对于对照细胞,突变体光合生物(例如,植物、藻类)的总叶绿素具有至少20%减少、至少30%减少、至少40%减少、至少50%减少、至少55%减少、至少60%减少、至少65%减少或至少70%减少,任选地进一步,其中所述突变体具有至少相对于对照细胞增加的叶绿素a与叶绿素b的比率,进一步任选地,其中叶绿素a与叶绿素b的比率为至少约2.8:1、至少约3:1、至少约3.2:1、约3.3:1、至少约3.5:1、至少约3.7:1、至少约3.9:1、至少约4:1或至少约4.3:1。

在一些实施例中,突变体光合生物(例如,植物或藻类)展现出:(a)在介于约100与约2800μmol光子m-2sec-1、介于约150与约2800μmol光子m-2sec-1、介于约75与约2800μmol光子m-2sec-1、介于约40与约2800μmol光子m-2sec-1或介于约10与约2800μmol光子m-2sec-1之间的所有辐照度下,相对于同一物种的对照光合生物更高的qP;

(b)在介于约100与约2800μmol光子m-2sec-1、介于约150与约2800μmol光子m-2sec-1、介于约75与约2800μmol光子m-2sec-1、介于约40与约2800μmol光子m-2sec-1或介于约10与约2800μmol光子m-2sec-1之间的所有辐照度下,相对于对照藻类更低的NPQ;

(c)在介于约100与约2800μmol光子m-2sec-1、介于约150与约2800μmol光子m-2sec-1、介于约75与约2800μmol光子m-2sec-1、介于约40与约2800μmol光子m-2sec-1或介于约10与约2800μmol光子m-2sec-1之间的所有辐照度下,相对于光合生物(例如,藻类)更高的Y(II);

(d)介于约100与约2800μmol光子m-2sec-1、介于约150与约2800μmol光子m-2sec-1、介于约75与约2800μmol光子m-2sec-1、介于约40与约2800μmol光子m-2sec-1、或介于约10与约2800μmol光子m-2sec-1之间,相对于对照藻类更高的Fv/FM

(e)介于约250与约2800μmol光子m-2sec-1、介于约150与约2800μmol光子m-2sec-1、介于约75与约2800μmol光子m-2sec-1、介于约40与约2800μmol光子m-2sec-1或介于约10与约2800μmol光子m-2sec-1之间,相对于对照藻类更高的ESR(II);

(f)相对于对照藻类,在每个叶绿素的基础上的析氧增加了至少50%、至少100%、至少200%、至少300%、至少350%或至少400%;并且

(g)相对于同一物种的对照光合生物,在每个叶绿素的基础上的碳固定增加了至少50%、至少60%、至少70%、至少80%、至少90%或至少100%。

在一些实施例中,突变体光合生物显示出比在相同条件下培养的对照藻类高至少5%、至少6%、至少8%或至少10%、至少15%、至少25%、或至少30%的生物质生产力。

在一些实施例中,在具有模拟自然日光的可变光强度的昼夜循环培养中,相对于对照藻类,突变体光合生物(例如植物、藻类)显示出更大的生产力,任选地其中光强度的峰值处于约1900与约2000μmol光子m-2sec-1之间。

在一些实施例中,相对于不具有一个或多个改变的或弱化的基因的同一物种的对照光合生物,突变体光合生物(例如,植物或藻类)具有更大的脂质生产力,例如至少5%、至少10%、至少15%、至少20%或至少25%更大的脂质生产力。

定义

除非另外定义,否则本文中所使用的所有技术术语和科学术语的含义与本发明所属领域的普通技术人员通常所理解的含义相同。在冲突的情况下,以本申请(包含定义)为准。除非上下文另外要求,否则单数术语应该包含复数含义,并且复数术语应该包含单数含义。本文引用的所有公开、专利和其它参考文献均出于所有目的通过引用整体并入本文,如同每个单独的公开或专利申请被专门且单独地指示通过引用并入。

如在本公开和权利要求中所使用的,除非上下文另外明确指明,否则单数形式的“一个(a)”“一种(an)”和“所述(the)”也包含复数形式。

在本申请内所提供的所有范围包含所述范围的上端和下端的值。

在本文中,在短语如“A和/或B”中所使用的术语“和/或”旨在包含“A和B”、“A或B”、“A”和“B”。

术语“基因”被广泛地用于指对多肽或表达的RNA进行编码的核酸分子(典型地DNA,但任选地RNA)的任何区段。因此,基因包含对表达的RNA进行编码的序列(其可以包含多肽编码序列或例如功能性RNA,如核糖体RNA、tRNA、反义RNA、微RNA、短发夹RNA、核酶等)。基因可以进一步包括其表达所需或影响其表达的调控序列,以及与蛋白质或处于其天然状态的RNA编码序列相关的序列,例如内含子序列、5'或3'非翻译序列等。在一些实例中,“基因”可以仅指DNA或RNA分子的蛋白质编码部分,其可以包含或可以不包含内含子。基因的长度优选地大于50个核苷酸,更优选地长度大于100个核苷酸,并且长度可以例如介于50个核苷酸与500,000个核苷酸之间,如长度介于100个核苷酸与100,000个核苷酸之间或长度介于约200个核苷酸与约50,000个核苷酸之间或长度介于约200个核苷酸与约20,000个核苷酸之间。基因可以从多种来源获得,包含从兴趣来源克隆或从已知或预测的序列信息合成。

术语“核酸”或“核酸分子”是指DNA或RNA(例如,mRNA)的区段,并且还包含具有经过修饰的主链(例如,肽核酸、锁核酸)或经过修饰的或非天然存在的核碱基的核酸。核酸分子可以是双链的、部分双链的或单链的;包括基因或其部分的单链核酸可以是编码(有义)链或非编码(反义)链。

核酸分子可以“源自”指示的来源,其包含从所指示的来源分离(全部或部分)核酸区段。核酸分子也可以通过例如直接克隆、PCR扩增或从指示的多核苷酸来源人工合成或基于与指示的多核苷酸来源相关的序列而源自指示的来源。源自特定来源或物种的基因或核酸分子还包含相对于来源核酸分子具有序列修饰的基因或核酸分子。例如,源自来源(例如,特定参照基因)的基因或核酸分子可以包含相对于来源基因或核酸分子的一种或多种突变,这些突变是非预期的或有意引入的,并且如果一种或多种突变(包含取代、缺失或插入)是有意引入的,则这些序列改变可以通过细胞或核酸的随机或靶向突变、通过扩增或其它基因合成或分子生物学技术或通过化学合成或其任何组合来引入。源自编码功能性RNA或多肽的参照基因或核酸分子的基因或核酸分子可以编码功能性RNA或多肽,所述功能性RNA或多肽与参照或来源功能性RNA或多肽或与其功能性片段具有至少75%、至少80%、至少85%、至少90%或至少95%序列同一性。例如,源自编码功能性RNA或多肽的参照基因或核酸分子的基因或核酸分子可以编码功能性RNA或多肽,所述功能性RNA或多肽与参照或来源功能性RNA或多肽或与其功能性片段具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%序列同一性。

如本文所使用的,“分离的”核酸或蛋白质是从其天然环境或核酸或蛋白质在自然界中存在的环境中去除的。例如,分离的蛋白质或核酸分子是从在其天然或自然环境中与其相关的细胞或生物中去除的。在一些实例中,分离的核酸或蛋白质可以部分地或基本上纯化,但是分离不需要特定水平的纯化。因此,例如,分离的核酸分子可以是已经从其在自然界中被整合到的染色体、基因组或附加体切离的核酸序列。

“纯化的”核酸分子或核苷酸序列或蛋白质或多肽序列基本上不含细胞材料和细胞组分。纯化的核酸分子或蛋白质可以基本上不含例如除缓冲液或溶剂之外的化学品。“基本上不含”不旨在意指除了这些新核酸分子之外的其它组分是不可检测的。

术语“天然存在的”和“野生型”是指在自然界中发现的形式。例如,天然存在的或野生型核酸分子、核苷酸序列或蛋白质可以存在于从天然来源分离的物质中并且未通过人为操纵有意地修饰。

如本文所使用的,“弱化的”意指量、程度、强度(intensity)或强度(strength)降低。弱化的基因表达可以指所讨论的基因的显著降低的量和/或转录速率,或经过编码的蛋白质的翻译、折叠或组装的显著降低的量和/或转录速率。作为非限制性实例,弱化的基因可以是由于基因调控序列的改变或破坏而具有减少的表达的突变或破坏的基因(例如,通过部分或全部缺失、截短、移码或插入突变而破坏的基因),或可以是由减少基因表达的构建体(例如,反义RNA、微RNA、RNAi分子、或核酶)所靶向的基因。

“外源核酸分子”或“外源基因”是指已经被引入(“转化”)到细胞中的核酸分子或基因。转化的细胞可以称为重组细胞,其中可以引入另外的一个或多个外源基因。如果用核酸分子转化的细胞的后代已经遗传外源核酸分子,则其也称为“转化的”。相对于被转化的细胞,外源基因可以来自不同物种(并且因此是“异源的”)或来自同一物种(并且因此是“同源的”)。“内源性”核酸分子、基因或蛋白质是天然核酸分子、基因或蛋白质,因为其存在于宿主中或由宿主天然产生。

本文所使用的术语“天然的”用于指代核酸序列或氨基酸序列,因为所述核酸序列或氨基酸序列天然存在于宿主中。本文所使用的术语“非天然的”用于指代不天然存在于宿主中的核酸序列或氨基酸序列。已经从细胞中去除、经受实验室操作、并引入或重新引入到宿主细胞的核酸序列或氨基酸序列被认为是“非天然的”。引入到宿主细胞中的合成或部分地合成的基因是“非天然的”。非天然基因进一步包含对宿主微生物内源的基因,所述基因可操作地连接到已经重组到宿主基因组中的一种或多种异源调控序列。

“重组的”或“工程化的”核酸分子是已经通过人为操纵改变的核酸分子。作为非限制性实例,重组核酸分子包含执行以下的任何核酸分子:1)在体外已经部分地或完全地合成或修饰,例如使用化学或酶促技术(例如,通过使用化学核酸合成,或通过使用用于核酸分子的复制、聚合、消化(核酸外切或核酸内切)、连接、逆转录、转录、碱基修饰(包含例如甲基化)、整合或重组(包含同源和位点特异性重组)的酶);2)包含本质上不连接的连接核苷酸序列;3)已经使用分子克隆技术工程化,使得其相对于天然存在的核酸分子序列缺少一个或多个核苷酸;和/或4)已经使用分子克隆技术进行了操作,使得其相对于天然存在的核酸序列具有一个或多个序列改变或重排。作为非限制性实例,cDNA是重组DNA分子,如已经通过体外一个或多个聚合酶反应产生的或接头已经附着到的或已经整合到载体(如克隆载体或表达载体)中的任何核酸分子。

如本文所使用的,术语“重组蛋白质”是指通过基因工程产生的蛋白质。

当应用于生物时,术语重组、工程化或基因工程化是指已经通过将异源或外源重组核酸序列引入到生物中而被操作的生物,并且包含基因敲除、靶向突变、基因置换和启动子置换、缺失或插入,以及将转基因或合成基因引入到生物中。重组或基因工程化生物也可以是已经引入到其的用于基因“敲低”的构建体的生物。此类构建体包含但不限于RNAi、微RNA、shRNA、siRNA、反义和核酶构建体。还包含其基因组已经通过大范围核酸酶、锌指核酸酶、TALEN或Cas/CRISPR系统的活性改变的生物。外源或重组核酸分子可以整合到重组/基因工程化生物的基因组中或在其它实例中,可以不整合到宿主基因组中。如本文所使用的,“重组微生物”或“重组宿主细胞”包含本发明的重组微生物的后代或衍生物。因为某些修饰可能由于突变或环境影响而在后续世代中发生,所以此类后代或衍生物实际上可能不与亲本细胞相同,但仍包含在如本文所使用的术语的范围内。

术语“启动子”是指能够结合细胞中的RNA聚合酶并启动下游(3'方向)编码序列的转录的核酸序列。启动子包含以高于背景的可检测水平启动转录所必需的最小数量的碱基或元件。启动子可以包含转录起始位点以及负责结合RNA聚合酶的蛋白质结合结构域(共有序列)。真核启动子通常但不总是含有“TATA”盒和“CAT”盒。原核启动子可以含有-10和-35原核启动子共有序列。来自多种不同来源的大量启动子(包含组成型、诱导型和阻抑型启动子)是本领域所众所周知的。代表性来源包含例如藻类、病毒、哺乳动物、昆虫、植物、酵母和细菌细胞类型,并且来自这些来源的合适的启动子是容易获得的,或可以基于公开在线获得的序列或例如,从保藏机构(如ATCC)以及其它商业或个体来源合成地制备。启动子可以是单向的(启动一个方向的转录)或双向的(启动任一方向的转录)。启动子可以是组成型启动子、阻抑型启动子或诱导型启动子。除了RNA聚合酶结合以启动转录的基因近端启动子之外,启动子区还可以包含基因上游的额外序列,所述额外序列可以处于基因的转录起始位点的1kb、2kb、3kb、4kb、5kb或更多内,其中所述额外序列可以影响下游基因的转录速率并且任选地影响启动子对发育、环境或生物化学(例如,代谢)条件的反应性。

当关于多核苷酸、基因、核酸、多肽或酶使用时,术语“异源的”是指来自来源或源自除宿主生物物种之外的来源的多核苷酸、基因、核酸、多肽或酶。相比而言,“同源的”多核苷酸、基因、核酸、多肽或酶在本文中用于表示源自宿主生物物种的多核苷酸、基因、核酸、多肽或酶。当提及基因调控序列或提及用于保持或操纵基因序列的辅助核酸序列时(例如启动子、5'非翻译区、3'非翻译区、poly A添加序列、内含子序列、剪接位点、核糖体结合位点、内部核糖体进入序列、基因组同源区、重组位点等),“异源的”意指调控序列或辅助序列不与调控序列或辅助核酸序列与之并置在构建体、基因组、染色体或附加体中的基因天然地缔合。因此,可操作地连接到其在其天然状态下(即,在非基因工程化生物的基因组中)不可操作地连接的基因的启动子在本文中被称为“异源启动子”,即使所述启动子可以源自与其连接的基因相同的物种(或在一些情况下,同一生物)。

如本文所使用的,术语“蛋白质”或“多肽”旨在涵盖单数“多肽”以及复数“多肽”,并且是指由通过酰胺键(也称为肽键)线性连接的单体(氨基酸)构成的分子。术语“多肽”是指两个或更多个氨基酸中的任何链或多条链,并且不是指产物的特定长度。因此,肽、二肽、三肽、寡肽、“蛋白质”、“氨基酸链”或用于指代两个或更多个氨基酸的一条链或多条链的任何其它术语被包含在“多肽”的定义内,并且术语“多肽”可以代替或与这些术语中的任何一个可互换地使用。

基因和蛋白质登录号(通常在基因或物种名称之后的括号中提供)是在美国国立卫生研究院(United States National Institutes of Health)维护的美国国家生物技术信息中心(NCBI)网站(ncbi.nlm.nih.gov)可公开获得的序列记录的唯一标识符。“GenInfo标识符”(GI)序列标识号是特定于核苷酸或氨基酸序列的。如果序列以任何方式改变,则分配新的GI号。序列修订历史工具可用于跟踪出现在特定GenBank记录中的序列的各种GI号、版本号和更新日期。基于登录号和GI号来搜索和获得核酸或基因序列或蛋白质序列在例如细胞生物学、生物化学、分子生物学和分子遗传学领域中是众所周知的。

如本文所使用的,相对于核酸或多肽序列的术语“同一性百分比”或“同源性”被定义为在比对序列达到最大百分比同一性并且在必要时引入空位以实现最大百分比同源性之后,候选序列中与已知多肽相同的核苷酸或氨基酸残基的百分比。N末端或C末端插入或缺失不应被解释为影响同源性,并且多肽序列中少于约30个、少于约20个或少于约10个氨基酸残基的内部缺失和/或插入不应被解释为影响同源性。核苷酸或氨基酸序列水平的同源性或同一性可以使用程序blastp、blastn、blastx、tblastn和tblastx(Altschul(1997),《核酸研究(Nucleic Acids Res.)》25,3389-3402,和Karlin(1990),《美国国家科学院院刊(Proc.Natl.Acad.Sci.USA)》87,2264-2268)所采用的算法,通过BLAST(基本局部比对搜索工具)分析来确定,其被定制用于序列相似性搜索。BLAST程序所使用的方法是首先考虑查询序列与数据库序列之间具有和不具有缺口的相似区段,然后评估所鉴定的所有匹配的统计显著性,并且最后仅概括满足预先选择的显著性阈值的那些匹配。关于序列数据库相似性搜索中的基本问题的讨论,参见Altschul(1994),《自然遗传学(Nature Genetics)》6,119-129。直方图、描述、比对、预期(即,用于报告针对数据库序列的匹配的统计显著性阈值)、截断、矩阵和过滤器(低复杂度)的搜索参数可以处于默认设置。blastp、blastx、tblastn和tblastx所使用的默认评分矩阵是BLOSUM62矩阵(Henikoff(1992),《美国国家科学院院刊》89,10915-10919),推荐长度超过85的查询序列(核苷酸碱基或氨基酸)。

对于设计用于比较核苷酸序列的blastn,评分矩阵通过M(即,一对匹配残基的奖励评分)与N(即,错配残基的罚分)的比率来设定,其中M和N的默认值可以分别为+5和-4。四个blastn参数可以调整如下:Q=10(空位产生罚分);R=10(空位延伸罚分);wink=1(在沿着查询的每个winkth位置处生成字命中);和gapw=16(设置其中生成空位对准的窗口宽度)。用于氨基酸序列比较的等效Blastp参数设置可以为:Q=9;R=2;wink=1;以及gapw=32。GCG软件包版本10.0中可获得的序列之间的Bestfit比较可以使用DNA参数GAP=50(空位产生罚分)和LEN=3(空位延伸罚分),并且蛋白质比较中的等效设置可以为GAP=8和LEN=2。

因此,当提及本发明的多肽或核酸序列时,包含的是与全长多肽或核酸序列或与其片段(其包括整个蛋白质的至少50个、至少75个、至少100个、至少125个、至少150个或更多个氨基酸残基的连续序列)具有至少40%、至少45%、至少50%、至少55%、至少70%、至少65%、至少70%、至少75%、至少80%、或至少85%,例如至少86%、至少87%、至少88%、至少89%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%或约100%序列同一性的序列同一性;此类序列的变体,例如其中至少一个氨基酸残基已经被插入所公开的含有插入和取代的一个或多个序列的N-和/或C-末端和/或内部。预期的变体可以另外地或可替代地包含通过例如同源重组或定点或PCR诱变而含有预定突变的那些变体,以及其它物种的对应多肽或核酸,包含但不限于本文所描述的那些多肽或核酸,含有插入和取代的多肽或核酸家族的等位基因或其它天然存在的变体;和/或衍生物,其中所述多肽已经通过取代、化学、酶促或其它适当方式用除了含有插入和取代的天然存在的氨基酸之外的部分(例如,可检测的部分如酶)共价修饰。

如本文所使用的,短语“保守性氨基酸取代”或“保守性突变”是指一个氨基酸被另一个具有共同性质的氨基酸置换。用于定义单独的氨基酸之间的共同性质的功能性方法是分析同源生物的对应蛋白质之间氨基酸变化的归一化频率(Schulz(1979),《蛋白质结构原理(Principles of Protein Structure)》,施普林格出版社(Springer-Verlag))。根据此类分析,可以定义氨基酸基团,其中一个基团内的氨基酸优先地彼此交换,并且因此在其对整体蛋白质结构的影响方面彼此最相似(Schulz(1979),《蛋白质结构原理》,施普林格出版社)。以这种方式定义的氨基酸基团的实例可以包含:“带电/极性基团”,其包含Glu、Asp、Asn、Gln、Lys、Arg和His;“芳香族或环状基团”,其包含Pro、Phe、Tyr和Trp;以及“脂肪族基团”,其包含Gly、Ala、Val、Leu、Ile、Met、Ser、Thr和Cys。在每个基团内,也可以鉴定亚基。例如,带电/极性氨基酸的基团可以被细分为亚基,包含:“带正电亚基”,其包括Lys、Arg和His;“带负电亚基”,其包括Glu和Asp;以及“极性亚基”,其包括Asn和Gln。在另一个实例中,芳香族或环状基团可以被细分为亚基,包含:“氮环亚基”,其包括Pro、His和Trp;以及“苯基亚基”,其包括Phe和Tyr。在另一个进一步的实例中,脂肪族或环状基团可以被细分为亚基,包含:“大脂肪族非极性亚基”,其包括Val、Leu和Ile;“脂肪族微极性亚基”,其包括Met、Ser、Thr和Cys;以及“小残基亚基”,其包括Gly和Ala。保守性突变的实例包含上文亚基内的氨基酸的氨基酸取代,如但不限于:Lys取代Arg,或反之亦然,使得可以保持正电荷;Glu取代Asp,或反之亦然,使得可以保持负电荷;Ser取代Thr,或反之亦然,使得可以保持游离的-OH;并且Gln取代Asn,或反之亦然,使得可以保持游离的-NH2。“保守性变体”是包含一个或多个氨基酸的多肽,所述一个或多个氨基酸已被取代以用具有共同性质(例如,属于与上文所描绘的氨基酸基团或亚基相同的氨基酸基团或亚基)的氨基酸置换参考多肽的一个或多个氨基酸(例如,其序列已经在出版物或序列数据库中公开或其序列已经通过核酸测序确定的多肽)。

如本文所使用的,术语基因的“调节(modulating)”或“调节(modulation)”是指改变所述基因的核酸序列、完全地或部分地缺失所述基因、引起所述基因中的断裂、改变所述基因的表达、抑制基因的表达或使所述基因的表达沉默。在一些实施例中,改变基因的序列是通过插入一个或多个核苷酸、缺失一个或多个核苷酸、取代核苷酸。改变序列可以通过UV辐照、γ辐照、基因工程来实现。

如本文所使用的,“弱化基因表达”意指以减少全功能性蛋白质的产生的任何方式减少或消除基因的表达。

如本文所使用的,“表达”包含至少在RNA生产水平上的基因表达,并且“表达产物”包含所得产物,例如多肽或功能性RNA(例如,核糖体RNA、tRNA、反义RNA、微RNA、shRNA、核酶等)。术语“增加的表达”包含基因表达的改变以促进增加的mRNA产生和/或增加的多肽表达。“增加的产生”包含与多肽的天然产生或酶活性相比,多肽表达的量、多肽的酶活性水平或两者的组合的增加。

本发明的一些方面包含特定多核苷酸序列的表达的部分、基本或完全缺失、沉默、失活或下调。这些基因可以部分地、基本上或完全地缺失、沉默、失活,或其表达可以被下调,以便影响由其编码的多肽所进行的活性,如酶的活性。通过插入破坏基因的功能和/或表达的核酸序列(例如,病毒插入、转座子诱变、大范围核酸酶工程化、同源重组或本领域已知的其它方法),基因可以部分地、基本上或完全地缺失、沉默、失活或下调。术语“消除(eliminate)”、“消除(elimination)”和“敲除(knockout)”可以与术语“缺失”、“部分缺失”、“基本缺失”或“完全缺失”互换使用。在某些实施例中,可以通过定点同源重组工程化兴趣微生物,以敲除特定的兴趣基因。在仍其它实施例中,RNAi或反义DNA(asDNA)构建体可以用于部分地、基本上或完全地沉默、失活或下调特定兴趣基因。

某些核酸分子或特定多核苷酸序列的这些插入、缺失或其它修饰可以被理解为涵盖“一种或多种基因修饰”或“一种或多种转化”,使得这些微生物或宿主细胞的所得菌株可以被理解为“经过基因修饰的”、“基因工程化的”或“转化的”。

如本文所使用的,“上调的(up-regulated)”或“上调(up-regulation)”包含兴趣基因或核酸分子的表达或酶活性的增加,例如,与未被上调的在其它方面相同的基因或酶中的表达或活性相比,基因表达或酶活性的增加。

如本文所使用的,“下调的(down-regulated)”或“下调(down-regulation)”包含兴趣基因或核酸分子的表达或酶活性的降低,例如,与未被下调的在其它方面相同的基因或酶中的表达或活性相比,基因表达或酶活性的降低。

如本文所使用的,“突变体”是指一种非天然存在的并在基因中具有突变的生物,所述突变是由于经典诱变(例如,使用γ辐照、UV、或化学诱变剂)而出现的。如本文所使用的,“突变体”还指由于基因工程化而具有改变的基因结构或表达的重组细胞,作为非限制性实例,所述基因工程化可以包含过表达,包含在不同的时间、生物、或环境调控下的基因表达和/或与天然发生的程度不同的基因表达和/或在重组细胞中非天然表达的基因的表达;同源重组,包含敲除和敲入(例如,用对具有比野生型多肽更高或更低活性的多肽进行编码的基因和/或显性阴性多肽进行基因置换);通过RNAi、反义RNA或核酶等的基因弱化;以及使用大范围核酸酶、TALEN和/或CRISPR技术等的基因组工程化。兴趣突变体生物通常具有与缺少突变的对应野生型或祖先株的表型不同的表型,其中所述表型可以通过生长测定、产物分析、光合性质、生物化学测定等来评估。当提及基因“突变体”时,意指相对于自然或野生型基因,所述基因具有至少一个碱基(核苷酸)改变、缺失或插入。突变(一个或多个核苷酸的改变、缺失和/或插入)可以在基因的编码区中,或可以在内含子、3'UTR、5'UTR或启动子区中,例如转录起始位点的2kb内或翻译起始位点的3kb内。作为非限制性实例,突变体基因可以是在启动子区内具有可以增加或减少基因表达的插入的基因;可以是具有缺失的基因,从而导致非功能性蛋白质、截短蛋白质、显性阴性蛋白质或无蛋白质的产生;可以是具有一个或多个点突变的基因,所述点突变导致经过编码的蛋白质的氨基酸改变或导致基因转录物的异常剪接等。如本文所使用的,“突变体”是指一种非天然存在的并在基因中具有突变的生物,所述突变是由于经典诱变(例如,使用γ射线照射、UV或化学诱变剂)而出现的。如本文所使用的,“突变体”还指由于基因工程化而具有改变的基因结构或表达的重组细胞,作为非限制性实例,所述重组细胞可以包含过表达,包含在不同的时间、生物、或环境调节下的基因表达和/或与天然发生的程度不同的基因表达和/或在重组细胞中非天然表达的基因表达。

术语“Pfam”是指由Pfam协会(Pfam Consortium)维护的蛋白质结构域和蛋白质家族的大型集合,并且可在以下获得:维康信托基金会(Welcome Trust),桑格研究所(SangerInstitute);pfam.sbc.su.se(斯德哥尔摩生物信息学中心(Stockholm BioinformaticsCenter);珍利亚农场(Janelia Farm),霍华德·休斯医学研究所(Howard Hughes MedicalInstitute);国家农学研究所(Institut national de la Recherche Agronomique)。Pfam的最新版本是基于UniProt蛋白质数据库版本2012_06的Pfam 27.0(2013年3月)。使用多序列比对和隐马尔可夫模型(HMM)来鉴定Pfam结构域和家族。Pfam-A家族或结构域分配是使用蛋白质家族的代表性成员通过策划的种子比对而产生的高质量分配,并且基于种子比对概述隐马尔可夫模型。(除非另有说明,否则所查询的蛋白质与Pfam结构域或家族的匹配是Pfam-A匹配。)然后,将属于家族的所有经过鉴定的序列用于自动生成所述家族的完全比对(Sonnhammer(1998),《核酸研究(Nucleic Acids Research)》26,320-322;Bateman(2000),《核酸研究》26,263-266;Bateman(2004),《核酸研究》32,《数据库专辑(Database Issue)》,D138-D141;Finn(2006),《核酸研究-数据库专辑(Database Issue)》34,D247-251;Finn(2010),《核酸研究-数据库专辑》38,D211-222)。通过访问Pfam数据库,例如使用任何上述网站,可以使用HMMER同源性搜索软件(例如,HMMER2、HMMER3或更高版本)针对HMM来查询蛋白质序列。将所查询的蛋白质鉴定为处于Pfam家族(或具有特定Pfam结构域)的显著匹配是其中位评分大于或等于Pfam结构域的聚集阈值的那些匹配。期望值(e值)也可以用作在Pfam中包含所查询的蛋白质或确定所查询的蛋白质是否具有特定Pfam结构域的标准,其中低e值(远小于1.0,例如小于0.1,或小于或等于0.01)表示匹配归因于机会的低概率。

如本文所使用的,术语“光合生物”是指可以将光能转化为化学能的生物。在一些实施例中,化学能可以稍后被释放以给这些生物的活动提供燃料(能量转化)。在一些实施例中,这种化学能储存于碳水化合物分子(如糖)中,所述碳水化合物分子从二氧化碳和水中合成。

光合生物的非限制性实例包含植物、藻类和蓝藻细菌。藻类的非限制性实例属于属:曲壳藻属、茧形藻属、双眉藻属、纤维藻属、星胞藻属、黄金色藻属、波利氏藻属、包特氏菌属、气球藻属、葡萄藻属、荀球藻属、角毛藻属、四鞭藻属、衣藻属、绿球藻属、绿梭藻属、小球藻属、蓝隐藻属、金球藻属、球钙板藻属、隐甲藻属、隐藻属、小环藻属、杜氏藻属、椭球藻属、球石藻属、独球藻属、衣迪斯藻属、裸藻属、真眼点藻属、伏氏藻属、脆杆藻属、丽丝藻属、红球藻属、嗜盐古菌、异弯藻属、膜胞藻属、等鞭金藻属、鳞孔藻属、微芒藻属、蒜头藻属、单针藻属、微球藻属、微拟球藻属、舟形藻属、新绿藻属、肾鞭藻属、肾爿藻属、菱形藻属、棕鞭藻属、鞘藻属、卵囊藻属、蚝球藻属、巴夫藻属、拟小球藻属、帕氏藻属、普莱格门、褐指藻属、噬菌体属、微绿藻属、扁藻属、颗石藻属、宽球藻属、原壁菌属、假小球藻属、拟新绿藻属、假十字趾藻属、塔胞藻属、葡串藻属、栅列藻属、骨条藻属、螺旋藻属、裂丝藻属、四爿藻属、海链藻属、黄丝藻属、无隔藻属、鲜绿球藻属、魏氏藻属和团藻属。

植物的非限制性实例包含沙生芥(Arabidopsis arenicola)、沙芥(Arabidopsisarenosa)、Arabidopsis cebennensis、Arabidopsis croatica、圆叶拟南芥、琴叶拟南芥、Arabidopsis neglecta、Arabidopsis pedemontana、Arabidopsis suecica、拟南芥、玉米、稻、小麦、马铃薯、洋葱、大蒜、大豆、番茄、陆地棉、草本棉(Gossypium herbaceum)、木本棉(Gossypium arboreum)、毛棉(Gossypium tomentosum)、黑芥(Brassica nigra)和芸薹属(Brassica sp)。

如本文所使用的,术语“突变体光合生物”或“突变体藻类”是指一种光合生物或藻类,其中至少SGI1、SGI2、SGI1和SRP54的组合、SGI2和SRP54的组合或SGI1、SGI2和SRP54的组合被调节。此类调节可以包含核酸序列的改变或一个或多个基因表达的改变。

如本文所使用的,SGI1和SRP54基因的组合的调节是指在同一光合生物中的SGI1的调节和SRP54基因的调节。类似地,SGI2和SRP54基因的组合的调节是指在同一光合生物中的SGI2的调节和SRP54基因的调节。同样地,SGI1、SGI2和SRP54基因的组合的调节是指在同一光合生物中的SGI1的调节、SGI2的调节和SRP54基因的调节。

如本文所使用的,术语对照光合生物是指一种光合生物,所述光合生物在所有相关方面与突变体光合生物在基因上基本上相同,除了对照光合生物不具有突变或弱化的SRP54、SGI1、SGI2或两个或更多个基因的组合之外。例如,对照光合生物是同一物种,并且除了改变cpSRP54、胞质SRP54、SGI1或SGI2基因或用于使cpSRP54弱化的构建体之外,存在于突变体中的胞质SRP54、SGI1、SGI2基因在基因上是相同的,除了不影响通过正常繁殖在诱变期间可能发生的细胞生理学的小的基因组变化(例如,“SNP”)之外。在各个实施例中,对照光合生物是一种菌株,从所述菌株中衍生出具有胞质SRP54、cpSRP54、SGI1、SGI2或至少两个基因的组合的弱化表达的突变体光合生物。

当提及光合生物(如藻类)时,术语“适应低光”意指所述光合生物在暴露于低光强度持续一段时间之后具有增加的叶绿素和光合性质,所述时间足以使叶绿素和光合性质的改变在低光条件下稳定。低光可以是例如小于200μE·m-2·s-1,并且优选地,约100μE·m-2·s-1或更小或50μE·m-2·s-1或更小,并且用于适应的时间段可以是至少约四小时、至少约六小时、至少约八小时、或至少约十二小时、至少24小时或至少48个小时,并且可能长达2天、3天、4天或5天。

“cDNA”是包括mRNA分子的核苷酸序列的至少一部分的DNA分子,除了所述DNA分子取代了存在于mRNA序列中的核碱基胸腺嘧啶或T以代替尿苷或U。cDNA可以是双链或单链的,并且可以是例如mRNA序列的互补序列。在优选的实例中,cDNA不包含一个或多个内含子序列,所述内含子序列存在于cDNA所对应的天然存在的基因中(即,存在于生物的基因组中的基因)。例如,cDNA可以具有来自天然存在的基因的内含子上游的序列,所述序列与天然存在的基因的内含子下游的序列并置,其中上游和下游序列在自然界中不在DNA分子中并置(即,这些序列在天然存在的基因中不并置)。cDNA可以通过mRNA分子的逆转录产生,或可以例如通过化学合成和/或通过使用一种或多种限制酶、一种或多种连接酶、一种或多种聚合酶(包含但不限于可用于聚合酶链式反应(PCR)的耐高温聚合酶)、一种或多种重组酶等基于对cDNA序列的了解来合成,其中对cDNA序列的了解可以任选地基于对来自基因组序列的编码区或从序列中编译的多个部分cDNA的鉴别。

“在低光适应中失调的”藻类突变体(或“在高光适应下锁定”或LIHLA突变体)是不展现出表型和基因表达变化的突变体,所述变化是低光适应的野生型藻类细胞的特征,包含:叶绿素的显著增加和大部分光收获复合蛋白(LHCP)基因的表达的显著增加。当适应低光时,在低光适应中失调的藻类突变体相对于在野生型细胞的低光适应期间上调的多个基因(例如,至少十个、至少二十个、至少三十个、至少四十个或至少五十个基因)的低光适应的野生型细胞具有减少的表达。进一步地,在低光适应中失调的藻类突变体相对于在野生型细胞的低光适应期间下调的(例如,至少五个、至少六个、至少七个、至少八个、至少九个或至少十个基因)低光适应的野生型细胞具有增加的基因表达。进一步地,如本文所公开的,当突变体和野生型细胞两者均适应低光时,在低光适应中失调的藻类突变体可以具有与野生型细胞的光合性质显著不同的光合性质。

“光合性质”、“光合性质”,“光生理性质”或光生理参数”包含但不限于最大光合速率Pmax(在每个细胞或每毫克叶绿素的基础上进行计算)、光合作用饱和下的强度Ek(如通过析氧所测量的)以及α(“alpha(α)”)光合作用(析氧)与辐照度强度(P/I)曲线的初始斜率。另外的光合性质包含可以使用荧光检测进行测量的各种参数,包含例如光系统II中的光化学最大量子产率Fv/FM;光系统II(PSII)的光合量子产率,φPSII;光化学淬灭,或开放PSII中心的比例,qP;非光化学淬灭,NPQ;PSII电子传递速率,ETRPSII;PSI电子传递速率,ETRPSI;PSI的功能性吸收横截面尺寸(σPSI)和PSII的功能性吸收横截面(σPSII)。此处的列表并不详尽,并且术语不排除测量光合作用的各个方面的其它参数。

提及“基本上相同”的性质旨在意指这些性质处于参考值的10%以内,并且优选地处于5%以内。

虽然类似于或等同于本文所描述的那些方法和材料的方法和材料可以用于本发明的实践或测试中,但是下面描述了合适的方法和材料。材料、方法和实例仅是说明性的并且不旨是限制性的。根据详细描述和权利要求,本发明的其它特征和优点将变得显而易见。

基因弱化

突变体光合生物可以是通过任何可行的方法生成的突变体,所述方法包含但不限于UV辐照、γ辐照或化学诱变,并且筛选具有本文所公开的光合性质的低叶绿素突变体。用于生成微生物菌株的突变体的方法是众所周知的。可以通过本领域已知的方法来鉴定突变体,包含例如基因组测序、PCR、cpSRP54或cytoSRP54蛋白的免疫检测以及表达分析(例如,逆转录/PCR)。

本文所提供的突变体光合生物也可以在SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54的组合中进行基因工程化,例如,其已经通过同源重组靶向敲除或基因置换(例如,用可以编码相对于野生型多肽活性降低的多肽的基因突变形式)。在另外的实例中,可以通过定点同源重组来工程化兴趣藻类菌株,以将具有或不具有表达控制序列的特定兴趣基因(如启动子)插入特定基因组基因座中,或将启动子插入宿主微生物的遗传基因座中,以影响特定基因或一组基因在基因座处的表达。

例如,通过同源重组进行的基因敲除或置换可以通过核酸(例如,DNA)片段的转化来进行,所述核酸片段包含与待改变的基因组区域同源的序列,其中所述同源序列被外源序列(通常为允许选择整合的构建体的选择性标记基因)间断。在外源序列或突变的基因序列的任一侧上的基因组同源侧翼序列的长度可以是例如至少50个、至少100个、至少200个、至少300个、至少400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1,000个、至少1,200个、至少1,500个、至少1,750个或至少2,000个核苷酸。基因敲除或基因“敲入”构建体(其中外源序列侧翼为靶基因序列)可以提供于载体中,所述载体可以任选地被线性化,例如,在经历同源重组的区域之外被线性化,或可以被提供为不在载体的背景下的线性片段,例如,敲除或敲入构建体可以是分离的或合成的片段,包含但不限于PCR产物。在一些实例中,分裂标记系统可以用于通过同源重组生成基因敲除,其中可以引入两个DNA片段,所述片段可以再生可选择标记并且通过三个交换事件破坏兴趣基因座(Jeong等人(2007),《FEMS微生物学快报(FEMS Microbiol Lett)》273:157-163)。

一方面,本发明提供了经过基因修饰的生物,例如具有一种或多种用于使SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达弱化的基因修饰的微生物。如本文所使用的,“弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达”意指以减少全功能性蛋白质产生的任何方式减少或消除一种或多种上述基因的表达。

例如,被工程化为具有弱化的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的重组光合生物可以具有破坏的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合,其中所述重组微生物可以具有SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合,其包含至少一个减少或消除基因表达的插入、突变或缺失,使得不产生或以比同一物种的对照光合生物所产生的量更低的量产生全功能性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合或cytoSRP54基因。被破坏的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合、SGI2和cpSRP54的组合或cytoSRP54基因可以通过例如,通过同源重组和/或通过大范围核酸酶、锌指核酸酶的活性介导的插入或基因置换而被破坏(Perez-Pinera等人,(2012),《化学生物学当代评论(Curr.Opin.Chem.Biol.)》16:268-277),TALEN(WO 2014/207043;WO2014/076571)或RNA引导的核酸内切酶,如CRISPR系统的cas蛋白(例如,Cas9蛋白)。

CRISPR系统,最近由Hsu等人(《细胞(Cell)》157:1262-1278,2014)综述,除了Cas核酸酶多肽或复合物之外,还包含靶向RNA(通常称为“crRNA”),其通过与靶位点序列的互补性与基因组靶位点相互作用,还包含反式激活(“tracr”)RNA,其与Cas多肽复合,并且还包含与靶向crRNA结合(通过互补性)的区域。

本发明考虑了两个RNA分子(“crRNA”和“tracrRNA”)的用途,这两个RNA分子可以被共转化到表达cas蛋白或用cas蛋白转染的宿主菌株(或在宿主菌株中表达)中以用于基因组编辑,或包含与靶序列互补的序列以及与cas蛋白相互作用的序列的单一指导RNA的用途。也就是说,在一些策略中,本文所使用的CRISPR系统可以包括两个分开的RNA分子(RNA多核苷酸:“tracr-RNA”和“targeter-RNA”或“crRNA”,参见下文),并且在本文中被称为“双分子DNA靶向RNA(double-molecule DNA-targeting RNA)”或“双分子DNA靶向RNA(two-molecule DNA-targeting RNA)”。可替代地,如实例中所示出的,DNA靶向RNA还可以包含用于与Cas蛋白相互作用的反式激活序列(除了靶向同源(“cr”)序列之外),即DNA靶向RNA可以是单个RNA分子(单个RNA多核苷酸),并且在本文中被称为“嵌合引导RNA”、“单引导RNA”或“sgRNA”。术语“DNA靶向RNA”和“gRNA”是包含性的,是指双分子DNA靶向RNA和单分子DNA靶向RNA(即,sgRNA)两者。单分子引导RNA和两种RNA系统均已经在文献中并且例如,在美国专利申请公开号US 2014/0068797中进行了详细描述,所述文献通过引用整体并入本文。

任何Cas蛋白可以用于本文的方法中,例如,Cas1、Cas1B、Cas2、Cas3、Cas4、Cas5、Cas6、Cas7、Cas8、Cas9(也称为Csn1和Csx12)、Cas10、Csy1、Csy2、Csy3、Cse1、Cse2、Csc1、Csc2、Csa5、Csn2、Csm2、Csm3、Csm4、Csm5、Csm6、Cmr1、Cmr3、Cmr4、Cmr5、Cmr6、Csb1、Csb2、Csb3、Csx17、Csx14、Csx10、Csx16、CsaX、Csx3、Csx1、Csx15、Csf1、Csf2、Csf3、Csf4、其同源物或其修饰形式。在一些实施例中,Cas蛋白是II类Cas蛋白。作为非限制性实例,Cas蛋白可以是Cas9蛋白,如酿脓葡萄球菌(Staphylococcus pyogenes)、嗜热链球菌(S.thermophilus)、肺炎链球菌(S.pneumonia)、金黄色葡萄球菌(S.aureus)或脑膜炎奈瑟氏球菌(Neisseria meningitidis)的Cas9蛋白。其它兴趣Cas蛋白包含但不限于Cpf1 RNA引导的核酸内切酶(Zetsche等人(2015),《细胞》163:1-13)以及C2c1、C2c2、C2c3 RNA引导的核酸酶(Shmakov等人(2015),《分子细胞(Molecular Cell)》60:1-13)。还考虑了在美国专利申请公开号US 2014/0068797中提供为SEQ ID NO:1-256和795-1346的Cas9蛋白,以及可以结合来自多于一个Cas9蛋白的结构域的嵌合Cas9蛋白,以及鉴定的cas9蛋白的变体和的突变体。(例如,由引入到宿主细胞中的核酸分子编码的Cas9蛋白可以包括相对于野生型Cas9蛋白的至少一个突变;例如,Cas9蛋白可以在所述蛋白的切割结构域之一中失活,从而产生“切口酶”变体。突变的非限制性实例包含D10A、H840A、N854A和N863A。)可以对兴趣宿主细胞的对Cas蛋白进行编码的核酸序列进行密码子优化。

Cas核酸酶活性切割靶DNA以产生双链断裂。然后通过细胞以两种方式之一来修复这些断裂:非同源末端连接或同源性定向修复。在非同源末端连接(NHEJ)中,双链断裂通过将断裂末端彼此直接连接来修复。在这种情况下,没有新的核酸材料插入到位点中,但是一些核酸材料可能丢失,从而导致缺失或改变,通常导致突变。在同源性定向修复中,将可以与切割的靶DNA序列具有同源性的供体多核苷酸(有时称为“供体DNA”或“编辑DNA”)用作用于修复切割的靶DNA序列的模板,从而导致将基因信息从供体多核苷酸转移到靶DNA中。这样,新的核酸材料可以插入/复制到位点中。由于NHEJ和/或同源性定向修复(例如,使用供体DNA分子)引起的靶DNA的修饰可以导致例如基因校正、基因置换、基因标记、转基因插入、核苷酸缺失、基因破坏、基因突变等。

在一些实例中,通过定点修饰多肽(例如,Cas核酸酶、锌指核酸酶、大范围核酸酶或TALEN)切割DNA可以用于通过切割靶DNA序列并且允许细胞在不存在外源提供的供体多核苷酸的情况下修复序列来从靶DNA序列中缺失核酸材料。此类NHEJ事件可以在切割的末端的重新连接的位点处导致突变(“错误修复”),从而导致基因破坏。

可替代地,如果将DNA靶向RNA以及供体DNA共同施用于表达cas核酸酶的细胞,则主题方法可以用于向靶DNA序列添加(即,插入或置换)核酸材料(例如,通过插入诱变“敲除”或“敲入”编码蛋白质(例如,可选择标记和/或任何兴趣蛋白质)、siRNA、miRNA等核酸,以修饰核酸序列(例如,引入突变)。

在特定实施例中,供体DNA可以包含基因调控序列(例如,启动子),所述基因调控序列可以使用CRISPR靶向,插入基因的编码区上游和基因的推定的近端启动子区的上游,例如,cpSRP54基因的编码区起始ATG上游至少50bp、至少100bp、至少120bp、至少150bp、至少200bp、至少250bp、至少300bp、至少350bp、至少400bp、至少450bp或至少500bp。供体DNA可以包含可能干扰天然启动子的序列,如可选择标记或任何方便的序列。插入在SGI1、SGI2、cpSRP54、cytoSRP54、基因的组合或开放阅读框(例如,在5'UTR或cpSRP54基因的转录起始位点的上游)的组合的起始ATG上游的另外的序列可以减少或甚至消除内源性SGI1、SGI2、cpSRP54、cytoSRP54、基因的组合的表达。可替代地或另外,天然SGI1、SGI2、cpSRP54、cytoSRP54或基因的组合可以使其内源性启动子全部或部分地被较弱或不同调控的启动子或非启动子序列置换。

在一些实例中,引入到宿主细胞中用于生成高效基因组编辑细胞系的核酸分子编码相对于对应的野生型酶突变的Cas9酶,使得突变的Cas9酶缺乏切割含有靶序列的靶多核苷酸的一条或两条链的能力。例如,来自酿脓链球菌的Cas9的RuvC I催化结构域中的天冬氨酸至丙氨酸取代(D10A)将Cas9从切割两条链的核酸酶转化为切口酶(切割单链的酶)。致使Cas9成为切口酶的突变的其它实例包含但不限于H840A、N854A和N863A。在一些实施例中,Cas9切口酶可以与一个或多个引导序列(例如,两个引导序列)组合使用,所述引导序列分别靶向DNA靶的有义链和反义链。此组合允许两条链被切割并用于诱导NHEJ。两种切口酶靶(紧密接近但靶向DNA的不同链内)可以用于诱导诱变的NHEJ。使用在交错位置处切割相对菌株的酶对基因座的此类靶向还可以减少非靶切割,因为两条链必须被精确地且特异性地切割以实现基因组突变。

在另外的实例中,可以在细胞中表达其切割DNA的能力方面受损的突变体Cas9酶,其中还引入靶向靶向基因的转录或翻译起始位点上游的序列的一个或多个引导RNA。在这种情况下,Cas酶可以结合靶序列并阻断靶向基因的转录(Qi等人(2013),《细胞》152:1173-1183)。

在一些情况下,Cas多肽(如Cas9多肽)是融合多肽,包括例如:i)Cas9多肽(其可以任选地是如上所述的变体Cas9多肽);以及b)共价连接的异源多肽(也被称为“融合配偶体”)。异源核酸序列可以连接到另一核酸序列(例如,通过基因工程)以生成对嵌合多肽进行编码的嵌合核苷酸序列。在一些实施例中,Cas9融合多肽是通过将Cas9多肽与提供亚细胞定位的异源序列融合生成的(即,异源序列是亚细胞定位序列,例如,用于靶向细胞核的核定位信号(NLS);用于靶向线粒体的线粒体定位信号;用于靶向叶绿体的叶绿体定位信号;ER滞留信号等)。在一些实施例中,异源序列可以提供标签(即,异源序列是可检测标记),以便于追踪和/或纯化(例如,荧光蛋白,例如,绿色荧光蛋白(GFP)、YFP、RFP、CFP、mCherry、tdTomato等;血凝素(HA)标签;FLAG标签;Myc标签等)。

宿主细胞可以用例如载体构建体进行基因工程化(例如转导、转化或转染),所述载体构建体可以是例如用于同源重组的载体,所述载体包含与SGI1、SGI2、cpSRP54、cytoSRP54的一部分、SGI1和cpSRP54基因的组合或宿主细胞的SGI2和cpSRP54基因基因座的组合或与其相邻的区域同源的核酸序列,或可以是用于表达以下任何一种或组合的表达载体:Cas蛋白(例如,II类Cas蛋白)、CRISPR嵌合引导RNA、crRNA和/或tracrRNA、RNAi构建体(例如,shRNA)、反义RNA或核酶。载体可以是例如质粒、病毒颗粒、噬菌体等形式。用于表达用于基因组编辑的多肽或RNA的载体也可以设计用于整合到宿主中,例如通过同源重组。含有本文所描述的多核苷酸序列的载体,例如,与宿主SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因序列(包含cpSRP54或cytoSRP54编码序列的上游和下游的序列)的组合具有同源性的序列,以及任选地,可选择标记或报告基因可以用于转化合适的宿主,以导致SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的弱化。

在一些实例中,重组光合生物可以减少但不消除SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达,并且重组光合生物可以具有叶绿素约10%到约90%的减少,例如,总叶绿素约20%减少到约80%的减少。如本文提供的经过基因修饰的微生物可以在一些实例中包含用于弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的核酸构建体。例如,宿主微生物可以包含用于表达减少SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的RNAi分子、核酶或反义分子的构建体。在一些实例中,如本文所提供的重组微生物可以包含用于减少SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的至少一种引入的(外源或非天然的)构建体。

使用本领域已知的方法,例如,RNA-Seq或逆转录PCR(RT-PCR),可以选择工程化菌株用于表达SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合,其相对于不包含用于弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因表达的组合的基因修饰的对照细胞减少,但不消除。

如本文所提供的基因工程化的菌株可以被工程化以包含用于通过减少对SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合进行编码的基因的mRNA的量、稳定性或翻译性来弱化基因表达的构建体。例如,使用本领域已知的方法,可以用靶向SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的mRNA的反义RNA、RNAi或核酶构建体转化光合生物,如植物或藻类或不等鞭毛菌株(heterokont strain)。例如,可以将包含基因的全部或部分转录区的反义RNA构建体引入到微生物中以减少基因表达(Shroda等人(1999),《植物细胞(The PlantCell)》11:1165-78;Ngiam等人(2000),《应用环境微生物学期刊(Appl.Environ.Microbiol.)》66:775-782;Ohnuma等人(2009),《原生质(Protoplasma)》236:107-112;Lavaud等人(2012),《公共科学图书馆期刊(PLoS One)》7:e36806)。可替代地或另外,可以将靶向cpSRP54或cytoSRP54基因的RNAi构建体(例如,对短发夹RNA进行编码的构建体)引入到微生物(如藻类或不等鞭毛体)中,以用于减少cpSRP54或cytoSRP54基因的表达(参见,例如,Cerruti等人(2011),《真核細胞(Eukaryotic Cell)》(2011)10:1164-1172;Shroda等人(2006),《当代遗传学(Curr.Genet.)》49:69-84))。

核酶是以位点特异性方式切割核酸的RNA-蛋白质复合物。核酶具有拥有核酸内切酶活性的特异性催化结构域。例如,美国专利第5,354,855号报告了某些核酶可以充当核酸内切酶,其序列特异性大于已知的核糖核酸酶的序列特异性,并且接近DNA限制酶的序列特异性。催化性RNA构建体(核酶)可以被设计成与对本文所提供的基因进行编码的mRNA碱基配对,以切割mRNA靶。在一些实例中,核酶序列可以整合在反义RNA构建体内以介导靶的切割。可以考虑各种类型的核酶,其设计和用途是本领域已知的,并且例如在Haseloff等人(1988),《自然(Nature)》334:585-591中描述的。

借助于互补碱基对相互作用与位点退火,将核酶靶向给定序列。这种靶向需要两个同源性段。这些同源序列的段位于上文所定义的催化性核酶结构的侧翼。同源序列的每个段的长度可以为7到15个核苷酸不等。定义同源序列的唯一要求是,在靶RNA上,其被作为切割位点的特定序列分开。对于锤头状核酶,切割位点是靶RNA上的二核苷酸序列,其为尿嘧啶(U),然后是腺嘌呤、胞嘧啶或尿嘧啶(A、C或U)(Thompson等人(1995),《核酸研究》23:2250-68)。此二核苷酸在任何给定RNA中存在的频率在统计学上是16个中的3个。因此,对于给定的1,000个碱基的靶信使RNA,187个二核苷酸切割位点在统计学上是可能的。

已经详细讨论了核酶指导的RNA切割活性的一般设计和优化(Haseloff和Gerlach(1988),《自然》334:585-591;Symons(1992),《生物化学年鉴(Ann Rev Biochem)》61:641-71;Chowrira等人(1994),《生物化学期刊(J Biol Chem)》269:25856-64;Thompson等人(1995),见上文)。设计和测试用于有效切割靶RNA的核酶是本领域的技术人员众所周知的方法。用于设计和测试核酶的科学方法的实例是由Chorwrira等人(1994),见上文;以及Lieber和Strauss(1995),《分子细胞生物学(Mol Cell Biol.)》15:540-51描述的,每个所述文献通过引入并入。用于下调给定基因的有效和优选序列的鉴定是制备和测试给定序列的问题,并且是本领域的技术人员已知的常规实践的“筛选”方法。

RNAi构建体的用途在上文所引用的文献中以及例如US2005/0166289和WO 2013/016267中进行了描述。将与靶基因具有同源性的双链RNA递送到细胞或通过RNAi构建体(例如,RNAi短发夹(sh)构建体)的表达在细胞中产生。构建体可以包含与靶基因相同的序列,或与靶基因的序列至少70%、80%、90%、95%或95%与100%之间相同。构建体可以具有至少20个、至少30个、至少40个、至少50个、至少100个、至少200个、至少300个、至少400个、至少500个、至少600个、至少700个、至少800个、至少900个、或至少1kb的与靶基因同源的序列。可以使用选择用于RNAi构建体(如产生shRNA的构建体)的连续或诱导型表达的启动子来工程化表达载体。

用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个或至少六十个核苷酸,所述核苷酸与内源性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或待工程化的微生物的SGI2和cpSRP54基因的组合的序列的至少一部分具有至少80%同一性,如至少85%、至少90%、至少95%或至少99%或互补性。用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个或至少六十个核苷酸,所述核苷酸与天然存在的基因(如对多肽进行编码的基因,其与内源性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%或至少85%、至少90%或至少95%序列同一性)的序列具有至少80%,如至少95%、或约100%同一性或互补性。例如,用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个或至少六十个核苷酸,所述核苷酸与天然存在的cpSRP54基因(如本文所提供的任何基因)的序列具有至少80%同一性或互补性。核苷酸序列可以是例如,约30个核苷酸到约3千碱基或更大,例如,长度为30-50个核苷酸、长度为50到100个核苷酸、长度为100到500个核苷酸、长度为500个核苷酸到1kb、长度为1kb到2kb或2到5kb。例如,反义序列的长度可以为约100个核苷酸到约1kb。例如,用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个、至少六十个或至少100个核苷酸,所述核苷酸与内源性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合或其部分具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、或至少85%,例如,至少86%、至少87%、至少88%、至少89%、至少90%、至少91%、至少92%、至少93%、至少94%或至少95%同一性或互补性。

反义、RNAi或核酶构建体中所使用的启动子可以是在宿主生物中起作用并且适合于将靶基因的表达减少到期望量所需的表达水平的任何启动子。在藻类和不等鞭毛体中起作用的启动子是本领域已知的并且在本文中公开的。可以使用任何可行的方法(包含本文公开的任何方法)将构建体转化为藻类。用核酸分子(所述核酸分子用于弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因表达的组合,如但不限于反义、RNAi或核酶构建体)转化的重组生物或微生物,可以具有本文所描述的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54突变体的组合的性质,包含例如,相对于不包含导致弱化的基因表达的外源核酸分子的宿主生物或微生物,减少的叶绿素、增加的光合效率以及增加的培养生产力。

核酸分子和构建体

本领域的技术人员应了解,许多转化方法可以用于微生物的基因转化,并且因此可用于本发明的方法。“稳定转化”旨在意指引入到生物中的核酸构建体整合到生物的基因组中,或是稳定的附加型构建体的一部分,并且能够被其后代遗传。“瞬时转化”旨在意指将多核苷酸引入到生物中,并且不整合到基因组中,或以其它方式通过连续世代而被建立并稳定地遗传。

遗传转化可以导致转基因、来自细胞核或质体的构建体的稳定插入和/或表达,并且在一些情况下可以导致转基因的瞬时表达。这些转化方法也可以用于引入引导RNA或编辑DNA。已经报告,对于多于30种不同的微藻株(其属于绿藻、红藻和褐藻、硅藻、裸藻藻和鞭毛藻的至少约22个物种)而言,微藻的基因转化是成功的(参见,例如,Radakovits等人,《真核细胞》,2010;以及Gong等人,《工业微生物学与生物技术期刊(J.Ind.Microbiol.Biotechnol.)》,2011)。此类有用的转化方法的非限制性实例包含在玻璃珠或碳化硅晶须存在下的搅拌细胞,如由以下报告的,例如,Dunahay,《生物技术(Biotechniques)》,15(3):452–460,1993;Kindle,《美国国家科学院院刊》,1990;Michael和Miller,《植物学杂志(Plant J.)》,13,427–435,1998。电穿孔技术已经成功用于若干种微藻物种的基因转化,所述微藻物种包含微拟球藻属(参见例如,Chen等人,《藻类学杂志(J.Phycol.)》,44:768–76,2008)、小球藻属(参见,例如,Chen等人,《当代遗传学》,39:365-370,2001;Chow和Tung,《植物细胞报告(Plant Cell Rep.)》,第18卷,第9期,778-780,1999)、衣藻属(Shimogawara等人,《遗传学(Genetics)》,148:1821–1828,1998)、杜氏藻属(Sun等人,《分子生物技术(Mol.Biotechnol.)》,30(3):185-192,2005)。微弹轰击,也称为微粒轰击、基因枪转化或生物弹轰击,已经成功用于若干种藻类物种,包含例如硅藻物种(如褐指藻属)(Apt等人,《分子和普通遗传学(Mol.Gen.Genet.)》,252:572-579,1996)、小环藻属和舟形藻属(Dunahay等人,《藻类学杂志》,31:1004–1012,1995)、细柱藻属(Fischer等人,《藻类学杂志》,35:113-120,1999)以及角毛藻属(Miyagawa-Yamaguchi等人,《藻类学研究(Phycol.Res.)》59:113–119,2011),以及绿色藻类物种如小球藻属(El-Sheekh,《植物生物学(Biologia Plantarum)》,第42卷,第2期:209-216,1999)和团藻物种(Jakobiak等人,《原生生物(Protist)》,155:381-93,2004)。另外,农杆菌介导的基因转移技术也可用于微藻的基因转化,如已经由以下所报告的,例如,Kumar,《植物科学(Plant Sci.)》,166(3):731-738,2004以及Cheney等人,《藻类学杂志》,第37卷,增刊11,2001。

如本文所描述的转化载体或构建体将典型地包括标记基因,所述标记基因在靶宿主细胞(例如,藻类细胞)上赋予可选择或可评分的表型,或可以与包含标记的构建体共转化。已经成功地开发了用于有效分离藻类的基因转化体的许多可选择标记。常见的可选择标记包含抗生素抗性、荧光标记和生物化学标记。若干种不同的抗生素抗性基因已经成功地用于选择微藻转化体,包含灭瘟素(blastocydin)、博来霉素(bleomycin)(参见例如,Apt等人,1996,见上文;Fischer等人,1999,见上文;Fuhrmann等人,《植物杂志》,19,353-61,1999,Lumbreras等人,《植物学杂志》,14(4):441-447,1998;Zaslavskaia等人,《藻类学杂志》,36:379–386,2000)、壮观霉素(spectinomycin)(Cerutti等人,《遗传学》,145:97–110,1997;Doetsch等人,《当代遗传学》,39,49-60,2001;Fargo,《分子细胞生物学》,19:6980-90,1999)、链霉素(streptomycin)(Berthold等人,《原生生物》,153:401-412,2002)、巴龙霉素(paromomycin)(Jakobiak等人,《原生生物》见上文;Sizova等人,《基因(Gene)》,277:221-229,2001)、诺尔丝菌素(nourseothricin)(Zaslavskaia等人,2000,见上文)、G418(Dunahay等人,1995,见上文;Poulsen和Kroger,《FEBS快报(FEBS Lett.)》,272:3413–3423,2005;Zaslavskaia等人,2000,见上文)、潮霉素(hygromycin)(Berthold等人,2002,见上文)、氯霉素(Poulsen和Kroger,2005,见上文)以及许多其它微藻转化体。用于微藻类(如衣藻属)的另外可选择标记可以是提供对以下的抗性的标记:卡那霉素和阿米卡星抗性(Bateman,《分子和普通遗传学》263:404-10,2000)、沙阿霉素(zeomycin)和腐草霉素(phleomycin)(例如,ZEOCINTM腐草霉素D1)抗性(Stevens,《分子和普通遗传学》251:23-30,1996)以及巴龙霉素(paramomycin)和新霉素(neomycin)抗性(Sizova等人,2001,见上文)。已经使用的其它荧光或生色标记包含荧光素酶(Falciatore等人,《水产生物技术期刊(J.Mar.Biotechnol.)》,1:239–251,1999;Fuhrmann等人,《植物分子生物学(PlantMol.Biol.)》,2004;Jarvis和Brown,《当代遗传学》,19:317–322,1991)、β-葡萄糖醛酸酶(Chen等人,2001,见上文;(Cheney等人,2001,见上文;Chow和Tung,1999,见上文;El-Sheekh,1999,见上文;Falciatore等人,1999,见上文;Kubler等人,《水产生物技术期刊》,1:165–169,1994)、β-半乳糖苷酶(Gan等人,《应用藻类学期刊(J.Appl.Phycol.)》,15:345–349,2003;Jiang等人,《植物细胞报告》,21:1211–1216,2003;Qin等人,《高技术通讯(HighTechnol.Lett.)》,13:87–89,2003)和绿色荧光蛋白(GFP)(Cheney等人,2001,见上文;Ender等人,《植物细胞》,2002;Franklin等人,《植物学杂志》,2002;56,148,210)。

本领域的技术人员将容易理解,多种已知的启动子序列可以有用地部署用于根据本发明的微藻物种的转化系统。例如,通常用于驱动微藻中的转基因表达的启动子包含多种形式的花椰菜花叶病毒启动子35S(CaMV35S),其已经用于甲藻门(dinoflagellate)和绿藻门(Chlorophyta)两者(Chow等人,《植物细胞报告》,18:778-780,1999;Jarvis和Brown,《当代遗传学》,317-321,1991;Lohuis和Miller,《植物学杂志》,13:427-435,1998)。还报告了来自猿猴病毒的SV40启动子在若干种藻类中是有活性的(Gan等人,《应用植物学期刊》,151345-349,2003;Qin等人,《水生生物学(Hydrobiologia)》398-399,469-472,1999)。来自衣藻属的RBCS2(二磷酸核酮糖羧化酶,小亚基)(Fuhrmann等人,《植物学杂志》,19:353-361,1999)和PsaD(光系统I复合物的丰富蛋白质;Fischer和Rochaix,《FEBS快报》581:5555-5560,2001)的启动子也可以是有用的。HSP70A/RBCS2和HSP70A/β2TUB(微管蛋白)的融合启动子(Schroda等人,《植物学杂志》,21:121-131,2000)也可以用于改善的转基因表达,其中HSP70A启动子在置于其它启动子上游时可以充当转录激活因子。兴趣基因的高水平表达也可以在例如硅藻物种中,在对硅藻岩藻黄质-叶绿素a/b结合蛋白进行编码的fcp基因(Falciatore等人,《水产生物技术期刊》,1:239-251,1999;Zaslavskaia等人,《藻类学杂志》36:379-386,2000)或对黄绿藻门黄质-叶绿素a/b结合蛋白进行编码的vcp基因(参见美国专利第8,318,482号)的启动子的控制下实现。如果需要,诱导型启动子可以在转基因微藻类中提供快速且严格控制的基因表达。例如,对硝酸盐还原酶进行编码的NR基因的启动子区可以用作此类诱导型启动子。NR启动子活性通常被铵抑制并在铵被硝酸盐置换时被诱导(Poulsen和Kroger,《FEBS快报》272:3413-3423,2005),因此当微藻细胞在铵/硝酸盐存在下生长时,可以关闭或打开基因表达。可以在本文所提供的构建体和转化系统中发现用途的另外的藻类启动子包含以下专利申请中所公开的那些藻类启动子:美国专利第8,883,993号;美国专利申请公开号US2013/0023035;美国专利申请公开号US 2013/0323780;以及美国专利申请公开号US 2014/0363892。

宿主细胞可以是未转化的细胞或是已经用至少一种核酸分子转染的细胞。例如,被工程化以具有弱化cpSRP54基因表达的藻类宿主细胞可以进一步包含可以赋予任何所期望特性的一种或多种基因,如但不限于增加的兴趣生物分子(如一种或多种蛋白质、颜料、醇或脂质)的产量。

从光合生物产生产物的方法

本文还提供了通过培养具有增加的光合效率的光合生物来从光合生物(如藻类)产生产物的方法,如本文所公开的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54突变体的组合。所述方法包含在合适的培养基中培养光合生物突变体SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54的组合以提供光合生物培养物,并且从所述培养物中回收生物质或至少一种产物。在一些实施例中,所述产物是脂质。包括光合生物的培养物优选地是光能自养培养物,并且培养基优选地不包含显著量的还原碳,即,培养物不包含藻类可以用于生长的形式或水平的还原碳。

在一些实施例中,光合生物可以在任何合适的容器中培养,包含烧瓶或生物反应器,其中所述光合生物可以暴露于人造光或自然光。可以在光/黑暗周期中培养包括突变的光合生物的培养物,所述光/黑暗周期可以是例如,自然的或程序化的光/黑暗周期,并且作为说明性实例,可以提供十二小时的光到十二小时的黑暗、十四小时的光到十小时的黑暗、十六小时的光到八小时的黑暗等。

培养是指通过使用所选和/或受控条件来有意促进一个或多个细胞的生长(例如,增加细胞大小、细胞含量和/或细胞活性)和/或繁殖(例如,通过有丝分裂增加细胞数量)。生长和繁殖两者的结合可以被称为增殖。如本文的实例所表明的,相对于在低光适应中未失调的同一菌株的培养野生型藻类细胞,本文所提供的对低光强度展现出失调的适应性的突变体可以随着时间推移(例如,在一周或更长的时间段内)实现培养物的更高细胞密度。例如,可以将cpSRP54突变体培养持续至少五天、至少六天、至少七天、至少八天、至少九天、至少十天、至少十一天、至少十二天、至少十三天、至少十四天或至少十五天,或至少一周、二周、三周、四周、五周、六周、七周、八周、九周或十周或更长时间。

可以用于培养重组微生物的所选和/或受控条件的非限制性实例可以包含使用限定的培养基(具有已知特征,如pH、离子强度和/或碳源)、指定温度、氧张力、二氧化碳水平、生物反应器中的生长等或其组合。在一些实施例中,可以使用光和还原碳源两者使微生物或宿主细胞混合营养生长。可替代地,微生物或宿主细胞可以光养培养。当光养生长时,藻类菌株可以有利地使用光作为能源。无机碳源(如CO2或碳酸氢盐)可以用于由微生物合成的生物分子。如本文所使用的,“无机碳”包含不能被生物用作可持续能源的含碳化合物或分子。通常,“无机碳”可以是CO2(二氧化碳)、碳酸、碳酸氢盐(bicarbonate salt)、碳酸盐、碳酸氢盐(bicarbonate salt)等或其组合的形式,其不能被进一步氧化以得到可持续的能量,也不能用作生物的还原力(reducing power)的来源。光能自养生长的微生物可以在培养基中生长,其中无机碳基本上是唯一碳源。例如,在其中无机碳基本上是唯一碳源的培养物中,可以在培养基中提供的任何有机(还原)碳分子或有机碳化合物不能由细胞吸收和/或代谢为能量和/或不以足以为细胞培养物的生长和增殖提供可持续能量的量存在。

根据本发明的方法可以使用的微生物和宿主细胞可以在世界各地的不同位置和环境中找到。用于脂质和/或其它产物的最佳繁殖和生成的特定生长培养基可以变化,并且可以被优化以促进生物质或产物(如脂质、蛋白质、色素、抗氧化剂)的生长、繁殖或产生。固体和液体生长培养基通常可从多种来源获得,如用于制备适用于多种微生物菌株的特定培养基的说明。例如,各种淡水和盐水培养基可以包含在Barsanti(2005),《藻类:解剖学、生物化学和生物技术(Algae:Anatomy,Biochemistry&Biotechnology)》,CRC出版社(CRCPress),用于培养藻类的培养基和方法中所描述的那些淡水和盐水培养基。藻类培养基配方也可以在各种藻类培养物保藏中心的网站上找到,包含作为非限制性实例,藻类的UTEX培养物保藏中心(www.sbs.utexas.edu/utex/media.aspx);藻类和原生动物的培养物保藏中心(www.ccap.ac.uk);和Katedra Botaniky(botany.natur.cuni.cz/algo/caup-media.html)。

培养方法可以任选地包含诱导用于产生产物(如但不限于参与脂质产生的蛋白质、一种或多种蛋白质、抗氧化剂或色素)的一种或多种基因的表达,和/或调控微生物中的代谢途径。诱导表达可以包含向培养物中添加营养物或化合物、从培养基中去除一种或多种组分、增加或减少光和/或温度和/或促进兴趣基因表达的其它操作。此类操作可以在很大程度上取决于可操作地连接到兴趣基因的(异源)启动子的性质。

在本发明的一些实施例中,在适应低光强度下失调的微生物可以在配备有人工光源和/或具有一个或多个壁的“光生物反应器”中进行培养,所述一个或多个壁对于光(包含阳光)是足够透明的以使得、促进和/或维持可接受的微生物生长和增殖。为了产生脂肪酸产物或甘油三酯,光合微生物或宿主细胞可以另外地或可替代地在摇瓶、试管、小瓶、微量滴定皿(microtiter dishes)、皮氏培养皿(Petri dishes)等或其组合中进行培养。

另外地或可替代地,重组光合微生物或宿主细胞可以在池塘、水道、海基型生长容器、沟渠、水沟、航道等或其组合中生长。在这样的系统中,温度可以是未调控的或可以采用各种加热或冷却方法或装置。与标准生物反应器一样,无机碳源(如但不限于CO2、碳酸氢盐、碳酸盐等)包含但不限于空气、富含CO2的空气、烟道气等或其组合可以提供给培养物。在提供烟道气和/或其它无机源(除CO2之外还可能含有CO)时,可能需要预处理此类来源,使得引入(光)生物反应器的CO水平不构成关于微生物的生长、增殖和/或存活的危险和/致命剂量。

突变体光合生物可以包含对用于产生产物的多肽进行编码的一种或多种非天然基因,所述产物如但不限于脂质、着色剂或色素、抗氧化剂、维生素、核苷酸、核酸、氨基酸、激素、细胞因子、肽、蛋白质或聚合物。例如,经过编码的多肽可以是酶、代谢调节剂、辅因子、载剂蛋白或转运蛋白。所述方法包含培养cpSRP54突变体或cytoSRP54突变体以产生生物质或至少一种藻类产物,所述突变体包含对参与产物产生的多肽进行编码的至少一种非天然基因。产物(如脂质和蛋白质)可以通过本领域的普通技术人员已知的回收方法从培养物中进行回收,如通过全培养物提取,例如使用有机溶剂。在一些情况下,脂肪酸产物的回收可以通过细胞均质化来增强。例如,可以通过在升高的温度和/或压力下用溶剂提取藻类而从藻类中分离脂质(如脂肪酸、脂肪酸衍生物和/或甘油三酯),如在共同未决的、共同转让的美国专利申请公开号US2013/0225846中所描述的,所述美国专利申请通过引用整体并入本文。

在审阅本公开之后,其它替代性实施例和方法对本领域的技术人员而言将是显而易见的。本文给出的一般方法的讨论仅旨在用于说明性目的。下文提供了以下非限制性实例。

实例

实例1

过表达CAS9的拟小球藻属菌株的生成

在美国专利申请公开2016/0304896中描述了过表达Cas9的拟小球藻属菌株的生成,所述美国专利申请通过引用整体并入。

简而言之,将载体pSGE-6709工程化以用于在拟小球藻属中表达酿脓链球菌Cas9基因。所述载体包含以下三个元素:(1)Cas9表达盒,其含有针对拟小球藻属进行密码子优化的工程化的Cas9基因并且含有来自拟小球藻属的内含子,其还包含N末端FLAG标签、核定位信号和可操作地连接到拟小球藻属RPS17启动子,以及肽接头,其由拟小球藻属RPS17终止子终止可选择标记表达盒,其含有针对拟小球藻属进行密码子优化的来自土曲霉(Aspergillus terreus)的杀稻瘟菌素抗性基因并且含有拟小球藻属内含子,可操作地连接到拟小球藻属RPS4启动子并由拟小球藻属RPS4终止子终止,GFP报告基因表达盒,其含有TurboGFP基因(Evrogen公司(Evrogen),莫斯科,俄罗斯),其由拟小球藻属ACP1启动子驱动并由拟小球藻属ACP1终止子终止。

通过基因枪法将载体转化为拟小球藻属。使用伯乐基因枪系统(BioRadGene Gun System)完成拟小球藻属野生型菌株WT-1185的转化,基本上如美国专利公开号2014/0154806中所描述的,所述美国专利通过引用并入本文。将用于转化的DNA沉淀在金颗粒上,这些金颗粒粘附在管子长度的内部,并且燃烧一阵氦气通过定位在基因枪内的管子,以将这些DNA包被的金颗粒推进到拟小球藻属菌株WT-1185细胞中,所述细胞粘附在固体非选择性培养基(含有PM074藻类生长培养基的2%琼脂平板)上。使用基因枪在600psi下,从距平板3-6cm的距离发射两个子弹每细胞圈。第二天,将细胞转移到用于转化的菌落生长的选择性培养基上。

通过流式细胞术和鉴别经过转化的菌株来筛选菌落的完全GFP外显率,所述经过转化的菌株的单个荧光峰偏移到比野生型荧光峰更高的值。为了证明Cas9表达,通过抗Cas9蛋白质印迹来测试完全渗透的Cas9菌株的Cas9表达,所述菌株相对于未转化的细胞显示出明显偏移的荧光峰。基于这些筛选,将分离株6709-2向前进行并给予菌株标识符GE-15699。

实例2:

使用完全渗透的拟小球藻属CAS9编辑系敲除CPSRP54

在美国专利申请公开2016/0304896中描述了使用完全渗透的拟小球藻属Cas9编辑系敲除cpSRP54,所述美国专利申请通过引用整体并入。简而言之,设计了嵌合gRNA(SEQID NO:103),最后三个核苷酸代表PAM,并且在体外合成以靶向拟小球藻属编码序列中的叶绿体SRP54基因。

将GE-15699通过电穿孔用1-2μg纯化的嵌合引导RNA和1μg可选择标记DNA进行转化,所述可选择标记DNA含有针对拟小球藻属进行密码子优化的博来霉素抗性“BleR”基因,并且含有来自拟小球藻属(SEQ ID:70)的内含子。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。

通过对100mL种子培养物进行接种来进行电穿孔,所述种子培养物在使用转化前六天被接种成1×106个细胞/毫升,以在转化前两天将1L培养物接种成1×106个细胞/毫升。在转化当天,通过在5000x g下离心持续20分钟来沉淀细胞,用0.1um过滤的385mM山梨糖醇洗涤三次,并且在385mM山梨糖醇中重悬到5×109细胞/毫升。在不同条件下,在伯乐(BioRad)Gene Pulser XcellTM中,在0.2cm比色皿中进行100μL浓缩细胞的电穿孔。用于优化电穿孔的DNA是线性化的pSG6640,包含bleR和TurboGFP表达盒。TurboGFP盒包含可操作地连接到TurboGFP基因(SEQ ID NO:24)的拟小球藻属ACP1启动子(SEQ ID NO:67)和拟小球藻属ACP1终止子(SEQ ID NO:68)。在电穿孔预冷的细胞和比色皿后,立即添加1mL冷山梨糖醇并且用于将细胞转移到10mL PM074中。在过夜回收之后,将细胞浓缩并铺展到含有250mg/L博莱霉素(zeocin)的直径为13cm的PM074培养基上,并且在基因枪部分中所列出的条件下生长。

电穿孔条件为1.0-1.2kV(5000-6000V/cm)、200-300欧姆和25-50μF。使用更大量的DNA增加了所得博莱霉素抗性菌落的数量,尽管在大于4μg的量下效果达到稳定。在电穿孔之后,将细胞铺在含有250μg/ml博莱霉素的琼脂培养基(PM130)上,以选择掺入ble盒的转化体。使用设计用于跨天然靶向基因座(oligo-AE596和oligo-AE597)扩增的引物,通过菌落PCR筛选转化体。引物被设计成在不存在整合(例如,BleR盒的“敲入”)到基因座中的情况下产生700bp条带,或如果存在单个ble盒整合到靶向基因座中的话,则产生4.3kb条带。另外,还使用设计成扩增从cpSRP54基因(oligo-AE597)延伸到可选择标记的片段的引物来进行菌落PCR。根据整合的ble盒的朝向,1.2kb条带将由从ble盒内向外跨越进入到cpSRP54基因中的引物405/597或引物406/597的扩增来产生。结果显示,在不存在同源性臂的情况下,BleR盒敲入到靶向基因座中的高频率(在此样品中介于40%与45%之间)。cpSRP54敲除导致浅绿色表型。

实例3

使用完全渗透的拟小球藻属CAS9编辑系敲除SGI2

使用完全渗透的拟小球藻属Cas9编辑系敲除SGI2基本上如上文针对cpSRP54所述进行。简而言之,设计了嵌合gRNA(SEQ ID NO:104),最后三个核苷酸代表PAM,并且在体外合成以靶向拟小球藻属编码序列中的叶绿体SGI2基因。

将GE-15699通过电穿孔用1-2μg纯化的嵌合引导RNA和1μg可选择标记DNA进行转化,所述可选择标记DNA含有针对拟小球藻属进行密码子优化的博来霉素抗性“BleR”基因,并且含有来自拟小球藻属(SEQ ID:70)的内含子。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。

选择Ble抗性菌落,并且通过PCR确认敲除。

实例4

使用完全渗透的拟小球藻属CAS9编辑系敲除SGI1

SGI1敲除菌株24183是从表达Cas9的母菌株GE--15699开始产生的。将GE-15699细胞电穿孔为嵌合gRNA(SEQ ID NO:105,SEQ ID NO:105的最后三个核苷酸代表PAM),以及DNA盒,所述DNA盒含有侧翼为亚硝酸盐还原酶启动子和终止子的密码子优化的Cre基因,并且如图10A所示。所述盒还含有先前已经使用的ble和GFP基因。Ble和GFP的侧翼为lox2272位点。当表达Cre时,lox位点重组,在这些序列之间环出DNA。围绕CRISPR靶的SGI1基因的同源序列也位于所述盒的末端,以增强单拷贝整合。在SGI1基因座中存在盒的情况下,通过DNA测序来确认序列。使用ddPCR确认拷贝数是单拷贝整合体。然后,将菌株在非含铵培养基(non-ammonium containing media)中培养,以表达Cre。当表达Cre时,lox位点重组,在这些序列之间环出DNA。

实例5

使用完全渗透的拟小球藻属CAS9编辑系双敲除SGI2和CPSRP54

使用完全渗透的拟小球藻属Cas9编辑系双敲除SGI2和SRP54基本上如上文针对cpSRP54所述进行。简而言之,设计了两种嵌合gRNA,一种针对cpSRP54(SEQ ID NO:69)并且另一种针对SGI2(SEQ ID NO:73),最后三个核苷酸代表PAM,并且在体外合成以靶向拟小球藻属编码序列中的叶绿体SGI1基因。

将GE-15699通过电穿孔用1-2μg纯化的嵌合引导RNA和1μg可选择标记DNA进行转化,所述可选择标记DNA含有针对拟小球藻属进行密码子优化的博来霉素抗性“BleR”基因,并且含有来自拟小球藻属(SEQ ID:70)的内含子。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。

选择Ble抗性菌落,并且通过PCR确认敲除。

实例6

使用完全渗透的拟小球藻属CAS9编辑系双敲除SGI1和CPSRP54

将如上所述的拟小球藻属SGI1敲除菌株24183用靶向cpSRP54(SEQ ID NO:69)的嵌合gRNA与包括ble和GFP序列的DNA盒进行电穿孔(图10B),以产生SGI1和cpSRP54的双敲除。选择Ble抗性菌落,并且通过PCR确认敲除。生成了三种双敲除菌株:STR24538、STR24540和STR24541,所述双敲除菌株在光生理性质和物理表型上相同。

实例7

包括单拷贝CAS9基因的拟小球藻属SGI1敲除菌株的生成

将针对拟小球藻属进行密码子优化的并且包括来自拟小球藻属(SEQ ID:70)、GFP基因、Cre基因、lox位点和Cas9基因的内含子的博来霉素抗性“BleR”基因克隆到pCC1BAC载体中。Cas9基因可操作地连接到拟小球藻属RPS17启动子,并且包括29个天然PBP内含子,并且位于lox2272位点之外。Cas9基因由拟小球藻属RPS17终止子终止。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。GFP基因可操作地连接到拟小球藻属ACP1启动子,并由拟小球藻属ACP1终止子终止。Cre基因可操作地连接到拟小球藻属亚硝酸盐还原酶启动子和拟小球藻属亚硝酸盐还原酶终止子。这些基因的侧翼是SGI1(CheY)序列的用作同源重组位点的一部分。图17示出了重组pCC1BAC载体的示意图。

转化WT拟小球藻属宿主菌株:STR00010

将Cas9基因WT拟小球藻属宿主菌株与靶向SGI1基因(SEQ ID NO:74)的gRNA和PvuI消化的且旋转纯化的选择盒(NAS00460,SEQ ID NO:86)共转化。

选择盒(NAS00460)包括片段,所述片段包含SGI1同源重组(HR)上臂的上游1.7kb载体主链(对应于SEQ ID NO:86的序列1-1761)和SGI1 Hr下臂的下游没有载体的部分、针对拟小球藻属进行密码子优化的并且含有来自拟小球藻属(SEQ ID:70)、GFP基因(对应于SEQ ID NO:86的序列8260-8961)和Cas9基因的内含子的博莱霉素抗性“BleR”基因。选择盒在lox位点内含有ble和GFP。CRE基因(对应于SEQ ID NO:86的序列10418-13326)包括6个亚硝酸盐还原酶密码子优化的拟小球藻属内含子,处于亚硝酸盐还原酶诱导型启动子之下(对应于SEQ ID NO:86的序列9906-10417)。Cre基因由亚硝酸盐还原酶终止子终止(对应于SEQ ID NO:86的序列13327-15140)。包含29个天然PBP内含子的Cas9基因对应于SEQ IDNO:86的序列15754到序列25918。Cas9基因处于拟小球藻属RPS17启动子之下(对应于SEQID NO:86的序列15166-15753),并且含有29个天然PBP内含子,并且位于lox位点之外。Cas9基因由拟小球藻属RPS17终止子终止(对应于SEQ ID NO:86的序列25919-26373)。T

BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。GFP基因可操作地连接到拟小球藻属ACP1启动子(对应于SEQ ID NO:86的序列7688到8259),并且由拟小球藻属ACP1终止子(对应于SEQ ID NO:86的序列8692-9830)终止。SGI1同源重组(HR)上臂对应于SEQ ID NO:86的序列1762-3578。SGI1同源重组(HR)下游臂对应于SEQ ID NO:86的序列26448-28447。5'lox2272位点对应于SEQID NO:86的序列3831-3864,并且3'lox2272对应于SEQ ID NO:86的序列9839-9872。所有序列均处于SGI1 CRISPR靶的上游和下游的2kb同源区内。

在共同转化SGI1 gRNA(SEQ ID NO:105)和选择盒(SEQ ID NO:86)时,将SGI1基因敲除,并且通过同源重组将包括Cas9、BleR和GFP基因的选择盒插入到SGI1位点中。BleR和GFP基因的侧翼是lox2272位点,而选择盒的Cas9和Cre基因位于lox2272位点之外,但位于充当同源重组位点的SGI1序列的部分内。

一旦将选择盒插入到SGI1基因座中,Cre基因可操作地连接到诱导型亚硝酸盐还原酶启动子。因此,当微生物在包括亚硝酸盐的生长培养基中生长时,Cre基因表达被诱导。在Cre基因表达时,Cre酶作用于lox2272位点并去除侧翼于lox位点内的BleR和GFP序列。这导致一种系统,在所述系统中,可以在其它序列的子序列转化期间重新引入可选择标记(例如,GFP、其它抗生素标记,例如,BleR)。

筛选用于Cas9插入的经过转化的拟小球藻属菌株

将转化的拟小球藻属细胞在含有铵的选择性平板上铺板到单一菌落,以抑制CRE表达,再次在选择性抑制平板上铺板修补菌落,并且使用PCR和GFP位移筛选敲除。用于确认敲除的PCR引物如下所示:

AE803:AGGCTACTCTCAGACATGACGGTGGCTCTG(SEQ ID NO:87)

ST815:GCCACAAATGAAGGTTGGCAGGGTCAGTGC(SEQ ID NO:88)

将PCR阳性反应送去测序,以确认敲除(盒的插入)和完美的HR。本申请的发明人惊讶地且出乎意料地发现,是Cas9基因的单拷贝被插入到SGI1基因座中。

实例8

使用完全渗透的拟小球藻属单拷贝CAS9编辑系三重敲除SGI1、SGI2和CPSRP54

产生如上所述的拟小球藻属SGI1敲除菌株STR24129,其具有插入到SGI1基因座中的Cas9和Cre的单拷贝,其中使用SGI1敲除引导序列:ACACCACCTTAAGGCACATGAGG(SEQ IDNO:89)将标记物(ble/GFP)侧接移除。

将SGI1敲除菌株STR24129用作敲除SGI2和SRP54基因的转化宿主。将宿主菌株STR24129与靶向SGI2和SRP54基因的gRNA和选择盒(pSGE06866)共转化,所述选择盒包括包含用于每个靶(例如,SRP54和SGI2)的同源重组(HR)臂的Ultramers。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。GFP基因可操作地连接到拟小球藻属ACP1启动子,并由拟小球藻属ACP1终止子终止。选择盒包括由lox位点围绕的ble和GFP标记,用于潜在的标记再循环。当表达Cre时,lox位点重组,在这些序列之间环出DNA。

将转化的宿主细胞铺在选择性平板上,挑取修补菌落和单一菌落,并使用PCR筛选敲除。将PCR阳性反应送去测序,以确认每个靶的敲除(盒的插入)。

图16A和16B示出了用于敲除拟小球藻属SRP54和拟小球藻属SGI2的选择盒的示意图。gRNA的序列、具有HR臂的Ultramer如下所示。

SRP54-EMRE3EUKT592650

gRNA序列:GGCGTGGGACATGGTGCGCAAGG(SEQ ID NO:90)

具有HR臂以扩增pSGE06866的Ultramer:

ST938_HR_SRP54-UP

TGAAGCACCCCCCGGCCTCTCCCCCCGCAGGGCCGCCCCTCCCGCCTCGTCGTGC(SEQ ID NO:91)

ST939_HR_SRP54-DOWN

CGCAACGCTCTCCCTCCCCACCCCCCAGCCTCACATCCGCCTCAAGCAGCGCCCTG(SEQ ID NO:92)

引物序列:

ST949_CasPipe9GT_SRP54-fwd:caagctatgcgaggaagggagggtc(SEQ ID NO:93)

ST950_CasPipe9GT_SRP54-rev:ctgccgcaagtgagtgtgctgtc(SEQ ID NO:94)

用于筛选的其它引物-位于选择盒中:

JV946-接头5-用于:caccagatataggtgacccgataac(SEQ ID NO:95)

AE608 ble rev:AAAACTCCACTGCACCTGCAACAT(SEQ ID NO:96)

SGI2-EMRE3EUKT590485

gRNA序列:

ST937_crRNA_064_EMRE3EUKT590485:TGCGGTGAAGCTTGGAGCTG(SEQ ID NO:97)

将HR臂置于PSGE06866上的Ultramer序列

ST940_HR_SGI2-UP

TTGCCGTCGACGAGACTTCGGGGCGCGCATTTATCGACTCTCTTGAAGATACACCGGTT(SEQ IDNO:98)

ST941_HR_SGI2-DOWN

TCCAATTGTAGATATCATATTGTTTCCGGACCTACCTTACGCACTGAGTGCTGCCAGATGTTCTT(SEQID NO:99)

引物序列:

ST046CasPipe9GT-064-fwd:gaggtgggtggtagtgcttcgcgaggtg(SEQ ID NO:100)

ST047CasPipe9GT-064-rev:atcacagctcacagggcagacactgcgtc(SEQ ID NO:101)

引物序列:

将引物JV946和AE608也用作筛选引物。

实例9

SGI2蛋白的结构域架构的生物信息学分析

使用在线工具InterProScan(工具版本5.27,数据库版本66.0,来自EMBL-EBI,辛克斯顿,剑桥郡,CB10 1SD,英国),分析来自拟小球藻属、卵囊藻属、四爿藻属、拟南芥的示例性SGI2蛋白的结构域架构。

如图3-9所示,在SGI2蛋白的N末端鉴定了单个保守的响应接收结构域。

实例10

各种SGI2蛋白的响应接收结构域的生物信息学分析

使用BLOSUM62矩阵、空位罚分10和延伸罚分0.5,将拟小球藻属响应接收结构域(SEQ ID NO:6)与来自其它藻类物种和各种植物的其它直向同源蛋白质对进行了局部比对。拟小球藻属响应接收结构域(SEQ ID NO:6)与各种光合生物的局部比对在下表5中示出。

表5:拟小球藻属响应接收结构域与各种直向同源蛋白质的局部比对结果。

拟小球藻属的响应接收结构域示出了与其它藻类物种更高百分比的同一性以及与各种植物物种的高度相似性。

实例11

低叶绿素拟小球藻属菌株WT-1185突变体的筛选

在敲除拟小球藻属基因的SGI1、SGI2、双敲除SGI1和cpSRP54或双敲除SGI2和cpSRP54之后,如上所述,选择来自浅色菌落的细胞并允许其在低光(100μmol光子m-2sec-1)下生长一到五天,之后使用BD FACSAria II流式细胞仪(BD生物科学公司(BDBiosciences),圣何塞,加利福尼亚州)通过流式细胞术对其进行分选,以选择具有低叶绿素荧光的细胞。通常,选择与总细胞群体相比具有最低大约0.5%到2%叶绿素荧光的细胞部分。在所分选的细胞铺板之后,通过目视选择浅绿色或黄色菌落,进一步初步筛选通过流式细胞术分离的天线减少的细胞系。为了从其它减少的色素突变体和假阳性中筛选推定的触角减少的细胞系,在光生理测量之前,对所选菌落进行中等通量的二次培养筛选,以使分离物适应低光条件。在低光适应期间监测叶绿素荧光,以选择保留了高光适应状态的减少的叶绿素荧光特性的克隆。当从高光转移到低光时,所选克隆仅显示叶绿素的少量增加(相对于野生型细胞)。

使用在75cm2组织培养烧瓶中的165ml培养物,在恒定的高光下(大约1,700μmol光子m-2sec-1)进行半连续培养物测定,以鉴定相对于野生型祖先株WT-1185,具有增加的生产力(增加的生物质产生速率,测量为总有机碳(TOC)积聚)的菌株。用给定突变体菌株的种子培养物接种两个75cm2烧瓶。烧瓶具有塞子,所述塞子具有与注射器过滤器连接的管子,以用于递送通过培养物鼓泡的富含CO2的空气(1%CO2)。将烧瓶的宽度(最窄的尺寸)与LED灯组进行比对。培养物的深度(从最靠近光源的烧瓶壁到烧瓶后部壁的距离)大约为8.0cm。在光周期开始时,通过去除65%的培养物体积并且将其用稀释的新鲜PM119培养基置换而每天对培养物进行稀释,以适应由于培养物(212ml di H2O到1L PM119培养基)中发生蒸发而导致的盐度增加。从去除以进行稀释的培养物中取出用于TOC分析的样品。

实例12

拟小球藻属突变体的半连续生产力测定

对发现在低光条件下具有减少的叶绿素的拟小球藻属菌株进行分析,以增加生产力。在生产力测定中,突变体的光能自养培养物在恒定光半连续模式(CL-SCPA)下生长若干天,其中每天去除培养物样品用于生物质确定。每天将光保持在恒定的1900-2000μmol光子m-2sec-1下持续24小时。在此测定中,用给定突变体菌株的种子培养物接种225cm2烧瓶中的PM119培养基。每个菌株启动三种培养物。烧瓶包含搅拌棒并具有塞子,所述塞子具有与注射器过滤器连接的管子,以用于递送通过培养物鼓泡的富含CO2的空气(1%CO2)。将烧瓶的宽度(最窄的尺寸)与LED灯组进行比对。从光源向后延伸的烧瓶的“深度”尺寸为13.7cm。考虑到烧瓶的位置,烧瓶中细胞距光源表面的最远距离大约为15.5cm。通过去除65%的培养物体积并且将其用稀释的新鲜PM119培养基置换而每天对培养物进行稀释,以适应由于培养物中发生蒸发而导致的盐度增加。从去除以进行稀释的培养物中取出用于TOC分析的样品。一旦培养物达到稳定状态,半连续生产力测定运行持续12天。

通过从每天去除的样品中测量总有机碳(TOC)来评估测定的生产力。总有机碳(TOC)通过用去离子水将2mL细胞培养物稀释到20mL总体积来确定。每次测量注射三次,注射到Shimadzu TOC-Vcsj分析仪中,以确定总碳(TC)和总无机碳(TIC)。将燃烧炉设定为720℃,并且通过从TC中减去TIC来确定TOC。4点校准范围为2ppm到200ppm,对应于非稀释培养物的20-2000ppm,其中相关系数r2>0.999。

已经描述了本发明的多个实施例。然而,应当理解的是,本文所描述的实施例的元素可以被组合以形成另外的实施例,并且在不脱离本发明的精神和范围的情况下可以进行各种修改。因此,其它实施例、替代方案和等效物处于本文所描述且要求保护的本发明的范围内。

实例13

拟小球藻属突变体的半连续尿素分批测定

在SCUBA(半连续尿素分批测定)中,突变体的光能自养培养物在充满氮的昼夜光半连续模式下生长若干天,然后在氮耗尽的分批模式下生长。光被编程为模仿加利福尼亚州的因皮里尔河谷(Imperial Valley)的平均5月4日从黑暗到中午的2000μmol光子m-2sec-1。每天在“黄昏”取样。在此测定中,用给定突变体菌株的种子培养物接种500ml方形烧瓶中的基于尿素的420ml的PM153培养基。

PM152是一种基于PM074的营养耗尽培养基,但包含尿素而不是硝酸盐作为氮源。其通过将1.3mlF/2藻类饲料A部分(水生生态系统公司(Aquatic Eco-Systems))和1.3ml‘溶液C’添加到最终体积为1升的速溶海盐溶液(17.5g/L)中而制成(水生生态系统公司,阿波普卡,佛罗里达州)。溶液C为38.75g/L NaH2PO4 H2O、758mg/L盐酸硫胺素、3.88mg/L维生素B12和3.84mg/L生物素。

每个菌株启动三种培养物。烧瓶包含搅拌棒并具有塞子,所述塞子具有与注射器过滤器连接的管子,以用于递送通过培养物鼓泡的富含CO2的空气(1%CO2)。将烧瓶与朝向0.0875m2光的孔对准,并且从光源向后延伸的烧瓶的“深度”尺寸为8cm。对于半连续生物质的测定,通过去除40%的培养物体积并且将其用稀释的新鲜PM153培养基置换而每天对培养物进行稀释,以适应由于培养物中发生蒸发而导致的盐度增加。从去除以进行稀释的培养物中取出用于TOC分析的样品。将半连续生产力测定运行,直到培养物达到稳定状态。半连续后,将培养物从测定中去除,使用离心法沉淀并重悬于420ml氮耗尽PM152培养基中。使用与半连续模式相同的生长条件,将培养物分批培养4-5天。在分批模式期间,采集FAME样品以确定脂质生产力,并且采集TOC样品以确定FAME/TOC。

实例14

SGI1、SGI2基因、SGI1和SRP54以及SGI2和SRP54基因的双敲除的拟小球藻属敲除突变体的叶绿素含量、触角大小和光生理学

通过用甲醇提取细胞并通过分光光度法分析上清液来确定高生产力突变体的叶绿素含量。简而言之,将500μl培养液等分试样吸移到2.0ml扭曲顶部试管中,并使用台式微量离心机以15,000rpm持续沉淀10分钟。从沉淀中吸出上清液,并将每个沉淀物重悬于1.5ml的99.8%的甲醇(先前用碳酸镁中和)中。将0.2ml玻璃珠(直径为0.1mm)添加到每个小瓶中,并敲打持续3分钟。将1.0ml上清液转移到新的1.7ml翻转式管子中,并且在台式微量离心机中以15,000rpm离心持续10分钟。所得沉淀物为白色,表明已经进行完全提取。将0.8ml的每种上清液吸移到光学玻璃比色皿中,并且立即在720nm、665nm和652nm的波长下读取吸收波长。使用99.8%甲醇坯料在双光束模式下进行分光光度测量。以下等式用于计算叶绿素浓度:叶绿素a[g m-3]=16.72(A665-A720)+9.16(A652-A720)以及叶绿素b[g m-3]=34.09(A652-A720)–15.28(A665-A720)。叶绿素a和b的量在每个细胞和每个TOC的基础上进行标准化。虽然在SGI1-2261突变体中的每个细胞的总叶绿素量有所不同,但相对于野生型细胞其普遍减少了约30%到约65%,这与所观察到的触角大小的减少一致。在每个TOC的基础上,相对于野生型细胞,SGI1突变体中总叶绿素的减少范围为约30%到约50%。

除叶绿素含量之外,还分析了SGI1和SGI2敲除突变体以及SGI1和SRP54以及SGI2和SRP54的双敲除的PSII功能性吸收横截面、PSI功能性吸收横截面、1/τ'Qa(在光饱和时,光系统II的受体侧上的电子传递的光饱和速率,线性光合电子传递的效率的测量)以及最大固碳率Pmax。在上述恒定的光半连续培养测定(CL-SCPA)中培养野生型和突变体菌株的细胞。

使用荧光诱导和弛豫(FIRe)技术(其被开发用于测量光合生物的一系列综合光合和生理特征)对各种光合参数进行分析(Gorbunov和Falkowski(2005),“用于监测水生生态系统中的光合过程和初级产生的荧光诱导和弛豫(FIRe)技术以及仪器(FluorescenceInduction and Relaxation(FIRe)Technique and Instrumentation for MonitoringPhotosynthetic Processes and Primary Production in Aquatic Ecosystems)”:《光合作用:全球视野的基本面(Photosynthesis:Fundamental Aspects to GlobalPerspectives)》,第13届国际光合作用大会,蒙特利尔,2004年8月29日到9月3日。(编辑:A.van der Est和D.Bruce),亚伦出版社(Allen Press),第2卷,第1029-1031页)。FIRe技术依赖于对叶绿素“可变荧光”图谱的测量和分析(由Falkowski等人,2004,“叶绿素可变荧光技术在海洋生态系统中的发展和应用(Development and Application of VariableChlorophyll Fluorescence Techniques in Marine Ecosystems)”所综述的:《叶绿素A荧光:光合作用的特征(Chlorophyll a Fluorescence:A Signature of Photosynthesis)》(C Papageorgiou和Govingjee编辑,施普林格出版社,第757-778页),这取决于叶绿素荧光与光合过程效率之间的关系。这项技术提供了一组参数,这些参数表征了光合的光收集过程、光系统II(PSII)中的光化学以及光合电子向下传递到碳固定。本文进行的测量使用了由新泽西州东布朗士维克罗格斯大学的Maxim Gorbunov公司生产的微型FIRe装置。可从海鸟科技公司(Sea-Bird Scientific)(哈利法克斯,加拿大,satlantic.com和planet-ocean.co.uk)商购获得FIRe装置。公司手册中提供了有关FIRe装置使用的进一步信息。所有测量均使用恒定光(2000μmol光子m-2sec-1)半连续培养物(CL-SCPA)进行的(参见实例3)。为了获得FV/FM和σPSII,在黑暗中进行了荧光诱导和弛豫(FIRe)动力学的测量。表6中所呈现的Fv/FM和σPSII的值计算为6次测量的平均值(2次生物复制中每一次复制的3次测量),这些参数的误差不超过5%。

PSI横截面的测量是使用改良的JTS-10光谱仪(所述光谱仪具有过滤器组)进行的,以用于在520nm处通过配备有定制的单翻转闪光灯(STF)来测量电致变色位移(ECS)。样品室中的峰值功率密度足够高,以确保在大约10μs内完全关闭反应中心。所得激发率为每10μs每个反应中心约1-3次命中(取决于光系统的功能性吸收横截面)。STF生成短的超亮蓝光脉冲(455nm,其中半带宽为30nm),并且脉冲定时由JTS-10光谱仪的触发器控制。脉冲持续时间由STF脉冲控制盒控制,并且可以使用前板上的电位计在1μs到50μs的范围内进行调整。为了测量PSI横截面,使用配备有积分球(integrating sphere)的Perkin ElmerLambda 650分光光度计,基于细胞悬浮液的吸收光谱的测量,在最大叶绿素(约440nm)处将培养物稀释到约0.2的OD。在DCMU和羟胺存在的情况下,使用强度范围为4000到120,000μmol光子m-2s-1的10μs闪烁来测量ECS。用简单的指数函数拟合实验曲线

其中ECSMECSM是最大的ECS信号;ItIt是光子密度,单位为光子/平方米;并且σPSIσPSI是PSI的功能性横截面。所获得的野生型拟小球藻属(WT-1185)的PSI功能性横截面的值为(4.0±0.5)×10-18(4.0±0.5)×10-18m2。这些值接近在相同条件下(σPSII=(4.3±0.1)×10-18σPSII=(4.3±0.1)×10-18m2)生长的PSII的功能性横截面所获得的值。这些参数的误差估计不超过20%。

固碳率(C14 Pmax)使用在含有0.5g l-1(5.95mM)碳酸氢钠的培养基中归一化为5μg chl ml-1的培养物来测量。将20.4μCi ml-1C14标记的碳酸氢钠添加到每种培养物中,并暴露于2500μE中持续10分钟。立即用2N HCl酸化样品,并且允许其排气过夜。第二天,使用Beckman LS6500闪烁计数器测量样品并进行量化。

τ'Qa(在饱和光条件下所测量的PSII受体侧上的电子传递时间—由线性光合电子传递的最慢步骤有效确定)是从FIRe光曲线和暗诱导的弛豫动力学(DIRK)图谱测量的。相对于野生型的体积PSII浓度估计为(Fv/σ530PSII)。这些参数的误差估计不超过15%。使用以下等式来估计光学吸收横截面(在光源发射光谱上的平均值):

其中[Chl/TOC]是样品的叶绿素/TOC,OD(λ)OD(λ)是在波长λλ下所测量的样品的光密度,ΔlΔl是比色皿(1cm)中所测量的光束路径长度,I(λ)I(λ)是用于在波长λλ下生长藻类的光源强度。

表6.用FIRe技术所测量的荧光和光合参数

概述并评估了野生型拟小球藻属菌株WT-1185的光生理数据、叶绿素含量和生产力数据,拟小球藻属中的SRP54和SGI2基因的单敲除以及SGI2和SRP54基因的双敲除。所有测量均使用CL-SCPA培养物进行。为了获得FV/FM和σPSII,在黑暗中进行了荧光诱导和弛豫(FIRe)动力学的测量。针对Fv/Fm和σPSII呈现的值计算为6次测量的平均值(2次生物复制中每一次复制的3次测量)—这些参数的误差不超过5%。τ'Qa(在饱和光条件下所测量的PSII受体侧上的电子传递时间—由线性光合电子传递的最慢步骤有效确定)是从FIRe光曲线和DIRK图谱测量的。PSI横截面的测量如上所述进行。结果在以下表7中概括。

表7.光生理学、叶绿素和生产力数据

PSII的功能性吸收横截面显著减少(50%),并且功能性PSII复合物的数量也有所减少。细胞具有提高的碳固定能力(Pmax增加26%)。与野生型菌株相比,SGI2或SRP54的单敲除示出了TOC生产力增加了至少17%。总体而言,双SGI2/SRP54敲除菌株示出了TOC生产力提高了32%(两次在CL-SCPA测定上运行双SGI2/SRP54敲除菌株,均显示生产力>40克/平方米/天),其中如图11所示,观察到针对拟小球藻属的生产力的增加最高,并且比单敲除SRP54或SGI2的平均提高更高。结果表明,当SGI2和SRP54基因两者被敲除时,似乎对生产力有协同效应。

评估了野生型拟小球藻属菌株WT-1185的光生理数据,SRP54和SGI1基因的单敲除以及具有拟小球藻属中的SGI1和SRP54基因的双敲除的三种菌株。所有测量均使用CL-SCPA培养物进行。为了获得FV/FM和σPSII,在黑暗中进行了荧光诱导和弛豫(FIRe)动力学的测量。针对FV/FM和σPSII呈现的值计算为6次测量的平均值(2次生物复制中每一次复制的3次测量)—这些参数的误差不超过5%。τ'Qa(在饱和光条件下所测量的PSII受体侧上的电子传递时间—由线性光合电子传递的最慢步骤有效确定)是从FIRe光曲线和DIRK图谱测量的。结果在表8中概括。

表8.拟小球藻属菌株的光生理学

与单个SGI1或SRP54基因敲除相比,SGI1/SRP54双敲除菌株PSII的功能性横截面显著减少,并且电子传递的光饱和率降低,这表明光合作用的速率提高。功能性PSII复合物的数量也有所增加。与单个敲除SRP54或SGI1相比,双敲除菌株中光系统II中的光化学最大量子产率(FV/FM)有所提高。

实例15

SGI1/SGI2、SGI1/SRP54和SGI1/SGI2/SRP54敲除突变体的微近似分析

为了确定SGI1/SGI2、SGI1/SRP54和SGI1/SGI2/SRP54敲除突变体的总体生物质组分,对来自以每天40%稀释的在半连续模式下生长的培养物的样品进行量化分析,以确定总有机碳(TOC)和半连续培养物中细胞的脂质含量。在培养物达到稳定状态之后,所去除的每天稀释的培养物的等分试样用于分析脂质、蛋白质和碳水化合物。藻类培养物样品的总有机碳(TOC)通过用去离子水将2mL细胞培养物稀释到20mL总体积来确定。每次测量注射三次,注射到Shimadzu TOC-Vcsj分析仪中,以确定总碳(TC)和总无机碳(TIC)。将燃烧炉设定为720℃,并且通过从TC中减去TIC来确定TOC。4点校准范围为2ppm到200ppm,对应于非稀释培养物的20-2000ppm,其中相关系数r2>0.999。

为了确定脂质含量,对使用GeneVac HT-4X干燥的2mL样品进行了FAME分析。向干燥的沉淀物中添加以下:500μL 500mM KOH甲醇溶液、200μL的含有0.05%丁基化羟基甲苯的四氢呋喃、40μL 2mg/ml C11:0游离脂肪酸/C13:0甘油三酯/C23:0脂肪酸甲酯内标混合物和500μL玻璃珠(直径425–600μm)。将小瓶用顶部开口的PTFE隔垫衬里盖加盖,并且在SPEX GenoGrinder中以1.65krpm放置持续7.5分钟。然后将样品在80℃下持续加热五分钟并使其冷却。为了衍生化,在80℃下加热持续30分钟之前,向样品中添加500μL 10%三氟化硼甲醇溶液。在添加2mL庚烷和500μL 5M NaCl之前,将试管冷却。然后将样品以2krpm涡旋持续五分钟,并且最后以1krpm离心持续三分钟。使用Gerstel MPS自动进样器,对庚烷层进行采样。定量使用80μg C23:0FAME内标。

图12A和12B分别示出了表明拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR24183)、SGI1/SRP54双敲除突变体(STR24538和STR24540)的半连续区域TOC生产力和分批TOC的测定结果。SRP54敲除突变体、SGI1敲除突变体、SGI1/SRP54双敲除突变体示出了相对于拟小球藻属野生型菌株增加的TOC生产力。

图13A和13B分别示出了表明拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR00012)、SGI2/SRP54双敲除突变体(STR00516)以及SGI1/SGI2/SRP54三重敲除突变体(STR25761和STR25762)的半连续区域TOC生产力和分批TOC的测定结果。SGI1敲除突变体、SGI2/SRP54双敲除突变体以及SGI1/SGI2/SRP54三重敲除突变体示出了相对于拟小球藻属野生型菌株增加的TOC生产力。

图14示出了拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR24183)、SGI1/SRP54双敲除突变体(STR24538和STR24540)的分批FAME生产力测定的结果。SGI1敲除突变体和SGI/SRP54敲除突变体示出了相对于拟小球藻属野生型菌株增加的FAME生产力。

图15示出了拟小球藻属野生型菌株(STR00010)、SGI1敲除突变体(STR00012)、SGI2/SRP54双敲除突变体(STR00516)以及SGI1/SGI2/SRP54三重敲除突变体(STR25761和STR25762)的分批FAME生产力测定的结果。

本申请中的标题仅仅是为了方便读者,而不以任何方式限制本发明或其实施例的范围。

本说明书中所提及的所有公开和专利申请通过引用并入本文,其程度如同每个单独的公开或专利申请被专门地且单独地指示通过引用并入。

序列表

<110> 合成基因组学公司

I.阿加维

F.I.库兹米诺夫

R.R.拉达科维茨

J.H.维卢托

S.波茨

R.斯普雷菲科

W.F.兰伯特

J.N.格雷纳

<120> 用于改善生长的光合生物基因调节

<130> SGI2140-2WO

<150> US 62/690,205

<151> 2018-06-26

<150> US 62/612,251

<151> 2017-12-29

<160> 105

<170> PatentIn版本3.5

<210> 1

<211> 4531

<212> DNA

<213> 拟小球藻属

<400> 1

atgtctggtt cagctggatc gggccaggct actctcagac atgacggtgg ctctgctggc 60

ggcagtgggc ctgtctcaga cggtttttca ccggccggcc tgaaggtaaa gtagaaagac 120

actcatacac atcttggttc ggcgttgaaa gtaggtcatt aacatactct ataaccaata 180

tttgtaggtt ctggtcgtgg acgacgacct catgtgcctt aaggtggtgt cagccatgtt 240

gaagaggtgc agctatcaag gtgaggtctt tactggtgtc tgttattgct gtaacatcat 300

ttcgctgttg cacaatttaa acatttgtaa tttactgttg ttattgcagt ggccacttgt 360

agcagtggca gcgaggcact gacacttcta cgtgaacgca acgaggacgg atcctccgac 420

cagttcgacc tcgtactgtc agatgtttac atgccgggta tgtcgtattc ctttgtaaac 480

tttacaatat gcgtctagtt tgacgcgtac actttgtaca ctttgcaaaa acgcaccctg 540

cgaggtctgc catttggtca ctacaacttg gccaccttgg ttgcaagttt gcaagttcgc 600

tctacgtcaa cgctgcaaaa tgaaccaatt gttttgcact gaccctgcca accttcattt 660

gtggctgcag acatggacgg tttcaagctg cttgaacaca tcggtctaga gttggagctt 720

cccgttatca gtaagttgat cgagccgagt ccagagcgaa gcctgcttct atactattag 780

cagctgtctt ttgatatttg acagcttgac ttgatatggt cacagagcat acttgcaacc 840

aggttacctg ttgaactagc aactgtgccc aagcatctct tcaagcacct ccgtcagtcc 900

atagggtact gttgatttgt actctgcaat actgcactgt aatgcgctgt gaatcactgc 960

ccttcacctc tagatggtgc ttccctggag ccctccccca cctccgcctc aagcccctca 1020

catgcctctc ccccccctgc agtgatgtca tccaacgggg acacgaatgt cgtgctgcgg 1080

ggggtcaccc acggggctgt ggactttctg atcaagcccg ttcgaattga ggagctgcgg 1140

aacgtgtggc agcacgtggt gcgtcgtcgt tccatggcgc tggccaggac gccagacgag 1200

gggggacact cggacgagga ctctcaggtg cccttggcag cttctgggcg gcttgctgtg 1260

tcggatgcca cttggactgg ggatgcacga ggggtggggg gacaatggga gatgggccat 1320

agtaggccag agttgatggc agtggtggtg ggggggagta ggcgggagag aagcagccat 1380

cctggtgttg gttttgatga ttgagtgcat ggggatgatg cacaggtgag ctgactggat 1440

gccttgtctt gctgtgctgc gctgcagcgg cacagtgtga aacgcaagga gtcggagcag 1500

agcccgctgc agctcagcac agagcagggc gggaacaaga agccaagagt ggtgtggtcg 1560

gtggagatgc accaacaggt gtgcttgcgg gcgggtgtat acgggggagg ggggccagct 1620

gctggctgac ctggcgtgcg cggtgcattg cacttggcga tgaggggcgt gcttcagtat 1680

gtagctggga cgcaattggt tgtgctgtgt gaccagtgca caaaatacat ccctgaattc 1740

cagtgggttg aacagagttg tcctggaggt gggaagcaaa cgcgcacgtg gtagagggga 1800

gcagggtgca gaacagccgc agcaggggtg ttgcgcagtg tgcaggtatc ctgcctccat 1860

gccccgggcc atgggcatac tacgctggta ccgtcaggat gggcgttgag cctggcttgg 1920

ggggcagggg gcgagcgaat gcggaatggg agcggcaggt gctgggaggg tggctgactg 1980

gcttgcagga gcgcaagtcc tgtcgggggc gtcgtcctgt tccctcctgc ccgcttcacc 2040

cacgttcact ctcatgcctc cacactcctg ctgctgacac acctgtcgcc acctccgctg 2100

cagtttgtga acgcggtcaa ctccctgggc attgacaagg cggtgcccaa gcggattctg 2160

gacctgatga acgtggaggg gctgacgcgc gagaacgtgg ccagccatct gcaggtgcct 2220

gccatgaccc ctcccaccag ggacctggtg ttttgacacc ctggaactcc tctttgacgg 2280

agcctccagt tcaattccag caatcgaatt gaatcaaaaa gcatgtgcac ccacgtgctg 2340

tttgaatgtc ccatgtggta ggaaacacaa ctgccccctt gccatttgct ggagggtgcc 2400

cgctgcgcca tgcccgagtg cgctgtgctc agcgttgtgc tgcgcccccc gctgactgaa 2460

gctgacagcg tgcggctgag gagggtactg ggggaggggg ggtgggaggc ggccgctggc 2520

ggcggaaggg agggtgtgca cgcatggaca cagggccttt ccgccctgca cggcctctac 2580

tgcaccctgc cacgtgatgt atcgacatgg tgggccatgc tgtgctgtgc cgctgcagaa 2640

gtaccgcctg tacctgaagc gggtggaggg agtgcaatcg ggtgcggcag cctccaagca 2700

gcaccagcac ccgcagtatc accagcagca gcagcagcag caagcgcaac ctcgtgcagc 2760

tgtctcccct gcagcagctt cctttggtgc cctttccttg ggagccccgc agcaggcgca 2820

gcagggcatg ccgcagctgg ggatgcctgt gcaggtgaag actgcccccc cccccctccc 2880

cctttccatc ttccctccat cagcctgctg ttccttaccc ttgtcaaccc gtctctcctt 2940

tttcgcaagc agcgcaccac cccccatgca cgccttgcct ggcactgttg tcagctgccc 3000

ccctagaaat acacaaggtg tgggtgcaac tggtgggacc ccctcccccc cccccctggg 3060

gctgcagggt ctccctccaa acttggcagc catgggatcc cagccgccgc acatcccctt 3120

ccagcaggcc ctggccatgc aggcggcggc tgcggcggct gcagccagcg gcgcgctccc 3180

cgggagtctg cccccctaca tgccaccccc ggggatgatg ccccccggca tgccgggggg 3240

ggtccccggt atgggagggg tggtggggca tcctcaggta cgggcagcac atgagtgggc 3300

aggggtattg gagaggggaa gggcagggag gttgcatgtg aggggctgca tggcaaagag 3360

gctgcagcgc aggtgttgct tgcagcactt cccctcggtg gcgcttgcat caaattttga 3420

atcctccccc gatgggcacg cccgtgtgtg ggggggggtg ggatggggga tgggggtggt 3480

tttgtggcat gtcgggcgct ttcatctacc cgggcccctg cccctgcctg tacgcgtgcg 3540

catgtgtgca gatgcccgcc ccagggatgg actttgcggg tttcaacggg tatggcaacg 3600

ctgcgggggg gctgatgttt ggcgggcagc agcaggcgca gcacgcgcag cagcacgcgt 3660

cagcgcaagc gggctcgctg gcgcagcagc aggcgcagca agtatccatg ggcttgggcc 3720

ttatgccccc cccgttgggg ttcccgccca cctcgctcgc cgcgccagcc ccgcgctccg 3780

cagcaactga gcccgccgca gccccactcc ccctgacgtc ctcgccgcca gctgcttcag 3840

caggcggcag cggcggccca gcagcagctg ctccgcagca cagcagcggc gccgcagcag 3900

cccaagcccc ccatcaccac ccacagtgct cggagcaggg agcggggggg ctcccgcccc 3960

cgctgcccgc gtccagcgcc ccgcagtcct atcccctccc tcccccctcc tcgcaggccg 4020

ctttgcatga cccggacgaa cactaccccc caggctcggc agaggtgagc acgtcccccc 4080

gccccctccc cccccccccc cccccttccc ttcaccctgg cttggcgtgc aatgaaaccc 4140

taaataaccc taaaacctca ttatcagttg caaattggac ccgtgaagcg ggcgggggca 4200

actgcgctct gctggtgtca gcgctgtctc tgccggttcc tgcccagcgt gcgcctgcat 4260

gcaagggggg atgggggggg ggaggcattt aacaataggc cagtcatctc caatccaccg 4320

tcaatttcag ccccctcccc ccccctccct catccccttg cagatgcacc accagcacct 4380

cccagggctg tgtggcttta acccggacga cctgctgggg gggcagctgg gggacatggg 4440

gttcctgggg gagctggggg gggcggtggg aggaaagcac gaacaggacg acttcctgga 4500

cctgctgctg aagggggagg aggagctgtg a 4531

<210> 2

<211> 1860

<212> DNA

<213> 拟小球藻属

<400> 2

atgtctggtt cagctggatc gggccaggct actctcagac atgacggtgg ctctgctggc 60

ggcagtgggc ctgtctcaga cggtttttca ccggccggcc tgaaggttct ggtcgtggac 120

gacgacctca tgtgccttaa ggtggtgtca gccatgttga agaggtgcag ctatcaagtg 180

gccacttgta gcagtggcag cgaggcactg acacttctac gtgaacgcaa cgaggacgga 240

tcctccgacc agttcgacct cgtactgtca gatgtttaca tgccggacat ggacggtttc 300

aagctgcttg aacacatcgg tctagagttg gagcttcccg ttatcatgat gtcatccaac 360

ggggacacga atgtcgtgct gcggggggtc acccacgggg ctgtggactt tctgatcaag 420

cccgttcgaa ttgaggagct gcggaacgtg tggcagcacg tggtgcgtcg tcgttccatg 480

gcgctggcca ggacgccaga cgagggggga cactcggacg aggactctca gcggcacagt 540

gtgaaacgca aggagtcgga gcagagcccg ctgcagctca gcacagagca gggcgggaac 600

aagaagccaa gagtggtgtg gtcggtggag atgcaccaac agtttgtgaa cgcggtcaac 660

tccctgggca ttgacaaggc ggtgcccaag cggattctgg acctgatgaa cgtggagggg 720

ctgacgcgcg agaacgtggc cagccatctg cagaagtacc gcctgtacct gaagcgggtg 780

gagggagtgc aatcgggtgc ggcagcctcc aagcagcacc agcacccgca gtatcaccag 840

cagcagcagc agcagcaagc gcaacctcgt gcagctgtct cccctgcagc agcttccttt 900

ggtgcccttt ccttgggagc cccgcagcag gcgcagcagg gcatgccgca gctggggatg 960

cctgtgcagg gtctccctcc aaacttggca gccatgggat cccagccgcc gcacatcccc 1020

ttccagcagg ccctggccat gcaggcggcg gctgcggcgg ctgcagccag cggcgcgctc 1080

cccgggagtc tgccccccta catgccaccc ccggggatga tgccccccgg catgccgggg 1140

ggggtccccg gtatgggagg ggtggtgggg catcctcaga tgcccgcccc agggatggac 1200

tttgcgggtt tcaacgggta tggcaacgct gcgggggggc tgatgtttgg cgggcagcag 1260

caggcgcagc acgcgcagca gcacgcgtca gcgcaagcgg gctcgctggc gcagcagcag 1320

gcgcagcaag tatccatggg cttgggcctt atgccccccc cgttggggtt cccgcccacc 1380

tcgctcgccg cgccagcccc gcgctccgca gcaactgagc ccgccgcagc cccactcccc 1440

ctgacgtcct cgccgccagc tgcttcagca ggcggcagcg gcggcccagc agcagctgct 1500

ccgcagcaca gcagcggcgc cgcagcagcc caagcccccc atcaccaccc acagtgctcg 1560

gagcagggag cgggggggct cccgcccccg ctgcccgcgt ccagcgcccc gcagtcctat 1620

cccctccctc ccccctcctc gcaggccgct ttgcatgacc cggacgaaca ctacccccca 1680

ggctcggcag agatgcacca ccagcacctc ccagggctgt gtggctttaa cccggacgac 1740

ctgctggggg ggcagctggg ggacatgggg ttcctggggg agctgggggg ggcggtggga 1800

ggaaagcacg aacaggacga cttcctggac ctgctgctga agggggagga ggagctgtga 1860

<210> 3

<211> 619

<212> PRT

<213> 拟小球藻属

<400> 3

Met Ser Gly Ser Ala Gly Ser Gly Gln Ala Thr Leu Arg His Asp Gly

1 5 10 15

Gly Ser Ala Gly Gly Ser Gly Pro Val Ser Asp Gly Phe Ser Pro Ala

20 25 30

Gly Leu Lys Val Leu Val Val Asp Asp Asp Leu Met Cys Leu Lys Val

35 40 45

Val Ser Ala Met Leu Lys Arg Cys Ser Tyr Gln Val Ala Thr Cys Ser

50 55 60

Ser Gly Ser Glu Ala Leu Thr Leu Leu Arg Glu Arg Asn Glu Asp Gly

65 70 75 80

Ser Ser Asp Gln Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp

85 90 95

Met Asp Gly Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu Glu Leu

100 105 110

Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Asn Val Val Leu Arg

115 120 125

Gly Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Ile

130 135 140

Glu Glu Leu Arg Asn Val Trp Gln His Val Val Arg Arg Arg Ser Met

145 150 155 160

Ala Leu Ala Arg Thr Pro Asp Glu Gly Gly His Ser Asp Glu Asp Ser

165 170 175

Gln Arg His Ser Val Lys Arg Lys Glu Ser Glu Gln Ser Pro Leu Gln

180 185 190

Leu Ser Thr Glu Gln Gly Gly Asn Lys Lys Pro Arg Val Val Trp Ser

195 200 205

Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Ser Leu Gly Ile

210 215 220

Asp Lys Ala Val Pro Lys Arg Ile Leu Asp Leu Met Asn Val Glu Gly

225 230 235 240

Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr

245 250 255

Leu Lys Arg Val Glu Gly Val Gln Ser Gly Ala Ala Ala Ser Lys Gln

260 265 270

His Gln His Pro Gln Tyr His Gln Gln Gln Gln Gln Gln Gln Ala Gln

275 280 285

Pro Arg Ala Ala Val Ser Pro Ala Ala Ala Ser Phe Gly Ala Leu Ser

290 295 300

Leu Gly Ala Pro Gln Gln Ala Gln Gln Gly Met Pro Gln Leu Gly Met

305 310 315 320

Pro Val Gln Gly Leu Pro Pro Asn Leu Ala Ala Met Gly Ser Gln Pro

325 330 335

Pro His Ile Pro Phe Gln Gln Ala Leu Ala Met Gln Ala Ala Ala Ala

340 345 350

Ala Ala Ala Ala Ser Gly Ala Leu Pro Gly Ser Leu Pro Pro Tyr Met

355 360 365

Pro Pro Pro Gly Met Met Pro Pro Gly Met Pro Gly Gly Val Pro Gly

370 375 380

Met Gly Gly Val Val Gly His Pro Gln Met Pro Ala Pro Gly Met Asp

385 390 395 400

Phe Ala Gly Phe Asn Gly Tyr Gly Asn Ala Ala Gly Gly Leu Met Phe

405 410 415

Gly Gly Gln Gln Gln Ala Gln His Ala Gln Gln His Ala Ser Ala Gln

420 425 430

Ala Gly Ser Leu Ala Gln Gln Gln Ala Gln Gln Val Ser Met Gly Leu

435 440 445

Gly Leu Met Pro Pro Pro Leu Gly Phe Pro Pro Thr Ser Leu Ala Ala

450 455 460

Pro Ala Pro Arg Ser Ala Ala Thr Glu Pro Ala Ala Ala Pro Leu Pro

465 470 475 480

Leu Thr Ser Ser Pro Pro Ala Ala Ser Ala Gly Gly Ser Gly Gly Pro

485 490 495

Ala Ala Ala Ala Pro Gln His Ser Ser Gly Ala Ala Ala Ala Gln Ala

500 505 510

Pro His His His Pro Gln Cys Ser Glu Gln Gly Ala Gly Gly Leu Pro

515 520 525

Pro Pro Leu Pro Ala Ser Ser Ala Pro Gln Ser Tyr Pro Leu Pro Pro

530 535 540

Pro Ser Ser Gln Ala Ala Leu His Asp Pro Asp Glu His Tyr Pro Pro

545 550 555 560

Gly Ser Ala Glu Met His His Gln His Leu Pro Gly Leu Cys Gly Phe

565 570 575

Asn Pro Asp Asp Leu Leu Gly Gly Gln Leu Gly Asp Met Gly Phe Leu

580 585 590

Gly Glu Leu Gly Gly Ala Val Gly Gly Lys His Glu Gln Asp Asp Phe

595 600 605

Leu Asp Leu Leu Leu Lys Gly Glu Glu Glu Leu

610 615

<210> 4

<211> 2802

<212> DNA

<213> 拟小球藻属

<400> 4

atggctgccc ccccagtatc tatctcttcc aattttccaa aggttagtat ttacgtaaca 60

tttgccgaca gttgggcaat aacgctgagt tggagtgttg ccaacaagct tttgtgccgt 120

ttccagggtt tgcgggttct cttggtcgat caacagccaa gtaggagcca tattgaagcg 180

cagctgatgc agccggatct taattacaca ggtttgctgc agttttgcac attccaagct 240

tggcctttct ccgtgccaaa cccagcgcgc tgagctcttg ttgtttgttg cagttactgg 300

ttgcgagagc gtttctgaag ctctttcata ttgccgctcg ggagtaagca gctttgacgt 360

ggtgcttgcg gaggtgggtg gtagtgcttc gcgaggtgca cagtgcgcac cgggaaaaac 420

ttgaaaagta tttgtaaaat taattttgaa acttctgtat tattttacac ctcttaacaa 480

tgcacccaat gtttgttatg agcgccacgt taccggacgt ttgttgcagg caaggatcgt 540

tgccgtcgac gagacttcgg ggcgcgcatt tatcgactct cttgaagata caccggttat 600

tcttatgtcg gagggaagca cgacgggcga cgttcttcgt gcggtgaagc ttggagctgt 660

ggactggctg gataagcctc tctccgtcct gaagctcaag aacatctggc agcactcagt 720

gcgtaaggta ggtccggaaa caatatgata tctacaattg gaaagttacc agtgtcaact 780

atggaaaacg ttgtactggg tgctagtttc agttgagcca gttgcctgta tatgcatata 840

aggggcagtg acgcagtgtc tgccctgtga gctgtgatgc atcagggtgc acctgaagct 900

ggcagtggat cactccaccc aagatgttgt tgcaatccaa tgtgttgctg atgccttgct 960

tttctgactt gcaaacatgg tgtgggataa aagcgttgct agacagccac cgtgctccac 1020

gttgtcttct gcatgcaaaa ctgcagatga tgcagcgcac cacgttttac gacacttgct 1080

ccgagcagcc aacccagccg gcgcgcagca agctttcttc aggaatcgaa tcgccgagca 1140

cacccacgct gggagactct gtggacttgg acgccatctc ggcggcttcc ttcggcagca 1200

tcaaggactt gaccgatttt tcattttcca gcggagctga ggtgggcatc gctggttgtc 1260

cagcactgca gcatttccca ccagcttggt tggttgcctg tgttttagtg cagagcagag 1320

gccgaggcta ctggttcaac cagcctagtt actcaaaaca attttggcaa cctgctgact 1380

tctctcttaa cctgcagagc gtttcacagc atgtacactt ttcagtgggt ttcgtaattt 1440

cgtagcgcac ccgctggctt ttttctgcag gtcctgagag cctcctttga cagctgtgac 1500

ggctccgagg tcaacctagg cagcgctttg ggccagcctc gcccccctct ggcagtcaag 1560

cccagctcct ttggccccct ggtgagtggc atagctcagc aggagaccca caagtggctg 1620

gaacccacca tgttggcgcg caccctgccc tcgcacgcgc ggctgccgtc tgcgcagcgc 1680

gccgcggtgc gccgctgtgg agttgtggtg ttgcggagtc actcgcgggc cagtgcttca 1740

cagcccattc tcgccgcaca caccctgccc gcacaaatgg ctgccacccc cctaaagcgg 1800

tcctcaggcc acccggatgt gctcaggatg gatccgccga atctcgcacc cctcctccct 1860

caatcccggt tgttcagacg gtttggaacc cctccggctc tctacccctc tgcaggtacc 1920

cgtccctccc acctcccagt ggccccagct gcaggctggc tgcgtgtggg gcactcccgt 1980

gggcggcccg ctggcgcccc cctccatgac caacgcccag catggtgccc cccacagcgt 2040

gcccctggca gacgcacact tggccggcag cgccagttac atgtccctct cctctgtgag 2100

tctcctcccc tccaccccta catcttccaa tcgaacatgc gacgcacgca cacccatagt 2160

ccctaaacaa gtgctttggt gttttttcac ttgcaaaccc caaccctgac acctgaagcg 2220

tgacacaggc gactgcgctg ctccccgccc ccacacgccc ttggttgttt gtgccctgca 2280

cttctgccac gacatgcatg tcatgtcttt tcacgcctgc gatgtcgctg cttaaacttg 2340

aaactcattg tggccggggt gcagctcatg gaggaggaca ccccctgtcc cttggacatg 2400

gatgcaccag aggacgggat gcagcttcct gttgacttcc tgtctgttgc caacgtcagc 2460

agcaatggta ggtccagcac cagacgcctc tgtctgctat gagacgcacc tccagccgcc 2520

ccctctggac agacagcgcg ctgcacgctc tgcgcgctgg accttgccgc acacgcgcgc 2580

gacaaggcct ggtgtgatgc ttggatgtgg aaggttccag catggttgga caagatggta 2640

tcctggcaca catattggta tgcagcatac acccaggctg cccccttacc ctcgcacgcc 2700

ctacccctta ctgcaggcag cggtcccatt gggttgaagc tgaagaaaag caacagcctg 2760

ctgaacatga tcaacgcagc gctgatgtct ggtggtcagt ga 2802

<210> 5

<211> 359

<212> PRT

<213> 拟小球藻属

<400> 5

Met Ala Ala Pro Pro Val Ser Ile Ser Ser Asn Phe Pro Lys Gly Leu

1 5 10 15

Arg Val Leu Leu Val Asp Gln Gln Pro Ser Arg Ser His Ile Glu Ala

20 25 30

Gln Leu Met Gln Pro Asp Leu Asn Tyr Thr Val Thr Gly Cys Glu Ser

35 40 45

Val Ser Glu Ala Leu Ser Tyr Cys Arg Ser Gly Val Ser Ser Phe Asp

50 55 60

Val Val Leu Ala Glu Ala Arg Ile Val Ala Val Asp Glu Thr Ser Gly

65 70 75 80

Arg Ala Phe Ile Asp Ser Leu Glu Asp Thr Pro Val Ile Leu Met Ser

85 90 95

Glu Gly Ser Thr Thr Gly Asp Val Leu Arg Ala Val Lys Leu Gly Ala

100 105 110

Val Asp Trp Leu Asp Lys Pro Leu Ser Val Leu Lys Leu Lys Asn Ile

115 120 125

Trp Gln His Ser Val Arg Lys Met Met Gln Arg Thr Thr Phe Tyr Asp

130 135 140

Thr Cys Ser Glu Gln Pro Thr Gln Pro Ala Arg Ser Lys Leu Ser Ser

145 150 155 160

Gly Ile Glu Ser Pro Ser Thr Pro Thr Leu Gly Asp Ser Val Asp Leu

165 170 175

Asp Ala Ile Ser Ala Ala Ser Phe Gly Ser Ile Lys Asp Leu Thr Asp

180 185 190

Phe Ser Phe Ser Ser Gly Ala Glu Val Leu Arg Ala Ser Phe Asp Ser

195 200 205

Cys Asp Gly Ser Glu Val Asn Leu Gly Ser Ala Leu Gly Gln Pro Arg

210 215 220

Pro Pro Leu Ala Val Lys Pro Ser Ser Phe Gly Pro Leu Val Pro Val

225 230 235 240

Pro Pro Thr Ser Gln Trp Pro Gln Leu Gln Ala Gly Cys Val Trp Gly

245 250 255

Thr Pro Val Gly Gly Pro Leu Ala Pro Pro Ser Met Thr Asn Ala Gln

260 265 270

His Gly Ala Pro His Ser Val Pro Leu Ala Asp Ala His Leu Ala Gly

275 280 285

Ser Ala Ser Tyr Met Ser Leu Ser Ser Leu Met Glu Glu Asp Thr Pro

290 295 300

Cys Pro Leu Asp Met Asp Ala Pro Glu Asp Gly Met Gln Leu Pro Val

305 310 315 320

Asp Phe Leu Ser Val Ala Asn Val Ser Ser Asn Gly Ser Gly Pro Ile

325 330 335

Gly Leu Lys Leu Lys Lys Ser Asn Ser Leu Leu Asn Met Ile Asn Ala

340 345 350

Ala Leu Met Ser Gly Gly Gln

355

<210> 6

<211> 119

<212> PRT

<213> 拟小球藻属

<400> 6

Gly Leu Arg Val Leu Leu Val Asp Gln Gln Pro Ser Arg Ser His Ile

1 5 10 15

Glu Ala Gln Leu Met Gln Asp Leu Asn Tyr Thr Val Thr Gly Cys Glu

20 25 30

Ser Val Ser Glu Ala Leu Ser Tyr Cys Arg Ser Gly Val Ser Ser Phe

35 40 45

Asp Val Val Leu Ala Glu Ala Arg Ile Val Ala Val Asp Glu Thr Ser

50 55 60

Gly Arg Ala Phe Ile Asp Ser Leu Glu Asp Thr Pro Val Ile Leu Met

65 70 75 80

Ser Glu Gly Ser Thr Thr Gly Asp Val Leu Arg Ala Val Lys Leu Gly

85 90 95

Ala Val Asp Trp Leu Asp Lys Pro Leu Ser Val Leu Lys Leu Lys Asn

100 105 110

Ile Trp Gln His Ser Val Arg

115

<210> 7

<211> 1080

<212> DNA

<213> 拟小球藻属

<400> 7

atggctgccc ccccagtatc tatctcttcc aattttccaa agggtttgcg ggttctcttg 60

gtcgatcaac agccaagtag gagccatatt gaagcgcagc tgatgcagcc ggatcttaat 120

tacacagtta ctggttgcga gagcgtttct gaagctcttt catattgccg ctcgggagta 180

agcagctttg acgtggtgct tgcggaggca aggatcgttg ccgtcgacga gacttcgggg 240

cgcgcattta tcgactctct tgaagataca ccggttattc ttatgtcgga gggaagcacg 300

acgggcgacg ttcttcgtgc ggtgaagctt ggagctgtgg actggctgga taagcctctc 360

tccgtcctga agctcaagaa catctggcag cactcagtgc gtaagatgat gcagcgcacc 420

acgttttacg acacttgctc cgagcagcca acccagccgg cgcgcagcaa gctttcttca 480

ggaatcgaat cgccgagcac acccacgctg ggagactctg tggacttgga cgccatctcg 540

gcggcttcct tcggcagcat caaggacttg accgattttt cattttccag cggagctgag 600

gtcctgagag cctcctttga cagctgtgac ggctccgagg tcaacctagg cagcgctttg 660

ggccagcctc gcccccctct ggcagtcaag cccagctcct ttggccccct ggtacccgtc 720

cctcccacct cccagtggcc ccagctgcag gctggctgcg tgtggggcac tcccgtgggc 780

ggcccgctgg cgcccccctc catgaccaac gcccagcatg gtgcccccca cagcgtgccc 840

ctggcagacg cacacttggc cggcagcgcc agttacatgt ccctctcctc tctcatggag 900

gaggacaccc cctgtccctt ggacatggat gcaccagagg acgggatgca gcttcctgtt 960

gacttcctgt ctgttgccaa cgtcagcagc aatggcagcg gtcccattgg gttgaagctg 1020

aagaaaagca acagcctgct gaacatgatc aacgcagcgc tgatgtctgg tggtcagtga 1080

<210> 8

<211> 1707

<212> DNA

<213> 拟小球藻属

<400> 8

atgcttcggc agcagctgtt gcacagcggc aggcagccgg gtgcgacatg cagcttacta 60

acctgctcga catggcgacc gtctgccttg ttcggccgtc ctaagcccca aaaactgcac 120

agccagcgct tgcagcatca gggccgcccc tcccgcctcg tcgtgcgcag cgcaatgttc 180

gacaacctga gccgcagcct ggagagggcg tgggacatgg tgcgcaagga cgggcggcta 240

acggcggaca acatcaagga gcccatgcgg gagattcgca gggcgctgct tgaggcggat 300

gtgaggctgg gggcgccgct gatcagattc ttggtatcta cccccccccc ctcccaggtc 360

tccctccccg tggtgcgcaa gtttgtgaag gcggtggagg agaaggcgct gggttctgca 420

gtgaccaagg gtgtcacccc cgaccagcag ctggtgaagg tggtgtacga ccagctgcgg 480

gagctgatgg gggggcagca ggaagggctg gtgcccactt cgccagagga gccgcaggtg 540

atcttgatgg cggggctgca gggcacgggg aagacgacag ctgcggggaa gctggccttg 600

ttcctgcaga agaaggggca gaaggtgctg ctggtggcca ccgacatcta ccgccccgcc 660

gccatcgacc agctggtgaa gctgggcgac aggatagggg tgccggtgtt ccagctggga 720

acccaggtgc agccgccgga gattgcaagg caggggctgg agaaggcgcg agcagagggg 780

tttgacgccg tcatcgtcga cacggcgggg cggctgcaga tcgaccagag catgatggag 840

gagctggtgc agatcaagtc cacggtgaag ccctccgaca cgctgctagt ggtcgatgcg 900

atgacggggc aggaggcagc cgggctggtg aaggcgttca atgatgccgt ggacatcaca 960

ggcgccgtgc tgaccaagct tgacggggac agccgcggcg gcgccgcgct gagcgtgcgc 1020

caggtcagcg ggcggcccat caagtttgtg ggcatggggg agggcatgga ggcgctggag 1080

cccttctacc ccgagcgcat ggccagcagg attctgggca tgggtgacgt ggtcaccctg 1140

gtggagaagg ctgaggagag catcaaggaa gaggaggcgc aggagatatc gcggaagatg 1200

ctgtcggcca aatttgactt tgacgacttc ctgaagcagt acaagatggt ggcggggatg 1260

gggaacatgg cccaaatcat gaagatgctg ccaggcatga acaagtttac ggagaagcag 1320

ctggcgggcg ttgagaagca gtacaaggtg tacgagagca tgatccagag catgacggtg 1380

aaggagcgca agcagccgga gctgttggtg aagtcgccct ccaggaggcg gcgcatagcg 1440

cgcgggtcgg ggcgctcgga gcgggaggtc acagagctgc tgggggtgtt caccaacctg 1500

cggacgcaga tgcagagctt ctccaaaatg atggccatgg gggggatggg catgggctcc 1560

atgatgagcg acgaggagat gatgcaggcc acgctggcag gcgccggccc ccgccccgtg 1620

ccagctggca aggtgcggcg gaagaagctg gccgcggcgg gcgggtcgcg gggcatggct 1680

gagctggcat ccctgaaggc agaatga 1707

<210> 9

<211> 302

<212> PRT

<213> 胶球藻

<400> 9

Met Gly Leu Lys Ala Arg Ala Ala Ser Val Ser Val His Ser Ser Ala

1 5 10 15

Asn Asn Thr Ala Ser Pro Leu Ser Ser Gly Arg Arg Gly Phe Pro His

20 25 30

Ser Gly Glu Met Ser Gly Glu Asp Leu Ala Arg Ser Asp Ser Trp Glu

35 40 45

Met Phe Pro Ala Gly Leu Lys Val Leu Val Val Asp Asp Asp Pro Leu

50 55 60

Cys Leu Lys Val Val Glu His Met Leu Arg Arg Cys Asn Tyr Gln Val

65 70 75 80

Thr Thr Cys Pro Asn Gly Lys Ala Ala Leu Glu Lys Leu Arg Asp Arg

85 90 95

Ser Val His Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp Met

100 105 110

Asp Gly Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu Asp Leu Pro

115 120 125

Val Ile Met Met Ser Ser Asn Gly Glu Thr Asn Val Val Leu Arg Gly

130 135 140

Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Val Glu

145 150 155 160

Glu Leu Arg Asn Val Trp Gln His Val Val Arg Arg Lys Arg Asp Gln

165 170 175

Ala Val Ser Gln Ala Arg Asp Ser Arg Asp Ile Ser Asp Glu Glu Gly

180 185 190

Thr Asp Asp Gly Lys Pro Arg Asp Lys Lys Arg Lys Glu Val Ile Leu

195 200 205

Val Leu Trp Trp Asp Met Gln Arg Arg Asp Ser Asp Asp Gly Val Ser

210 215 220

Ala Lys Lys Ala Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe

225 230 235 240

Val Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg

245 250 255

Ile Leu Asp Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala

260 265 270

Ser His Leu Gln Val Pro His Leu Ser Ile Phe Ser Pro Leu Phe Ala

275 280 285

Glu Leu Met Ser Thr Leu Pro Arg Arg Cys Phe Tyr Asp Phe

290 295 300

<210> 10

<211> 269

<212> PRT

<213> 海洋发光蚝球藻

<400> 10

Phe Pro Ala Gly Leu Gly Val Leu Val Val Asp Asp Asp Leu Leu Cys

1 5 10 15

Leu Lys Val Val Glu Lys Met Leu Lys Ala Cys Lys Tyr Lys Val Thr

20 25 30

Ala Cys Ser Thr Ala Lys Thr Ala Leu Glu Ile Leu Arg Thr Arg Lys

35 40 45

Glu Glu Phe Asp Ile Val Leu Ser Asp Val His Met Pro Asp Met Asp

50 55 60

Gly Phe Lys Leu Leu Glu Ile Ile Gln Phe Glu Leu Ala Leu Pro Val

65 70 75 80

Leu Met Met Ser Ala Asn Ser Asp Ser Ser Val Val Leu Arg Gly Ile

85 90 95

Ile His Gly Ala Val Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu

100 105 110

Leu Arg Asn Ile Trp Gln His Val Val Arg Arg Asp Tyr Ser Ser Ala

115 120 125

Lys Ser Ser Gly Ser Glu Asp Val Glu Ala Ser Ser Pro Ser Lys Arg

130 135 140

Ala Lys Thr Ser Gly Ser Asn Ser Lys Ser Glu Glu Val Asp Arg Thr

145 150 155 160

Ala Ser Glu Met Ser Ser Gly Lys Ala Arg Lys Lys Pro Thr Gly Lys

165 170 175

Lys Gly Gly Lys Ser Val Lys Glu Ala Glu Lys Lys Asp Val Val Asp

180 185 190

Asn Ser Asn Ser Lys Lys Pro Arg Val Val Trp Ser Ala Glu Leu His

195 200 205

Ala Gln Phe Val Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val

210 215 220

Pro Lys Arg Ile Leu Asp Leu Met Gly Val Gln Gly Leu Thr Glu Asn

225 230 235 240

Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln

245 250 255

Gly Asn Asp Ala Arg Gly Gly Gly Asn Ala Ser Ser Thr

260 265

<210> 11

<211> 941

<212> PRT

<213> 莱茵衣藻

<400> 11

Met Asp Ser Gln Gly Val Lys Leu Glu Glu His Pro Gly His Thr Gly

1 5 10 15

Gly His Trp Gln Gly Phe Pro Ala Gly Leu Arg Leu Leu Val Val Asp

20 25 30

Asp Asp Pro Leu Cys Leu Lys Val Val Glu Gln Met Leu Arg Lys Cys

35 40 45

Ser Tyr Glu Val Thr Val Cys Ser Asn Ala Thr Thr Ala Leu Asn Ile

50 55 60

Leu Arg Asp Lys Asn Thr Glu Tyr Asp Leu Val Leu Ser Asp Val Tyr

65 70 75 80

Met Pro Asp Met Asp Gly Phe Arg Leu Leu Glu Leu Val Gly Leu Glu

85 90 95

Met Asp Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Ser Asn

100 105 110

Val Leu Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile Lys Pro

115 120 125

Val Arg Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val Arg Arg

130 135 140

Arg Arg Gln His Ala Gln Glu Ile Asp Ser Asp Glu Gln Ser Gln Glu

145 150 155 160

Arg Asp Glu Asp Gln Thr Arg Asn Lys Arg Lys Ala Asp Ala Ala Gly

165 170 175

Val Thr Gly Asp Gln Cys Arg Leu Asn Gly Ser Gly Ser Gly Gly Ala

180 185 190

Ala Gly Pro Gly Ser Gly Gly Gly Ala Gly Gly Met Thr Asp Glu Met

195 200 205

Leu Met Met Ser Gly Gly Glu Asn Gly Ser Asn Lys Lys Ala Arg Val

210 215 220

Val Trp Ser Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln

225 230 235 240

Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Ile Met Gly

245 250 255

Val Asp Gly Ser Ala Gly Arg Leu Ala Asp Thr Ser Gly Arg Asp Val

260 265 270

Cys Gly Thr Val Tyr Arg Leu Tyr Leu Lys Arg Val Ser Gly Val Thr

275 280 285

Pro Ser Gly His His His Asn Ala Ala His Lys Ser Asn Lys Pro Ser

290 295 300

Pro His Thr Thr Pro Pro Pro Pro Ala Leu Pro Gly Gln Ala Gly Thr

305 310 315 320

His Pro Ala Asn Gln Ala Thr Ala Ile Pro Pro Pro Pro Gln Pro Gly

325 330 335

Ser Gly Thr Ala Ala Gly Ala Gly Ala Ala Ala Ala Gly Thr Gly Gly

340 345 350

Gly Ala Ala Ala Ala Asn Gly His Ala Ala Thr Thr Gly Ala Gly Thr

355 360 365

Pro Gly Ala Ala Pro Gly Ala Gly Gly Gly Val Gly Gly Thr Gly Ala

370 375 380

Gly Gly Leu Gly Ser Gly Pro Asp Gly Ala Ala Ala Ala Ala Gly Pro

385 390 395 400

Gly Pro Gly Ala Ala Val Pro Gly Gly Leu Gly Gly Leu Pro Leu Pro

405 410 415

Pro Gly Ala Gly Pro Gly Pro Gly Pro Gly Gly Phe Gly Gly Pro Ser

420 425 430

Pro Pro Pro Pro Pro His Pro Ala Ala Leu Leu Ala Asn Pro Met Ala

435 440 445

Ala Ala Val Ala Gly Leu Asn Gln Ser Leu Leu Asn Ala Met Gly Ser

450 455 460

Leu Gly Val Gly Val Gly Gly Met Ser Pro Leu Gly Pro Val Gly Pro

465 470 475 480

Leu Gly Pro Leu Gly Gly Leu Pro Gly Leu Pro Gly Met Gln Pro Pro

485 490 495

Pro Leu Gly Met Gly Gly Leu Gln Pro Gly Met Gly Pro Leu Gly Pro

500 505 510

Leu Gly Leu Pro Gly Met Gly Gly Leu Pro Gly Leu Pro Gly Met Asn

515 520 525

Pro Met Ala Asn Leu Met Gln Gly Met Ala Ala Gly Met Ala Ala Ala

530 535 540

Asn Gln Met Asn Gly Met Gly Gly His Met Gly Gly His Met Gly Gly

545 550 555 560

Met Asn Gly Pro Met Gly Ala Leu Ala Gly Met Asn Gly Leu Asn Gly

565 570 575

Ala Met Met Gly Gly Leu Pro Gly Met Gly Gly Pro Gln Asn Met Phe

580 585 590

Gln Ala Ala Ala Ala Ala Ala Ala Gln Gln Gln Gln Gln Gln Gln Glu

595 600 605

Gln Gln His Ala Met Met Gln Gln Ala Ala Ala Gly Leu Leu Ala Ser

610 615 620

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Ala

625 630 635 640

Leu Gln Gln Gln Gln Gln Gln Gly Met Ala Val Ser Pro Pro Gly Pro

645 650 655

His Asn Ala Thr Pro Asn Gly Gln Leu His Thr His Pro Gln Ala His

660 665 670

His Pro His Gln His Gly Leu His Ala His Ala His Pro His Gln His

675 680 685

Leu Asn Thr Ala Pro Ala Gly Ala Leu Gly Leu Ser Pro Pro Gln Pro

690 695 700

Pro Ala Gly Leu Leu Ser Ala Ser Gly Leu Ser Ser Gly Pro Asp Gly

705 710 715 720

Ser Gly Leu Gly Ser Gly Val Gly Gly Leu Leu Asp Gly Leu Gln Gln

725 730 735

His Pro His His Pro Gln Leu Gln Leu Ala Gly Ser Leu Gly Thr Gly

740 745 750

Gly Thr Gly Arg Ser Ser Gly Ala Ala Gly Arg Gly Ser Leu Asp Leu

755 760 765

Pro Ala Asp Leu Met Gly Met Ala Leu Leu Asp Phe Pro Pro Val Pro

770 775 780

Val Pro Gly Gly Ala Asp Val Gly Met Ala Gly Ala Gly Gly Gly Ala

785 790 795 800

Ala Gly Ala His His His Gly His Gln Gly His Gln Gly Ile Gly Gly

805 810 815

Gly Ala Gly Val Gly Ile Ala Gly Gly Val Gly Cys Gly Val Pro Ala

820 825 830

Ala Ala His Gly Leu Glu Pro Ala Ile Leu Met Asp Asp Pro Ala Asp

835 840 845

Leu Gly Ala Val Phe Ser Asp Val Met Tyr Gly Thr Pro Gly Gly Gly

850 855 860

Gly Val Pro Gly Gly Val Pro Gly Gly Gly Val Gly Leu Gly Leu Gly

865 870 875 880

Ala Gly Gln Val Pro Ser Gly Pro Ala Gly Ala Gly Gly Leu His Ser

885 890 895

His His His Gln His His His His Gln His His Leu Gly His Val Val

900 905 910

Pro Val Gly Gly Val Asp Pro Leu Ala Gly Asp Ala Ala Lys Met Ala

915 920 925

Met Asn Asp Asp Asp Phe Phe Asn Phe Leu Leu Lys Asn

930 935 940

<210> 12

<211> 523

<212> PRT

<213> 佐夫色绿藻

<400> 12

Met Asp Gly Phe Lys Leu Leu Glu Thr Val Gly Leu Glu Leu Asp Leu

1 5 10 15

Pro Val Ile Met Met Ser Ser Asn Gly Glu His Thr Thr Val Met Arg

20 25 30

Gly Val Thr His Gly Ala Cys Asp Phe Leu Ile Lys Pro Val Arg Ile

35 40 45

Glu Glu Leu Arg Asn Ile Trp Gln His Val Ile Arg Arg Thr Arg His

50 55 60

Pro Val Phe Arg Asp Leu Glu Pro Asp Asp His Glu Gly Gly Asp Tyr

65 70 75 80

Glu Ala Ser Lys Lys Arg Lys Asp Leu Tyr Arg Gly Glu Asn Ser Ser

85 90 95

Gly Ser Gly Gly Ala Gly Gly Leu Glu Arg Asp Asp Asp Gly Ser Ala

100 105 110

Ser Lys Lys Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe

115 120 125

Val Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys

130 135 140

Ile Leu Glu Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala

145 150 155 160

Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val Gln Gly Val

165 170 175

Gln Ala Pro Phe Gly Leu Pro Asn Ile Gln Leu Pro Arg Gln Thr Ser

180 185 190

Ser Lys Gly Ala Gly Ser Ser Ser Gln Gln Gln His His Gln Gln Gln

195 200 205

Gln His Gln Gln Gln His Gln His Gln His Gln Thr Ala Leu Gly Thr

210 215 220

Gly Gln Gln Gln Ser His Gln Leu Gln Pro Cys Pro Val Ser Thr Ala

225 230 235 240

Thr Pro Val Met Pro Ser Pro Asp Ala Met Val Ala Ala Ser Met Met

245 250 255

Ser Ser Gln Ala Met Ala Ala Met Ala Pro Gly Val Met Asn Pro Met

260 265 270

Thr Ala Met Asn Ser Met Met Ala Gly Leu Asn Pro Asn Met Met Gly

275 280 285

Met Ala Ala Gly Leu Gly Leu Ala Gly Leu Gly Ile Gly Gly Met Ala

290 295 300

Gly His Pro Val Pro Asn Pro Met Leu Ala Gly Met Gly Pro Met Gly

305 310 315 320

Leu Gly Leu Pro Pro Pro Pro Gly Met Pro Pro Pro Pro Pro Gly Met

325 330 335

Pro Pro Gly Met Pro Pro Gly Met Pro Pro Gly Met Pro Ala Met Met

340 345 350

Gln Gly Leu Ser Met Ala Gly Met Ser His Leu Ala Ala Ala Gly Met

355 360 365

Arg Pro Pro Pro Gly Ala Leu Gly Gly His Leu Gly Gly Pro Gly Leu

370 375 380

Ser Pro Phe Gly Pro Pro Pro Pro Pro Gly Ala Asp Pro Ala Asn Met

385 390 395 400

Met Ala Asn Met Ser Ser Met Met Ala Asn Met Gln Ala Ala Leu Ala

405 410 415

Phe Gln Ala Asp Ala Ala Ala Ala Ala Gln His Gln Ala Ala Ser Thr

420 425 430

Gly Ser Val Ala Pro Gly Arg Gln Gln Gln Val His Gln His Gln Gln

435 440 445

Ala Val Gly Met Ala Val Asp Asp Ala Ala Ala Phe Pro Ser Pro Gly

450 455 460

Cys Arg Pro Asn Gly Ser Ala Asp Ala Gly Ala Gln Ser Ala Ala Glu

465 470 475 480

Pro Asn Asp Phe Ser Arg Val Phe Asp Asp Pro Phe Ala Gln Pro Ala

485 490 495

Ala Ser Pro Ser Gly Ala Ala Ala Ala Gly Ser Asn Glu Ala Pro Gly

500 505 510

Met Asp Asp Phe Leu Asp Phe Phe Leu Lys Ser

515 520

<210> 13

<211> 834

<212> PRT

<213> 卡特氏团藻

<400> 13

Met Asp Gly Arg Ala Glu Gly Thr Val Ala Ile Lys Gln Glu Asp His

1 5 10 15

Ala Ser Gly His Trp His Asn Phe Pro Ala Gly Leu Arg Leu Leu Val

20 25 30

Val Asp Asp Asp Pro Leu Cys Leu Lys Val Val Glu Gln Met Leu Arg

35 40 45

Lys Cys Ser Tyr Asp Val Thr Thr Cys Thr Asn Ala Thr Met Ala Leu

50 55 60

Asn Leu Leu Arg Asp Lys Ser Thr Glu Tyr Asp Leu Val Leu Ser Asp

65 70 75 80

Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Val Val Gly

85 90 95

Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr

100 105 110

Ser Asn Val Leu Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile

115 120 125

Lys Pro Val Arg Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val

130 135 140

Arg Arg Arg Arg Gln Leu Asn Leu Asp Met Asp Ser Asp Glu His Ser

145 150 155 160

Gln Glu Arg Asp Asp Asp Gln Gly Arg Lys Arg Lys Ala Asp Thr Ala

165 170 175

Gly Cys Ile Gly Asp Gln Leu Arg Met Met Gly Ala Gly Cys Ser Gly

180 185 190

Gly Ala Asn Gly Leu Gly Ser Thr Gly Asn Leu Gly Ala Val Ala Thr

195 200 205

Gly Ser Ala Gly Leu Gly Leu Gly Leu Gly Thr Ala Ala Asp Glu Leu

210 215 220

Gly Leu Gly Leu Asp Asn Gly Ser Ser Lys Lys Ala Arg Val Val Trp

225 230 235 240

Ser Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly

245 250 255

Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Ile Met Asn Val Asp

260 265 270

Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu

275 280 285

Tyr Leu Lys Arg Val Ser Gly Ala Gln Gln Pro Gly Gln Asn Arg Val

290 295 300

Ser Arg Pro Ser Pro Pro Gln Pro Gln Ser Pro Gln Val Pro Ser Gln

305 310 315 320

Gln Gln Gln Ser Leu Pro Gly Gly Gly Gly Ala Ala Ala Ala Gly Ala

325 330 335

Gly Gln Leu Gln Gly Gly Gly Gly Ala Ala Ala Ala Ala Ala Ser Leu

340 345 350

Ala Ser Ile Leu Ala Gly Gly Gly Pro Ala Gly Gly Gly Ala Gly Ala

355 360 365

Gly Pro Pro Pro Gly Gly Gly Gln Leu Gly Ala Asp Gly Gly Gly Pro

370 375 380

Gly Pro Gly Leu Ser Ser Ala Val Ala Asn Ala Met Ser Ala Ala Ala

385 390 395 400

Ala Ala Gly Gly Phe Pro Thr Pro Pro Pro Pro Pro Pro Pro His Pro

405 410 415

Ala Ala Leu Leu Ala Ala Asn Pro Met Met Ala Ala Ala Ala Gly Leu

420 425 430

Asn Pro Leu Leu Gly Ala Met Gly Gly Leu Gly Val Gly Pro Leu Gly

435 440 445

Pro Leu Asn Pro Leu Asn Gly Met Pro Met Pro Gly Met Gln Pro Pro

450 455 460

Leu Gly Leu Leu Pro Gly Leu Pro Gly Pro Gly Gly Gln Leu Gly Leu

465 470 475 480

Gly Pro Leu Gly Pro Ile Gly Leu Pro Gly Pro Gly Pro Leu Pro Ser

485 490 495

Leu Pro Ala Gly Leu Pro Leu Asn Pro Met Ala Asn Gly Leu Gln Gln

500 505 510

Met Ala Ala Ala Asn Leu Met Gln Gly Met Ala Gly Met Gly Gln Leu

515 520 525

Pro Ala Leu Ser Met Asn Gly Met Asn Gly Ile Met Gly Pro Leu Pro

530 535 540

Gly Val Gly Leu Pro Gly Pro Gln Gln His Leu Phe Pro Gln Gln Gln

545 550 555 560

Gln Pro His Leu Gln Gln Gln Gln Gln Gln Gln Gln Gln Lys Asp Leu

565 570 575

Gln Met Ala Gln Lys Gln His Gln Ala Ala Ala Ala Ala Ala Ala Val

580 585 590

Ala Ala Ala Val Ala Ala Ala Gln His Gln Gln Gln Gln Pro Gln Ala

595 600 605

Gln Gln Gln Pro Gln Pro Gln Gln Gln Gln Gln Gln Pro Gly Lys Leu

610 615 620

Pro Gln Ala Thr Val Gly Thr Pro Ala Leu Ala Ser Pro Ala Gly Ala

625 630 635 640

Leu Pro Arg Gln Pro Ser Gly Gln His Pro His Thr Leu Ser Ser Ser

645 650 655

Ser Leu His Thr Gln Gln Pro His Gln Gln Gln Leu Leu His Ser Gln

660 665 670

Pro Ser Ser Thr His Leu Ala Thr Asn Asn Thr Leu Ala Met Ala Pro

675 680 685

Ala Leu Asn Gly Thr Leu Asp Val Gly Gly Lys Gly His Leu His Ala

690 695 700

Ala Gly Gly Gln Gly Ala Gly Ala Gly Ala Gly Ala Val Leu Asp Ile

705 710 715 720

Pro Pro Asp Leu Ile Gly Gly Leu Ile Glu Asp Gly Phe Gly Ala Pro

725 730 735

Pro Gly Pro Thr Ile Gln Leu Ala His Gly Thr Ala Ala Val Leu Asp

740 745 750

Pro Thr Met Leu Leu Asp Glu Gly Asp Asn Ser Asp Phe Ala Ala Val

755 760 765

Phe Gln Glu Met Ser Ser Tyr Gly Gly Gly Gly Val Ile Gly Gly Gly

770 775 780

Gly Ser Gly Ala Gly Ala Met Gly Val Leu Gly His Gly Leu Leu Ala

785 790 795 800

Ala Gly Gly Pro Val Met Val Asp Val Ala Ala Gly Leu Ala Gly Val

805 810 815

Thr Glu Thr Ala Thr Arg Val Asp Asp Asp Phe Leu Asn Phe Leu Leu

820 825 830

Lys Ser

<210> 14

<211> 446

<212> PRT

<213> 四爿藻属

<400> 14

Met Ser Cys Thr Val Ala Ser Phe Pro Pro Ala Ala Gly Gly Gln Gly

1 5 10 15

Ser Pro Ala Thr Pro Val Pro Tyr Gln Asp Leu Leu Val Lys Arg Gln

20 25 30

Asp Gln Trp Ser Asn Phe Pro Ala Gly Leu Arg Val Leu Val Ala Asp

35 40 45

Asn Asp Pro Ala Ser Leu Gln Gln Val Glu Lys Met Leu Lys Lys Cys

50 55 60

Ser Tyr Gln Val Thr Leu Cys Ser Ser Gly Lys Asn Ser Leu Glu Ile

65 70 75 80

Leu Arg Lys Arg Arg Glu Glu Phe Asp Leu Val Leu Ala Asp Ala Asn

85 90 95

Leu Pro Asp Ile Asp Gly Phe Lys Leu Leu His Val Cys His Thr Glu

100 105 110

Leu Ser Leu Pro Val Val Leu Met Ser Gly Thr Ser Asp Thr Gln Leu

115 120 125

Val Met Arg Gly Val Met Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro

130 135 140

Leu Arg Val Glu Glu Leu Lys Val Leu Trp Gln His Leu Val Arg Phe

145 150 155 160

Thr Ser Glu Ile Thr Lys Thr Asp Ala Gln Leu Asn Val Val Lys Val

165 170 175

Glu Leu Asp Gly Gly Arg Pro Ala Gly Glu Val Ser Thr Ser Gln Asn

180 185 190

Gly Ser Gln Cys Thr Glu Arg Glu Gly Glu Gly Asn Ser Ser Lys Lys

195 200 205

Gln Arg Met Asn Trp Ser Asp Glu Met His Gln Gln Phe Val Asn Ala

210 215 220

Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp

225 230 235 240

Leu Met Ser Val Glu Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu

245 250 255

Gln Lys Tyr Arg Ile Tyr Leu Lys Arg Met Ala Asn His Gln Glu Asn

260 265 270

Gly Lys Gln Ala Val Met Ser Thr Asp Thr Ile Ala Arg Ala Glu Ala

275 280 285

Ala Tyr Gln Gly Gly Met Pro Gln Gly Gln Gln Met Met Gln Gln Glu

290 295 300

His Ser Gly Gln Ala Val Gln Tyr Ser Gln Pro His Ala Pro Gly Gly

305 310 315 320

Leu His Gln Gln Ala Met Pro Ala Gln Met His Met Gly Met Met Pro

325 330 335

Ala Gly Pro Gln Pro Gly Ser Met Gln Met Ala Pro His His Val Met

340 345 350

Gln Met Pro Asn Gly Gln Val Met Val Met Gln Gln Met Gly Pro Arg

355 360 365

Pro Gly Met Pro Pro Gly Met Pro Gln Gln Met Met Ala Ser Ser Gln

370 375 380

Gln Met Gly Met Leu Gln Pro Gly Met Pro Ala Gly Gln Met Leu His

385 390 395 400

Phe Gln His Pro Gln Gln Val His Gln His Pro Pro Ser Ser Gly Pro

405 410 415

Met His Ala Val Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln

420 425 430

Met Ala Gly Trp Pro Val Gln Gly Gln Pro Gly Asn Gln Ala

435 440 445

<210> 15

<211> 490

<212> PRT

<213> 四爿藻属

<400> 15

Met Thr Pro Thr Pro Pro Met Ser Cys Thr Val Ala Ser Phe Pro Pro

1 5 10 15

Ala Ala Gly Gly Gln Gly Ser Pro Ala Thr Pro Val Pro Tyr Gln Asp

20 25 30

Leu Leu Val Lys Arg Gln Asp Gln Trp Ser Asn Phe Pro Ala Gly Leu

35 40 45

Arg Val Leu Val Ala Asp Asn Asp Pro Ala Ser Leu Gln Gln Val Glu

50 55 60

Lys Met Leu Lys Lys Cys Ser Tyr Gln Val Thr Leu Cys Ser Ser Gly

65 70 75 80

Lys Asn Ser Leu Glu Ile Leu Arg Lys Arg Arg Glu Glu Phe Asp Leu

85 90 95

Val Leu Ala Asp Ala Asn Leu Pro Asp Ile Asp Gly Phe Lys Leu Leu

100 105 110

His Val Cys His Thr Glu Leu Ser Leu Pro Val Val Leu Met Ser Gly

115 120 125

Thr Ser Asp Thr Gln Leu Val Met Arg Gly Val Met Asp Gly Ala Arg

130 135 140

Asp Phe Leu Ile Lys Pro Leu Arg Val Glu Glu Leu Lys Val Leu Trp

145 150 155 160

Gln His Leu Val Arg Phe Thr Ser Glu Ile Thr Lys Thr Asp Ala Gln

165 170 175

Leu Asn Val Val Lys Val Glu Leu Asp Gly Gly Arg Pro Ala Gly Glu

180 185 190

Val Ser Thr Ser Gln Asn Gly Ser Gln Cys Thr Glu Arg Glu Gly Glu

195 200 205

Gly Asn Ser Ser Lys Lys Gln Arg Met Asn Trp Ser Asp Glu Met His

210 215 220

Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val

225 230 235 240

Pro Lys Arg Ile Leu Asp Leu Met Ser Val Glu Gly Leu Thr Arg Glu

245 250 255

Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr Leu Lys Arg Met

260 265 270

Ala Asn His Gln Glu Asn Gly Lys Gln Ala Val Met Ser Thr Asp Thr

275 280 285

Ile Ala Arg Ala Glu Ala Ala Tyr Gln Gly Gly Met Pro Gln Gly Gln

290 295 300

Gln Met Met Gln Gln Glu His Ser Gly Gln Ala Val Gln Tyr Ser Gln

305 310 315 320

Pro His Ala Pro Gly Gly Leu His Gln Gln Ala Met Pro Ala Gln Met

325 330 335

His Met Gly Met Met Pro Ala Gly Pro Gln Pro Gly Ser Met Gln Met

340 345 350

Ala Pro His His Val Met Gln Met Pro Asn Gly Gln Val Met Val Met

355 360 365

Gln Gln Met Gly Pro Arg Pro Gly Met Pro Pro Gly Met Pro Gln Gln

370 375 380

Met Met Ala Ser Ser Gln Gln Met Gly Met Leu Gln Pro Gly Met Pro

385 390 395 400

Ala Gly Gln Met Leu His Phe Gln His Pro Gln Gln Val His Gln His

405 410 415

Pro Pro Ser Ser Gly Pro Met His Ala Gly Gly Glu Met Ile Asp Pro

420 425 430

Gly Ser Met Gln Arg Leu His Gln Gln Pro His Tyr Ile Gly Pro Asn

435 440 445

Gly Gln His Met Pro Ala Pro Ala Met Gly Met Pro Ser Gly Thr Val

450 455 460

Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln Met Ala Gly Trp

465 470 475 480

Pro Val Gln Gly Gln Pro Gly Asn Gln Ala

485 490

<210> 16

<211> 574

<212> PRT

<213> 四爿藻属

<400> 16

Met Thr Met Pro Leu Gly Gly Gly Leu Cys Met Lys Asp Arg Ile His

1 5 10 15

Gly Asp Glu Arg Tyr Arg Ser Lys Ala Lys Arg Gln Val Asn Thr Ile

20 25 30

Phe Ala Phe Thr Gln Arg Asn Thr Trp Arg Gly Arg Phe Arg Leu Cys

35 40 45

Ser Tyr Arg Thr Thr Glu Leu Leu Gly Gly Ser Lys Thr Thr Glu Pro

50 55 60

Gly Arg Gly Thr Phe Val Leu Gln Ile Phe Met Cys Val Lys Asn Ala

65 70 75 80

Ser Ile Asp Asp Gly Ser Arg His Ile Ser Thr Ser Arg Gly Leu Glu

85 90 95

Ser Val Leu Lys Arg Arg Gly Gly Gln Gly Ala Pro Ala Ala Pro Val

100 105 110

Pro Tyr His Asp Leu Leu Val Lys Arg Gln Asp Gln Trp Ser Asn Phe

115 120 125

Pro Ala Gly Leu Arg Val Leu Val Ala Asp Asn Asp Pro Ala Ser Leu

130 135 140

Gln Gln Val Glu Lys Met Leu Lys Lys Cys Ser Tyr Gln Val Thr Leu

145 150 155 160

Cys Ser Ser Gly Lys Asn Ser Leu Glu Ile Leu Arg Lys Arg Arg Glu

165 170 175

Glu Phe Asp Leu Val Leu Ala Asp Ala Asn Leu Pro Asp Ile Asp Gly

180 185 190

Phe Lys Leu Leu His Val Cys His Thr Glu Leu Ser Leu Pro Val Val

195 200 205

Leu Met Ser Gly Thr Ser Asp Thr Gln Leu Val Met Arg Gly Val Met

210 215 220

Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro Leu Arg Val Glu Glu Leu

225 230 235 240

Lys Val Leu Trp Gln His Leu Val Arg Phe Thr Ser Glu Ile Thr Lys

245 250 255

Thr Asp Ala Gln Leu Asn Val Val Lys Val Glu Leu Asp Ser Gly Arg

260 265 270

Pro Ala Gly Glu Val Ser Thr Ser Gln Asn Gly Ser Gln Cys Ala Glu

275 280 285

Arg Glu Gly Glu Gly Asn Ser Ser Lys Lys Gln Arg Met Asn Trp Ser

290 295 300

Asp Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly Ile

305 310 315 320

Asp Lys Ala Val Pro Lys Arg Ile Leu Asp Leu Met Ser Val Glu Gly

325 330 335

Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr

340 345 350

Leu Lys Arg Met Ala Asn His Gln Glu Asn Gly Lys Gln Ala Val Met

355 360 365

Ser Thr Asp Thr Ile Ala Arg Ala Glu Ala Ala Tyr Gln Gly Gly Met

370 375 380

Pro Gln Gly Gln Gln Met Met Gln Gln Glu His Ser Gly Gln Ala Val

385 390 395 400

Gln Tyr Ser Gln Pro His Ala Pro Ser Gly Leu His Gln Gln Ala Met

405 410 415

Pro Ala Gln Met His Met Gly Met Met Pro Ala Gly Pro Gln Pro Gly

420 425 430

Ser Met Gln Met Ala Pro His His Val Met Gln Met Pro Asn Gly Gln

435 440 445

Val Met Val Met Gln Gln Met Gly Pro Arg Pro Gly Met Pro Pro Gly

450 455 460

Met Pro Gln Gln Met Met Ala Ser Ser Gln Gln Met Gly Met Leu Gln

465 470 475 480

Pro Gly Met Pro Ala Gly Gln Met Leu His Phe Gln His Pro Gln Gln

485 490 495

Val His Gln His Pro Pro Ser Ser Gly Pro Met His Ala Gly Gly Glu

500 505 510

Met Ile Asp Pro Gly Ser Met Gln Arg Leu His Gln Gln Pro His Tyr

515 520 525

Ile Val Pro Asn Ala Gln His Met Pro Ala Pro Ala Met Gly Met Pro

530 535 540

Pro Gly Ala Val Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln

545 550 555 560

Met Ala Gly Trp Pro Val Gln Gly Gln Pro Gly Ser Gln Ala

565 570

<210> 17

<211> 674

<212> PRT

<213> 卵囊藻属

<400> 17

Met Leu Ala Phe Thr His Gln Arg Met Thr Thr Ala Pro Ala Leu Ala

1 5 10 15

Val Ala Thr Ser His Phe Phe Ala His Val Arg Val Thr Thr Gly Ser

20 25 30

Ser Ala Ile Ala Thr Val Phe Ala Ala Arg Ser Arg Gly Ser Gly Leu

35 40 45

Leu Ala Gly Phe Asn Thr Met Glu Asn Val Lys Val Glu Val Pro Glu

50 55 60

Val Val Pro Glu Asn Val Asn Phe Pro Ala Gly Leu Lys Val Leu Val

65 70 75 80

Val Asp Asp Asp Pro Leu Cys Leu Lys Val Ile Asp Gln Met Leu Arg

85 90 95

Arg Cys Asn Tyr Ala Ala Thr Thr Cys Gln Ser Ser Leu Glu Ala Leu

100 105 110

Glu Leu Leu Arg Ser Ser Lys Glu Asn His Phe Asp Leu Val Leu Ser

115 120 125

Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Ile Ile

130 135 140

Gly Leu Glu Met Gly Leu Pro Val Ile Met Met Ser Ser Asn Gly Glu

145 150 155 160

Thr Gly Val Val Phe Arg Gly Val Thr His Gly Ala Val Asp Phe Leu

165 170 175

Ile Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Leu Trp Gln His Val

180 185 190

Val Arg Lys Thr Met Val Val Pro Ser Asn Asp Lys Ala Thr Ser Glu

195 200 205

Glu Asp Gly Glu Glu Ser Lys His Arg Val Asp Arg Lys Arg Lys Glu

210 215 220

Ser Phe His Ser Arg Ala Arg Glu Gln Val Glu Ile Ala Cys Ser Val

225 230 235 240

Val Pro Ala Leu Leu Trp Pro Thr Val Pro Pro Ser Ser Val His Pro

245 250 255

Thr Ser Ser Ser Phe Leu Arg Ser His Val Leu Leu Leu Gln Arg Ser

260 265 270

Ser Gly Gly Lys Asp Val Leu Asp Glu Gly Gly Ser Asn Ala Lys Lys

275 280 285

Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val Asn Ala

290 295 300

Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp

305 310 315 320

Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu

325 330 335

Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val Ala Gly Ile Asn Thr Ala

340 345 350

Thr Gly Ser Arg Asn Gly Lys Gly Arg Ser Asp Val Ser Gly Leu Ser

355 360 365

Gly Met Pro Asn Gly Ser Leu Pro Met Pro Gly Met Met Pro Pro His

370 375 380

Met Ala Ala Gly Met Leu Leu Ala Gly Met Ala Ala Asp Val Gly Pro

385 390 395 400

Arg Pro His Pro Phe Pro Ile Met Pro Met Pro Ala Met Ala Leu Gln

405 410 415

Gly Met His Gly Gly Met Ala Gln Met Met Gln Leu Pro Pro Gly Met

420 425 430

Pro Pro Pro Met Met Met Pro Met Ala Pro Leu Leu Pro Ser Gln Leu

435 440 445

Ala Ala Leu Gly Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Val Ala

450 455 460

Arg Ser Glu Ser Met Pro Ser Glu Asn Gly Val Ala Gly Pro Ser Gly

465 470 475 480

Ser Phe Thr Ala Met Leu Asn Gly Pro Ala Pro Met Glu Ser Ser Pro

485 490 495

Phe Ala Ala Leu Gln Val Phe Gly Pro Pro Gln Gly Met Glu Gln Leu

500 505 510

Thr Gln Gln Gln Gln Gln Gln Gln Gln Ala Gly Ala Ala Ala Phe Val

515 520 525

Ala Ala Phe Ala Ala Ala Asn Gly Gly Asp Met Gln Gly Gly Gly Gly

530 535 540

Gly Pro Gly Pro Met Leu Gly Gly Ala Gly Gly Ala Gly Pro Leu Leu

545 550 555 560

Gly Gly Val Gly Gly Gly Asp Pro Leu His Gly Gly Gly Gly Ser Ser

565 570 575

Ala Leu Gly Gly Arg Pro Met Met Ser Ala Glu Gln Pro Met Gly Gly

580 585 590

Ser Gly Gly Leu Ala Ser Asn Ser Leu Thr Val Gln Gln Asn Asp Leu

595 600 605

Ala Gln Met Cys Ser Gln Leu Asp Val Asn Gly Leu Gln Ala Val Ala

610 615 620

Ala Ala Ala Ala Ala Gly Ala Met Gly Ala Pro Gly Gly Ala Gly Gly

625 630 635 640

Ala Met Pro Pro Ser Ser Val Gly Gly Val Gly Pro Asp Met Lys Leu

645 650 655

Thr Glu Gln Asp Asp Phe Phe Ser Phe Leu Leu Lys Asp Ser Asn Leu

660 665 670

Ile Asp

<210> 18

<211> 488

<212> PRT

<213> 微胞藻属

<400> 18

Met Ser Thr Pro Ala Val Ser Lys Gly Phe Pro Ile Gly Leu Arg Val

1 5 10 15

Leu Val Val Asp Asp Asp Pro Leu Cys Leu Lys Ile Val Glu Lys Met

20 25 30

Leu Lys Arg Cys Gln Tyr Glu Val Thr Thr Phe Ser Arg Gly Ala Glu

35 40 45

Ala Leu Lys Thr Leu Arg Glu Arg Lys Asp Asp Phe Asp Ile Val Leu

50 55 60

Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His

65 70 75 80

Ile Ala Leu Glu Leu Asp Ile Pro Val Met Met Met Ser Ala Asn Cys

85 90 95

Ala Thr Asp Val Val Leu Arg Gly Ile Ile His Gly Ala Val Asp Tyr

100 105 110

Leu Leu Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Ile Trp Gln His

115 120 125

Val Val Arg Arg Lys Arg Glu Ser Ser Gln Gly Asn Leu Arg Ser Gly

130 135 140

Glu Gly Gly Ser Asn Gly Arg Thr Val Ser Gly Gly Ser Thr Gly Glu

145 150 155 160

Gly Gly Gly Lys Asp Ser Lys Gly Ser Ser Glu Gln His Gly Asp Ala

165 170 175

Lys Asp Lys Thr Gly Ser Ala Gly Gly Ser Gly Gly Ser Ser Lys Arg

180 185 190

Lys Lys Gly Ser Gly Lys Lys Gly Asp Glu Gly Thr Asp Glu Val Lys

195 200 205

Asp Gly Ser Gly Gly Asp Glu Asn Glu Asp Ser Ser Ala Leu Lys Lys

210 215 220

Pro Arg Val Val Trp Ser Ala Glu Leu His Gln Gln Phe Val Thr Ala

225 230 235 240

Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp

245 250 255

Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu

260 265 270

Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln Gly Val Asn Ser Gly

275 280 285

Gly Ala Pro Gly Gly Gly Pro Gly Phe Met Ser Pro Ile Ala Leu Asp

290 295 300

Gly Ser Met Val Gln Gly Gly Pro Gly Gly Arg Val Gly Ser Pro Ala

305 310 315 320

Ile Gly Gly Pro Asn Gly Pro Ile Met Val Gly His Gly His Ile Asp

325 330 335

Pro Ala Met Leu Ala Gly Gly Ala Pro Gln Thr Ile Gln Met Gly Met

340 345 350

Val Tyr Gly Gly Pro Gly Met Gly Pro Pro Gln Met Met Ala Pro Asn

355 360 365

Gly Lys Gly Gly Gly Gly Met Pro Gly Gly Tyr Val Met Gln Pro Gly

370 375 380

Gln Met Met Ala Pro Asn Gly Gln Met Met Pro Val Gly Gln Met Gly

385 390 395 400

Pro Gly Gly Met Met Val Gln Gly Pro Gly Gly Gly Met Met Gln Met

405 410 415

His Asp Gly Gly Met Met Asn Gly Asn Gly Ser Tyr Gly Ser Leu Gln

420 425 430

Asn Met Lys Gln Gly Asn Gly Val Val Met Met Pro Asn Gly Gly Met

435 440 445

Gly Gly Val Asp Gly Ala Ile Pro Asn Met Ala Thr Gly Leu Ile Asn

450 455 460

Gly Gln Gly Leu Pro Asp Asp Asp Val Leu Asp Met Phe Leu Lys Asp

465 470 475 480

Gly Leu Pro Glu Gly Glu Gly Phe

485

<210> 19

<211> 544

<212> PRT

<213> 细小微胞藻

<400> 19

Met Thr Ala Glu Lys Lys Glu Leu Lys Val Phe Pro Ala Gly Leu Arg

1 5 10 15

Val Leu Val Val Asp Asp Asp Pro Leu Cys Leu Arg Ile Val Glu Lys

20 25 30

Met Leu Lys Arg Cys Gln Tyr Glu Val Thr Thr Phe Ser Arg Gly Ala

35 40 45

Glu Ala Leu Glu Thr Leu Arg Ala Arg Arg Asp Asp Phe Asp Ile Val

50 55 60

Leu Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu

65 70 75 80

His Ile Ala Leu Glu Leu Asp Val Pro Val Met Met Met Ser Ala Asn

85 90 95

Cys Ala Thr Asp Val Val Leu Arg Gly Ile Ile His Gly Ala Val Asp

100 105 110

Tyr Leu Leu Lys Pro Val Arg Leu Glu Glu Leu Arg Asn Ile Trp Gln

115 120 125

His Val Val Arg Arg Gln Arg Glu Pro Ser Lys Asp Gly Ala Ala Gly

130 135 140

Lys Gly Gly Gly Ala Ser Gly Ala Pro Glu Val Ser Gly Asp Thr His

145 150 155 160

Ala Asn Thr Asp Asp Lys Gln Asp Gly Asn Ala Thr Asp Ser Lys Gly

165 170 175

Ser Gly Ser Gln Lys Arg Lys Ser Gly Lys Ser Gly Asp Asp Gly Gly

180 185 190

Lys Asp Gly Gly Gly Ser Gly Gly Lys Asp Gly Asp Ala Ser Asn Lys

195 200 205

Gly Asn Asn Asn Lys Arg Lys Lys Gly Lys Ser Asn Asp Ala Thr Glu

210 215 220

Thr Ala Gly Gly Ala Gly Val Glu Asp Asn Asp Asp Thr Ser Gly Leu

225 230 235 240

Lys Lys Pro Arg Val Val Trp Ser Pro Glu Leu His Gln Gln Phe Val

245 250 255

Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile

260 265 270

Leu Asp Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser

275 280 285

His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln Gly Val Asn

290 295 300

Asn Asn Gly Thr Val Pro Ser Gly Ala Ala Gly Phe Met Thr Gly Leu

305 310 315 320

Ala Ile Asp Gly Val Gly Gly Val Met Gly Pro Pro Thr Thr Gly Ser

325 330 335

Pro Ala Met Asn Gly Pro Gly Gly Pro Gly Gly Gly Leu Val Met Gly

340 345 350

Pro Gly His Met Gly Gly Pro His Met Asp Gly Ser Gly Met Met His

355 360 365

Met Gly Pro Gly Gly Pro Met Ala Gly Met Thr Val Val Tyr Gly Gly

370 375 380

Gly Met Pro Gly Gly Met Pro Gly Gly Ala Asp Ser Lys Asn Gly Ala

385 390 395 400

Ser Gly Gln Pro Pro Pro Gly Gly Tyr Val Val Met Gly Gly Pro His

405 410 415

Gly Gly Gly Pro Gly Gly Ala Pro Met Met Met Gln His Gly Gly Met

420 425 430

Val Pro Gly Pro Gly Pro Gly Leu Val Pro Gly Pro Gly Gly Ser Leu

435 440 445

Met Met Pro Ala Gly Met Met Pro Asp Gly Gly Gly Gly Met Val Gly

450 455 460

Val His Val Gly Pro Gly Val Val Met Gly Gln His Gln Leu Gly Gly

465 470 475 480

Lys His Ser Ser Gly Gly Ala Gly Met Ala Gly Gly Ser Ala Ala Gly

485 490 495

Lys Gly Ala Gln Arg Gly Gly Val Gly Gly Ala Phe Asp Val Pro Pro

500 505 510

Thr Asn Gly Ser Leu Asp Ala Asp Glu Ile Gly Asp Asp Val Leu Thr

515 520 525

Met Phe Leu Lys Asp Gly Leu Pro Glu Met Asn Asp Gly Asp Ala Leu

530 535 540

<210> 20

<211> 776

<212> PRT

<213> 伪泥炭藓

<400> 20

Met Ser Gly Gly Asp Leu Ser Arg Val Arg Glu Gly Thr Ala Asp Leu

1 5 10 15

Asp Pro Val Met Ala Ser His Gln His Pro Pro Pro Arg Gln Gln Ser

20 25 30

His Gln Gln Pro Lys Asn His Gln Gln Glu Ala His Gln Gln His Cys

35 40 45

Ser Ser Ala Glu Thr Thr Ser Pro Asn Asn Thr Ala Arg Gly Ala Gly

50 55 60

Ala Thr Tyr Gly Lys Met Glu Pro Ala Asp Asp Phe Pro Ala Gly Leu

65 70 75 80

Arg Ile Leu Val Val Asp Asp Asp Pro Thr Cys Leu Ala Ile Leu Lys

85 90 95

Lys Met Leu Gln Gln Cys Ser Tyr Gln Val Thr Thr Cys Gly Arg Ala

100 105 110

Thr Arg Ala Leu Glu Leu Leu Arg Glu Asp Lys Asp Lys Phe Asp Leu

115 120 125

Val Ile Ser Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu

130 135 140

Glu Leu Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Gly

145 150 155 160

Asn Gly Glu Thr Ser Val Val Met Lys Gly Ile Thr His Gly Ala Cys

165 170 175

Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu Leu Ser Asn Ile Trp

180 185 190

Gln His Val Val Arg Lys Leu Arg Ser Glu Pro Lys Glu His Ser Ala

195 200 205

Ser Leu Glu Asp Gly Asp Arg Gln Arg Arg Gly Gly Ala Glu Asp Ala

210 215 220

Asp Asn Thr Ser Ser Ala Ala Asp Thr Ala Asp Gly Ile Trp Arg Asn

225 230 235 240

Lys Lys Lys Lys Glu Ala Lys Glu Asp Glu Glu Asp Phe Glu Gln Asp

245 250 255

Asn Asp Asp Pro Ser Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val

260 265 270

Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln Leu Gly Ile Asp

275 280 285

Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Ser Val Gln Gly Leu

290 295 300

Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu

305 310 315 320

Lys Arg Leu Ser Gly Val Thr Ser Gln Ser Asn Ser Leu Asn Val Ser

325 330 335

Phe Gly Gly Pro Asp Ala Gly Tyr Gly Gly Leu Phe Gly Leu Asp Glu

340 345 350

Met Ser Asp Tyr Arg Asn Leu Val Thr Asn Gly His Leu Pro Ala Gln

355 360 365

Thr Ile Ala Ala Leu His His Ala Asn Met Ala Gly Arg Leu Gly Ala

370 375 380

Ser Ser Gly Met Val Gly Pro Ser Ser Pro Leu Asp Pro Ser Val Leu

385 390 395 400

Ala Gln Ile Ala Ala Leu Gln Ser Gly Ser Leu Pro Arg Pro Gly Met

405 410 415

Asp Gly Ser Leu Gln Gly Asn Gln Ala Gly Leu Leu Gln Ser Leu Ser

420 425 430

Gly Ala Leu Asp Tyr Asn Ser Leu His Gln Ser His Leu Leu Pro Ala

435 440 445

Ile Gly Gln Leu Gly Gln Leu Asp Glu Leu Pro Ser Leu Lys Ser Met

450 455 460

Gln His Gln Leu Gly Met Gly Ser Leu Gly Gly Ser Thr Arg Asn Leu

465 470 475 480

Ala Gly Ser Pro Asn Glu Glu Leu Thr Met Gln Leu Leu Gln Gln Arg

485 490 495

Ala Gln Gln Gln Ser Gly Gly Ser Pro Ile Asn Leu Pro Gln Ala Thr

500 505 510

Gly Ile Leu Arg Pro Leu Ser Ser Asn Ile Asn Gln Gly Gly Ser Val

515 520 525

Pro Asn Leu Val Gly Val Ile Pro Gly Thr Ala Ile Gly Leu Ser Asn

530 535 540

Met Cys Ser Gly Gly Arg Glu Phe Gly Ser Ser Ser Gly Leu Leu Ser

545 550 555 560

Ala Ser Gly Ser Leu Met Gln Ser Ser Thr Val Glu Ala Gln Asn Leu

565 570 575

Asn Phe Gly Gly Ser Ser Gly Ser Ser Gly Cys Ser Phe Gln Ala Ser

580 585 590

Val Leu Ser Ser Lys Thr Gly Gly Leu Glu Asp Leu Asn Pro Ala Lys

595 600 605

Arg Val Arg Thr Thr Tyr Ser Ala Leu Ser His Ser Ser Pro Asp Leu

610 615 620

Gly Gln Ser Ser Arg Pro Ala Trp Leu Gly Ser Gln Glu Gly Leu Val

625 630 635 640

His Gly Asp Pro Val Tyr Ser Pro His Gln Leu Ser Leu Pro Arg Gln

645 650 655

Asp Ile Val Gly Gly Ile Gly Ser Ser Gly Arg Pro Ala Tyr Met Gly

660 665 670

Ser Gln Ser Met Gly Ser Leu Gly Met Asn Phe Pro Leu Ser Leu Ala

675 680 685

Val Asp Ala Gly Ala Val Arg Pro Ser Leu Thr Arg Gly Gln Ser Leu

690 695 700

Thr Glu Gln Val Ala Ala Asn Arg Glu Leu Lys Phe Pro Lys Glu Glu

705 710 715 720

Arg Gly Arg Asp Asn Leu Met Cys Ala Arg Leu Gly Gly Gly Met Ile

725 730 735

Thr Asn Glu Ser Ser Ser Glu Glu Leu Leu Asn Tyr Leu Lys Gln Ser

740 745 750

His Glu Gly Leu Gly Phe Met Glu Gly Asp Leu Val Ser Asp Gly Tyr

755 760 765

Pro Val Asp Asn Leu Tyr Val Lys

770 775

<210> 21

<211> 715

<212> PRT

<213> 小立碗藓

<400> 21

Met Gly Gly Gly Tyr Leu Ser Ser Thr Val Asn Met Gly Glu Ser Arg

1 5 10 15

Asp Gly Gly Ser Pro Ala Met Ala Thr Leu Gln Gln Gln Gln Lys His

20 25 30

Gln Pro Leu Asn Pro Asn His Gln Asn Pro Arg Asn Arg Ser Asn Ser

35 40 45

Ser Pro Thr Asn Cys Tyr Ser Asn Thr Ala Trp Gly Ala Lys Pro Ala

50 55 60

Lys Leu Asp Thr Pro Asp Glu Phe Pro Val Gly Met Arg Val Leu Val

65 70 75 80

Val Asp Asp Asn Pro Thr Cys Leu Met Ile Leu Glu Gln Met Leu Val

85 90 95

Arg Cys Ala Tyr Arg Val Thr Thr Cys Gly Lys Ala Thr Glu Ala Leu

100 105 110

Ser Met Leu Arg Glu Asp Ile Gly Lys Phe Asp Val Val Ile Ser Asp

115 120 125

Val Asp Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly

130 135 140

Leu Glu Met Asp Leu Pro Val Ile Met Val Ser Gly Asn Gly Glu Thr

145 150 155 160

Ser Ala Val Met Lys Gly Ile Thr His Gly Ala Cys Asp Tyr Leu Leu

165 170 175

Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Ile Trp Gln His Val Val

180 185 190

Arg Lys Lys Arg Arg Glu Val Lys Ala Val Ala Thr Lys Ser Val Glu

195 200 205

Glu Ala Gly Gly Cys Glu Arg Pro Lys Arg Gly Gly Gly Ala Asp Asp

210 215 220

Ala Asp Tyr Thr Ser Ser Ala Thr Asp Thr Thr Asp Ser Asn Trp Lys

225 230 235 240

Leu Thr Lys Arg Arg Lys Gly Glu Phe Lys Asp Glu Asn Glu Glu Asp

245 250 255

Asn Glu Gln Glu Asn Asp Asp Pro Ser Thr Leu Lys Arg Pro Arg Val

260 265 270

Val Trp Ser Val Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln

275 280 285

Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Gly

290 295 300

Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr

305 310 315 320

Arg Leu Tyr Leu Lys Arg Leu Ser Gly Val Thr Ser Gln Gln Gly Asn

325 330 335

Met Ser Ala His Phe Gly Gly Ser Asp Pro Phe Cys Met Met Pro Pro

340 345 350

Asp Met Ser Leu Ala Asn Gly Gln Leu Thr Pro Gln Ala Leu Ala Lys

355 360 365

Phe His Met Leu Gly Arg Met Asn Ala Thr Asn Gly Ile Gly Phe Ser

370 375 380

Gly Gly Gly Leu Asp Pro Gly Met Asn Gln Met Phe Leu Gln Asp Leu

385 390 395 400

Pro Arg Pro Pro Gln Leu Asn Ser Met Leu Arg Asn Asn Thr Gly Leu

405 410 415

Leu Ala Ser Val Pro Asn Gly Leu Gln His Leu Glu Gln Leu Ser Glu

420 425 430

Pro His His Val His Val Val Asn Glu Leu Glu His Tyr Pro Ser Asn

435 440 445

Thr Lys Val Tyr Pro Gln Leu Asn Gly Asn Leu Asp Val Ser Val Gly

450 455 460

Pro Leu Gly Ala Ala Asn Gly Asn Leu Ala Ser Asn Pro Asn Ser Asp

465 470 475 480

Thr Leu Leu Met His Ile Leu His Ser Arg Ala Ser Gln Gln Gly Val

485 490 495

Gly Ser Pro Ser Thr Leu Pro Gln Pro Arg Cys Gly Leu Asn Pro Thr

500 505 510

His Leu Leu Ser Asn Asp Ile Asn Phe Ala Pro Val Gly Ser Leu Pro

515 520 525

Asn Leu Ala Gly Ser Leu Gly Pro Ala Val Gly Leu Ser Ala Ile Pro

530 535 540

Gly Ser Ala Gly Gly Arg Asp Leu Ser Pro Ser Val Gly Gly Ser Gly

545 550 555 560

Ala Ser Leu Ser Ser Pro Leu Gly Ser Leu Val Arg Arg Pro Leu Met

565 570 575

Ala Glu Glu Gln Ser Asn Pro Val Asn Ser Thr Asn Gly Thr Tyr Ser

580 585 590

Met Ala His Ser Gly Gln Ser Pro Lys Pro Ser Gly Asp Thr Leu Pro

595 600 605

Thr Pro Leu Asn Glu Gly Leu Glu Gln Gln Gln Pro Leu Trp Ala Leu

610 615 620

Tyr Gln Asn Pro Met Asn Gln Leu Ser His Gly Pro Ser Gln Gly Phe

625 630 635 640

Pro His Asp Ser Leu Gln Trp Ser Val Leu Thr Glu Asn Leu Ser Phe

645 650 655

Gly Asp Met Gly Gln Ser Leu Ser Ala Gly Leu Ile Ser Gln Phe Ser

660 665 670

Ser Gln Gly Gln Asp Asn Gly Ile Gly Phe Ala Pro Pro Ser Gln Arg

675 680 685

Gly Ser Tyr Thr Arg Gln Ser Val Ser Phe Pro Ala Ser Ser Ala Leu

690 695 700

Asp Gly Arg Met Val Arg Ser Ser Tyr Glu Pro

705 710 715

<210> 22

<211> 664

<212> PRT

<213> 拟南芥

<400> 22

Met Val Asn Pro Gly His Gly Arg Gly Pro Asp Ser Gly Thr Ala Ala

1 5 10 15

Gly Gly Ser Asn Ser Asp Pro Phe Pro Ala Asn Leu Arg Val Leu Val

20 25 30

Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu Glu Arg Met Leu Met

35 40 45

Thr Cys Leu Tyr Arg Val Thr Lys Cys Asn Arg Ala Glu Ser Ala Leu

50 55 60

Ser Leu Leu Arg Lys Asn Lys Asn Gly Phe Asp Ile Val Ile Ser Asp

65 70 75 80

Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly

85 90 95

Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Asp Ser Lys

100 105 110

Ser Val Val Leu Lys Gly Val Thr His Gly Ala Val Asp Tyr Leu Ile

115 120 125

Lys Pro Val Arg Ile Glu Ala Leu Lys Asn Ile Trp Gln His Val Val

130 135 140

Arg Lys Lys Arg Asn Glu Trp Asn Val Ser Glu His Ser Gly Gly Ser

145 150 155 160

Ile Glu Asp Thr Gly Gly Asp Arg Asp Arg Gln Gln Gln His Arg Glu

165 170 175

Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu Gly Asn Gly Arg Ser

180 185 190

Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp Gln Gly Asp Asp Lys

195 200 205

Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu

210 215 220

Leu His Gln Gln Phe Val Ala Ala Val Asn Gln Leu Gly Val Asp Lys

225 230 235 240

Ala Val Pro Lys Lys Ile Leu Glu Met Met Asn Val Pro Gly Leu Thr

245 250 255

Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr Leu Arg

260 265 270

Arg Leu Gly Gly Val Ser Gln His Gln Gly Asn Met Asn His Ser Phe

275 280 285

Met Thr Gly Gln Asp Gln Ser Phe Gly Pro Leu Ser Ser Leu Asn Gly

290 295 300

Phe Asp Leu Gln Ser Leu Ala Val Thr Gly Gln Leu Pro Pro Gln Ser

305 310 315 320

Leu Ala Gln Leu Gln Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala Lys

325 330 335

Pro Gly Met Ser Val Ser Pro Leu Val Asp Gln Arg Ser Ile Phe Asn

340 345 350

Phe Glu Asn Pro Lys Ile Arg Phe Gly Asp Gly His Gly Gln Thr Met

355 360 365

Asn Asn Gly Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met Arg

370 375 380

Leu Arg Pro Gly Gln Asn Val Gln Ser Ser Gly Met Met Leu Pro Val

385 390 395 400

Ala Asp Gln Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu Gly

405 410 415

Gln Gln Pro Ile Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu Thr

420 425 430

Gly Ala Leu Ala Val Arg Asn Ser Ile Pro Glu Thr Asn Ser Arg Val

435 440 445

Leu Pro Thr Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu Pro

450 455 460

Arg Ser Ser Phe Pro Leu Ala Ser Ala Pro Gly Ile Ser Val Pro Val

465 470 475 480

Ser Val Ser Tyr Gln Glu Glu Val Asn Ser Ser Asp Ala Lys Gly Gly

485 490 495

Ser Ser Ala Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp Ile Phe

500 505 510

Asn Asp Phe Pro Gln His Gln Gln His Asn Lys Asn Ile Ser Asn Lys

515 520 525

Leu Asn Asp Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser Asn

530 535 540

Gln Asp Ala Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser Glu

545 550 555 560

Ala Tyr Ser Ser Ser Ser Thr Gln Arg Lys Arg Arg Glu Thr Asp Ala

565 570 575

Thr Val Val Gly Glu His Gly Gln Asn Leu Gln Ser Pro Ser Arg Asn

580 585 590

Leu Tyr His Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg Val

595 600 605

Lys Ser Glu Arg Val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn Thr

610 615 620

Leu Phe His Glu Gln Tyr Asn Gln Glu Asp Leu Met Ser Ala Phe Leu

625 630 635 640

Lys Gln Glu Gly Ile Pro Ser Val Asp Asn Glu Phe Glu Phe Asp Gly

645 650 655

Tyr Ser Ile Asp Asn Ile Gln Val

660

<210> 23

<211> 1036

<212> PRT

<213> 圆叶拟南芥

<400> 23

Leu Ser Lys Lys Gln Asn Glu Asp Ala Ser Gly Arg Lys Glu Glu Asp

1 5 10 15

Gly Lys Gly Asn Glu His Asn Gly Met Glu Ser Cys Thr Arg Met Lys

20 25 30

Arg Thr Val Trp Thr Val Glu Leu His Gln Lys Phe Val Asn Ala Phe

35 40 45

Gln Gln Leu Gly Leu Asp Lys Ala Ser Pro Glu Gln Ile His Ala Leu

50 55 60

Met Asn Val Glu Gly Leu Pro Val Ile Asn Val Ala Ser His Leu Gln

65 70 75 80

Lys Tyr Arg Leu Phe Leu Lys Lys Ile Tyr Glu Gly Gln Gln Leu Asp

85 90 95

Met Ala Thr Ile Gln Leu Leu Leu Ser Ala Gly Ser His Phe Pro Gln

100 105 110

Thr Pro Trp Thr Asn His Cys Ser Ser Phe Ile Gln Gln Gly His His

115 120 125

Gln Asn Ser Ser Asn Ser Ser Glu Thr Tyr His Thr Thr Leu Ser Pro

130 135 140

Arg Val Gln Lys Val Asn Thr Phe Gln Pro Ser Ser Ser Pro Leu Lys

145 150 155 160

Pro Leu Leu Phe Pro Lys Ser Asn Ile Ser Ala Phe Lys Glu Asp Phe

165 170 175

Lys Ser Ile Lys Glu Pro Ala Ile Val Gly Asp Ser Ser Leu Asp Ser

180 185 190

Ser Lys Pro Arg Asn Ser Phe Gln Thr Ala Ser Lys Phe Pro Lys Thr

195 200 205

Asp Pro Cys Thr Gly Ser Tyr Ile Ile Glu Ile Met Thr Glu Pro Tyr

210 215 220

Tyr Gly Lys Ser Ser Arg Arg His Ser Asn Phe Ser Ala Tyr Met Gly

225 230 235 240

Asp Phe Lys Ser Ile Lys Asp Pro Glu Ile Val Gln Glu Ser Arg Thr

245 250 255

Arg Lys Asn His Gly Arg Val Val Trp Ser His Glu Leu His Gln Lys

260 265 270

Phe Leu Asn Ala Ile Asp Gln Leu Gly Gly Asn Glu Lys Ala Ile Pro

275 280 285

Lys Lys Ile Leu Ala Val Met Asn Val Glu Gly Leu Thr Arg Leu Asn

290 295 300

Val Ala Thr His Leu Gln Lys Tyr Arg Gln Cys Cys Ser Ala Glu Ala

305 310 315 320

Gln Gln Leu Asn Met Ala Thr Arg Lys Leu Pro Ser Ser Glu His Leu

325 330 335

Pro Gln Ser Pro Ser Thr Asn His His Ser Ser Leu Ser Pro Arg Val

340 345 350

Gln Asp Val Asn Ile Arg Leu Trp Ser Ser Ser Pro Lys Arg Gln Asp

355 360 365

Gln Ile Leu Val Tyr Val Leu Phe Ser Phe Glu Asn Asp Asn Gly Arg

370 375 380

Glu Glu Thr Thr Cys Arg Arg Ile Ala Ser Thr Met Glu Leu Gly Ser

385 390 395 400

Thr Glu Asp Gly Arg His Asp Lys Phe Pro Val Gly Met Arg Val Leu

405 410 415

Ala Val Asp Asp Asn Pro Thr Cys Leu Arg Lys Leu Glu Glu Leu Leu

420 425 430

Leu Arg Cys Lys Tyr His Val Thr Lys Thr Met Glu Ser Arg Lys Ala

435 440 445

Leu Glu Leu Leu Arg Glu Asn Ser Asn Met Phe Asp Leu Val Ile Ser

450 455 460

Asp Val Glu Met Pro Asp Thr Asp Gly Phe Lys Leu Leu Glu Ile Gly

465 470 475 480

Leu Glu Met Asp Leu Pro Val Ile Met Leu Ser Ala His Ser Asp Tyr

485 490 495

Asp Ser Val Met Lys Gly Ile Ile His Gly Ala Cys Asp Tyr Leu Val

500 505 510

Lys Pro Val Gly Leu Lys Glu Leu Gln Asn Ile Trp His His Val Val

515 520 525

Lys Lys Asn Ile Lys Ser Tyr Ala Lys Asn Ile Gly Pro Ser Arg Gln

530 535 540

Leu Leu Pro Pro Ser Glu Ser Asn Leu Val Pro Ser Ala Ser Lys Lys

545 550 555 560

Arg Lys Glu Lys Ala Ser Asp Ser Gly Asp Glu Asp Asp Ser Asp Arg

565 570 575

Glu Glu Asp Asp Gly Glu Gly Ser Glu Gln Asp Gly Glu Glu Ser Gly

580 585 590

Thr Arg Lys Lys Pro Arg Val Val Trp Ser Gln Glu Leu His Gln Lys

595 600 605

Phe Val Ser Ala Val Gln Gln Leu Gly Leu Asp Lys Ala Val Pro Lys

610 615 620

Lys Ile Leu Asp Leu Met Ser Ile Glu Gly Leu Thr Arg Glu Asn Val

625 630 635 640

Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Lys Ile Asp Glu

645 650 655

Gly Gln Gln Gln Asn Met Thr Pro Asp Ala Phe Gly Thr Arg Asp Ser

660 665 670

Ser Tyr Phe Gln Met Ala Gln Leu Asp Gly Leu Arg Asp Phe Thr Ala

675 680 685

Thr Arg Gln Ile Pro Ser Ser Gly Leu Leu Ser Arg Ser His Leu Thr

690 695 700

Lys Leu Gln Pro Pro Met Tyr Ser Ser Ile Asn Leu Gln Gly Met Asn

705 710 715 720

Ser Ser Ser Phe Ile Gln Gln Gly His His His Asn Ser Ser Asn Ser

725 730 735

Ala Asn Pro Phe Gly Thr Tyr His Thr Thr Leu Ser Pro Arg Ile Gln

740 745 750

Asn Val Asn Leu Leu Gln Arg Thr Ser Ser Pro Leu Glu Thr Leu Gln

755 760 765

Phe Pro Arg Ser Lys Ser Tyr Ile Gly Asp Phe Lys Gly Ile Gly Asp

770 775 780

Arg Ala Val Gly Gly Ser Phe Leu Asp Ser Cys Met Pro Phe Gly Ser

785 790 795 800

Ser Ser Thr Ser Leu Pro Ser Ala Ser Thr Asn Thr Leu Met Leu Gln

805 810 815

Ala Asn Tyr Thr Gln Pro Leu His Ile Ala Ser Asp Gly Asn Gln Pro

820 825 830

Cys Ile Glu Gly Thr Pro Ser Asn Ser Ala Ser Pro Asn Ile Ser Phe

835 840 845

Gln Gly Leu Ser Arg Phe Pro Ser His Ser Trp Gln Gly Asn Leu Asn

850 855 860

Thr Thr Arg Phe Pro Pro Ser Ser Leu Pro Leu Asn Gln Ala Phe Leu

865 870 875 880

Pro Asp Gln Val Thr Cys Ala Gly Asn Asn Leu Gly Asp Cys Thr Ser

885 890 895

Leu Val Ser Ala Gly Asn Pro Gly Gly Glu Met Gln Cys Glu Pro Gln

900 905 910

Leu Leu Gly Gly Phe Met Gln Asn Met Asn Pro Leu Asp Gly Gln Lys

915 920 925

Trp Glu Gln Gln Asn Ser Met Leu Asn Asn Pro Phe Gly Asn Ile Glu

930 935 940

Tyr Pro Leu Ser Ala Asp Asn Met Val Phe Arg Asp Asn Asn Ala Thr

945 950 955 960

Arg Asn Lys Gly Leu Asp Glu Ser Leu Met Asn Pro Ile Asp Asn Ser

965 970 975

Gln Glu Tyr Val Gly Lys Ala Thr Thr Met Leu Asp Pro Glu Met Lys

980 985 990

Ser Gly Lys Pro Glu Asn Asp Asn Gln His Asp Val Phe Asp Asp Ile

995 1000 1005

Met Asn Glu Met Met Lys Gln Glu Glu Asn Asn Gly Met Val Ser

1010 1015 1020

Val Ala Thr Arg Phe Gly Phe Asp Ser Phe Pro Pro Pro

1025 1030 1035

<210> 24

<211> 774

<212> PRT

<213> 琴叶拟南芥

<400> 24

Met Gly Asp Phe Lys Ser Ile Lys Glu Pro Glu Ile Val Gln Glu Ser

1 5 10 15

Arg Thr Arg Lys Asn His Gly Arg Val Val Trp Ser His Glu Leu His

20 25 30

Gln Lys Phe Leu His Ala Ile Asp Gln Leu Gly Gly Asn Asp Lys Ala

35 40 45

Ile Pro Lys Lys Ile Leu Ala Val Met Asn Val Glu Gly Leu Thr Arg

50 55 60

Leu Asn Val Ala Thr His Leu Gln Lys Tyr Arg Gln Cys Cys Ser Thr

65 70 75 80

Glu Ala Gln Gln Leu Asn Met Ala Thr Arg Lys Leu Pro Ser Ser Glu

85 90 95

His Leu Pro Gln Ser Pro Ser Thr Asn His His Ser Ser Leu Ser Pro

100 105 110

Arg Val Gln Asp Asn Asp Asn Gly Arg Glu Glu Thr Thr Cys Arg Arg

115 120 125

Ile Ala Ser Thr Met Glu Leu Gly Ser Thr Glu Asp Gly Arg His Asp

130 135 140

Lys Phe Pro Val Gly Met Arg Val Leu Ala Val Asp Asp Asn Pro Thr

145 150 155 160

Cys Leu Arg Lys Leu Glu Glu Leu Leu Leu Arg Cys Lys Tyr His Val

165 170 175

Thr Lys Thr Met Glu Ser Arg Lys Ala Leu Glu Leu Leu Arg Glu Asn

180 185 190

Ser Asn Met Phe Asp Leu Val Ile Ser Asp Val Glu Met Pro Asp Thr

195 200 205

Asp Gly Phe Lys Leu Leu Glu Ile Gly Leu Glu Met Asp Leu Pro Val

210 215 220

Ile Met Leu Ser Ala His Ser Asp Tyr Asp Ser Val Met Lys Gly Ile

225 230 235 240

Ile His Gly Ala Cys Asp Tyr Leu Val Lys Pro Val Gly Leu Lys Glu

245 250 255

Leu Gln Asn Ile Trp His His Val Val Lys Lys Asn Ile Lys Ser Tyr

260 265 270

Ala Lys Asn Ile Gly Pro Ser Arg Gln Leu Leu Pro Pro Ser Glu Ser

275 280 285

Asn Leu Val Pro Ser Ala Ser Lys Lys Arg Lys Glu Lys Ala Asn Asp

290 295 300

Ser Gly Asp Glu Asp Asp Ser Asp Arg Glu Glu Asp Asp Gly Glu Gly

305 310 315 320

Ser Glu Gln Asp Gly Asp Glu Ala Gly Thr Arg Lys Lys Pro Arg Val

325 330 335

Val Trp Ser Gln Glu Leu His Gln Lys Phe Val Ser Ala Val Gln Gln

340 345 350

Leu Gly Leu Asp Lys Ala Val Pro Lys Lys Ile Leu Asp Leu Met Ser

355 360 365

Ile Glu Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr

370 375 380

Arg Leu Tyr Leu Lys Lys Ile Asp Glu Gly Gln Gln Gln Asn Met Thr

385 390 395 400

Pro Asp Ala Phe Gly Thr Arg Asp Ser Ser Tyr Phe Gln Met Ala Gln

405 410 415

Leu Asp Gly Leu Arg Asp Phe Thr Ala Thr Arg Gln Ile Pro Ser Ser

420 425 430

Gly Leu Leu Ser Arg Ser His Leu Thr Lys Leu Gln Pro Pro Met Tyr

435 440 445

Ser Ser Ile Asn Leu Gln Gly Met Asn Ser Ser Ser Phe Ile Gln Gln

450 455 460

Gly His His His Asn Ser Ser Asn Ser Ala Asn Pro Phe Gly Thr Tyr

465 470 475 480

His Thr Thr Leu Ser Pro Arg Ile Gln Asn Val Asn Leu Phe Gln Arg

485 490 495

Thr Ser Ser Pro Leu Glu Thr Leu Gln Phe Pro Arg Ser Lys Ser Tyr

500 505 510

Ile Gly Asp Phe Lys Gly Ile Gly Asp Arg Ala Val Gly Gly Ser Phe

515 520 525

Leu Asp Ser Cys Met Pro Phe Gly Ser Ser Ser Thr Ser Leu Pro Ser

530 535 540

Ala Ser Thr Asn Thr Leu Met Leu Gln Ala Asn Tyr Thr Gln Pro Leu

545 550 555 560

His Ile Ser Ser Asp Gly Asn Gln Pro Cys Ile Glu Gly Thr Pro Ser

565 570 575

Asn Ser Ala Ser Pro Asn Ile Ser Phe Gln Gly Leu Ser Arg Phe Pro

580 585 590

Ser His Ser Trp Gln Gly Asn Leu Asn Thr Thr Arg Phe Pro Pro Ser

595 600 605

Ser Leu Pro Leu Asn Pro Ala Phe Leu Pro Asp Gln Val Thr Cys Ala

610 615 620

Gly Asn Asn Leu Gly Asp Cys Thr Ser Leu Val Ser Ala Gly Asn Pro

625 630 635 640

Gly Gly Glu Ile Gln Cys Glu Pro Gln Leu Leu Gly Gly Phe Met Gln

645 650 655

Asn Met Asn Pro Leu Asp Gly Gln Lys Trp Glu Gln Gln Asn Cys Thr

660 665 670

Met Leu Asn Asn Pro Phe Gly Asn Ile Glu Tyr Pro Leu Pro Ala Asp

675 680 685

Asn Met Val Phe Arg Asp Asn Asn Ala Thr Arg Ser Lys Gly Leu Asp

690 695 700

Glu Ser Leu Met Asn Pro Ile Asp Asn Ser Gln Glu Tyr Val Gly Lys

705 710 715 720

Ala Thr Thr Met Leu Asp Pro Glu Met Lys Ser Gly Lys Pro Glu Asn

725 730 735

Asp Asn Gln His Asp Val Phe Asp Asp Leu Met Asn Glu Met Met Lys

740 745 750

Gln Glu Glu Asn Asn Gly Met Val Ser Val Ala Thr Arg Phe Gly Phe

755 760 765

Asp Ser Phe Pro Pro Pro

770

<210> 25

<211> 578

<212> PRT

<213> 向日葵

<400> 25

Met Thr Thr Gly Ser Ser Phe Gly Ser Gly Ser Leu Gly Cys Lys Gln

1 5 10 15

Glu Thr Gly Val Pro Asp Gln Phe Pro Ala Gly Leu Arg Val Leu Val

20 25 30

Val Asp Asp Asp Val Ile Cys Leu Lys Ile Leu Glu Gln Met Leu Arg

35 40 45

Arg Cys Ser Tyr His Val Thr Thr Cys Ser Gln Ala Thr Ala Ala Leu

50 55 60

Asn Leu Leu Arg Glu Arg Lys Gly Cys Phe Asp Val Val Leu Ser Asp

65 70 75 80

Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly

85 90 95

Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Gly Arg Thr

100 105 110

Asn Leu Val Leu Arg Gly Ile Arg His Gly Ala Cys Asp Tyr Leu Ile

115 120 125

Lys Pro Ile Arg Glu Glu Gln Leu Lys Asn Ile Trp Gln His Val Ile

130 135 140

Arg Lys Lys Trp Asn Glu Asn Lys Glu His Glu His Ser Gly Ser Val

145 150 155 160

Asp Asp Lys Asp Arg His Lys Arg Gly Gly Asp Asp Asn Asp Tyr Ala

165 170 175

Ser Ser Val Asn Glu Gly Gly Asp Gly Ile Leu Thr Ser His Lys Lys

180 185 190

Lys Arg His Asn Asn Lys Glu Glu Asp Asp Gly Glu Leu Glu Thr Asp

195 200 205

Glu Pro Gly Gly Ser Lys Lys Ala Arg Val Val Trp Ser Val Glu Leu

210 215 220

His Gln Gln Phe Val Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala

225 230 235 240

Val Pro Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg

245 250 255

Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg

260 265 270

Leu Ser Gly Val Ala Gln Gln Gly Gly Gly Pro Asn Ser Phe Cys Gly

275 280 285

Ser Ile Asp Gln Asn Pro Lys Leu Ala Ser Tyr Ala Arg Phe Glu Ile

290 295 300

Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr Leu Val Ala

305 310 315 320

Leu His Ala Glu Leu Leu Gly Gln Pro Thr Ala Asn Val Gly Met Pro

325 330 335

Val Leu Asp His Gln Pro Leu Met Gln Pro Ser Lys Cys Gly Pro Val

340 345 350

Asp His Val Met Ser Tyr Gly Gln Thr Leu Pro Ser Asn Val Thr Lys

355 360 365

Gln Val Pro Gln Pro Ala Ile Glu Asp Val His Ser Gly Leu Gly Ala

370 375 380

Trp His Ser Asn Asn Met Val Gly Gly Tyr Gly Gln Leu Gly Gly Gln

385 390 395 400

Asn Trp His Asn Met Leu Leu Gly Met Leu Gln Ser Gln Ser His Gln

405 410 415

Leu Gln Lys Gln Ser Ile Thr Val Gln Pro Ser Arg Leu Val Val Pro

420 425 430

Ser Gln Ser Ser Asn Phe Gln Ala Val Asn Asn Gly Val Pro Val Asn

435 440 445

Gln Thr Thr Gly Phe Asn Asn Ser Thr Val Ile Asn Tyr Ala Val Gly

450 455 460

Gln Arg Thr Glu Arg Asp Val Glu Asn Gln Ile Gly Gly Gln Ser Ser

465 470 475 480

Val Ser Asn Ile Ser Val Lys Glu Met Gly Glu Lys Gln Ile Ser Phe

485 490 495

Gly Glu Ser Val His Val Leu Asp Gln Gly Ser Leu Arg Asn Leu Gly

500 505 510

Phe Val Gly Lys Lys Ser Ser Ile Pro Ser Arg Phe Ala Val Tyr Glu

515 520 525

Ala Ala Glu Ser Leu Thr His Asn Leu Asn Tyr Gly Asp Asn Asn Gly

530 535 540

Glu Arg Arg Val Lys Gln Glu Pro Asn Ile Glu Phe Leu Glu Asn Ser

545 550 555 560

Lys Ala Gly Ala His Arg Val Ser Gln Asn Asp Leu Met Ser Lys Gln

565 570 575

Val Arg

<210> 26

<211> 428

<212> PRT

<213> 葡萄

<400> 26

Met Ala Ala Leu Leu Lys Val Pro Pro Gln Ser Ser Gly Gly Thr Asn

1 5 10 15

Gly Ser Cys Lys Ala Asp Val Val Val Ser Asp Gln Phe Pro Ala Gly

20 25 30

Leu Arg Val Leu Val Val Asp Asp Asp Val Thr Cys Leu Lys Ile Leu

35 40 45

Glu Gln Met Leu Arg Arg Cys Leu Tyr His Val Thr Thr Cys Ser Gln

50 55 60

Ala Thr Ile Ala Leu Asn Ile Leu Arg Glu Lys Lys Gly Cys Phe Asp

65 70 75 80

Ile Val Leu Ser Asp Val His Met Pro Asp Met Asp Gly Tyr Lys Leu

85 90 95

Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser

100 105 110

Ala Asp Gly Arg Thr Ser Ala Val Met Arg Gly Ile Arg His Gly Ala

115 120 125

Cys Asp Tyr Leu Ile Lys Pro Ile Arg Glu Glu Glu Leu Lys Asn Ile

130 135 140

Trp Gln His Val Val Arg Lys Lys Trp Asn Glu Asn Lys Glu His Glu

145 150 155 160

His Ser Gly Ser Leu Glu Asp Asn Asp Arg His Lys Arg Gly Gly Glu

165 170 175

Asp Ala Glu Tyr Ala Ser Ser Val Asn Glu Gly Ala Glu Gly Ile Leu

180 185 190

Lys Gly Gln Lys Lys Arg Arg Asp Ser Lys Asp Glu Asp Asp Gly Glu

195 200 205

Leu Glu Asn Glu Asp Pro Ser Thr Ser Lys Lys Pro Arg Val Val Trp

210 215 220

Ser Val Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln Leu Gly

225 230 235 240

Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Asn Val Pro

245 250 255

Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu

260 265 270

Tyr Leu Lys Arg Leu Ser Gly Val Ala Gln Gln Gln Gly Gly Ile Pro

275 280 285

Asn Ser Phe Cys Gly Pro Val Glu Pro Asn Val Lys Leu Gly Ser Leu

290 295 300

Gly Arg Phe Asp Ile Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro

305 310 315 320

Gln Thr Leu Ala Ala Leu Gln Ala Glu Leu Leu Gly Arg Pro Thr Ser

325 330 335

Asn Leu Val Leu Pro Ala Met Asp Gln Pro Ala Leu Leu Gln Ala Ser

340 345 350

Leu Gln Gly Pro Lys Cys Ile Pro Val Glu His Gly Val Ala Phe Gly

355 360 365

Gln Pro Leu Val Lys Cys Gln Thr Asn Ile Ser Lys His Phe Pro Pro

370 375 380

Thr Val Val Ser Thr Glu Asp Val Pro Ser Gly Phe Gly Ala Trp Pro

385 390 395 400

Ser Asn Ser Leu Gly Thr Val Gly Thr Ser Gly Ser Leu Gly Gly Leu

405 410 415

Ser Ala Gln Asn Asn Asn Ile Leu Met Asp Met Lys

420 425

<210> 27

<211> 659

<212> PRT

<213> 无油樟

<400> 27

Met Ala Asn Val Gln Lys Leu Pro His Ser Ser Ile Ser Thr Ala Ser

1 5 10 15

Ser Tyr Gly Ser Cys Arg Gly Glu Gly Val Pro Asp Gln Phe Pro Ala

20 25 30

Gly Leu Arg Val Leu Val Val Asp Asp Asp Thr Thr Cys Leu Arg Ile

35 40 45

Leu Glu Gln Met Leu Arg Lys Cys Met Tyr Lys Val Thr Thr Cys Cys

50 55 60

Arg Ala Thr Asp Ala Leu Asp Thr Leu Arg Gly Ser Lys Gly Cys Phe

65 70 75 80

Asp Val Val Ile Ser Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys

85 90 95

Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met

100 105 110

Ser Ala Asp Ala Arg Phe Ser Ala Val Met Lys Gly Ile Lys His Gly

115 120 125

Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Glu Leu Lys Asn

130 135 140

Ile Trp Gln His Val Val Arg Lys Lys Trp Asn Glu Thr Lys Glu His

145 150 155 160

Asp Gln Ser Gly Ser Ile Glu Asp Asn Glu Arg His Lys Arg Gly Ser

165 170 175

Asp Asp Ala Glu Tyr Ala Ser Ser Val Asn Glu Gly Thr Asp Gly Asn

180 185 190

Trp Lys Val Gln Lys Lys Arg Lys Asp Ser Lys Glu Glu Glu Asp Asp

195 200 205

Gly Glu Gln Glu Asn Glu Asp Pro Ser Ala Ala Lys Lys Pro Arg Val

210 215 220

Val Trp Ser Val Glu Leu His Gln Gln Phe Val Asn Ala Val Asn Gln

225 230 235 240

Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Asn

245 250 255

Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe

260 265 270

Arg Leu Tyr Leu Lys Arg Leu Ser Gly His Gln Ala Gly Val Ser Ser

275 280 285

Ser Phe Cys Gly Ser Val Asp Pro Asn Ser Lys Leu Gly Pro Leu Ser

290 295 300

Gln Leu Asp Ile Arg Ala Leu Thr Ala Ser Gly Gln Ile Pro Ser Gln

305 310 315 320

Thr Leu Ala Ala Leu Gln Ala Glu Leu Leu Gly Arg Pro Ser Asn Asn

325 330 335

Val Ala Met Pro Val Tyr Gly Gln Thr Leu Val Lys Cys Gln Pro Asn

340 345 350

Leu Pro Lys Gln Phe Pro Gln Pro Asn Leu Pro Val Asp Asp Val Gln

355 360 365

Ser Ser Leu Ser Ile Trp Gln His His Leu Ser Ser Gly Met Pro Leu

370 375 380

Gly Gly Leu Asn Pro Gln Asn Asn Gly Leu Leu Met Gln Gln Gln Gln

385 390 395 400

Gln Leu Thr Ile Glu Ser Asn Arg Pro Cys Asn Val Gln Pro Ser Cys

405 410 415

His Val Ala Pro Ser Asn Gly Gly Phe Thr Met Arg Asn Asn Pro Thr

420 425 430

Ser Ser Asn Ala Ser Ser Val Glu Tyr Asn Ser Leu Leu Ser Ser Gln

435 440 445

Gly Asp Val Gly Gln Ile Ser Gln Ala Ser Gly Ser Asp Leu Ala Thr

450 455 460

Thr Val Gln Ser Asn Gly Gly Phe Lys Ser Leu Asp Tyr Arg Asn Met

465 470 475 480

Gly Gln Val Ser Leu Glu Ser Thr Ser Asp Leu Val Ser Thr Gln Asn

485 490 495

Asn Gly Phe Lys Gly Met Glu Leu Arg Asn Val Gly Ser Leu Gly Gly

500 505 510

Tyr Pro Leu Ser Ser Ser Val Ser Ala Gly Ser Thr Lys Thr Glu Asn

515 520 525

Gly Gln Ser Phe Ser Gln Val Arg Thr Gly Pro Arg Met Ser Met Gly

530 535 540

Pro Thr Gly Gln Phe Val Gly Pro Pro Thr Ile Arg Arg Leu Pro Met

545 550 555 560

Val Asp Gly Gly Thr His Arg Asn Ser Leu Gly Phe Val Gly Lys Gly

565 570 575

Val Ser Ile Pro Ser Arg Phe Met Pro Asp Ser Gly Ser Pro Thr Gly

580 585 590

Val Gly Glu Glu Cys Thr Leu Pro Lys Gln Glu Val Asp Pro Asp Phe

595 600 605

Phe Asp Ser Leu Lys Val Gly Pro Val Gly Val Gln His Tyr Ala Ser

610 615 620

Gly Asp Leu Met Ser Val Leu Ser Lys Gln Gln Gln Ala Ser Thr Gly

625 630 635 640

Asn Leu Asp Cys Glu Phe Gly Ile Asp Gly Tyr Gln Leu Gly Asn Ile

645 650 655

His Val Lys

<210> 28

<211> 669

<212> PRT

<213> 蓖麻

<400> 28

Met Ala Ala Leu Gln Arg Val Ala Ser Ser Val Ser Ala Thr Ala Ser

1 5 10 15

Asn Tyr Ser Ser Cys Lys Gly Asn Gly Val Val Thr Ala Thr Ala Asp

20 25 30

Val Ala Val Ser Asp Gln Phe Pro Ala Gly Leu Arg Val Leu Val Val

35 40 45

Asp Asp Asp Thr Thr Cys Leu Arg Ile Leu Glu Gln Met Leu Arg Arg

50 55 60

Cys Leu Tyr His Val Thr Thr Cys Ser Gln Ala Lys Val Ala Leu Asn

65 70 75 80

Leu Leu Arg Glu Arg Lys Gly Cys Phe Asp Val Val Leu Ser Asp Val

85 90 95

His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu

100 105 110

Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Gly Arg Thr Ser

115 120 125

Ala Val Met Arg Gly Ile Arg His Gly Ala Cys Asp Tyr Leu Ile Lys

130 135 140

Pro Ile Arg Glu Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg

145 150 155 160

Lys Lys Trp His Glu Asn Lys Glu Ile Glu His Ser Gly Ser Leu Glu

165 170 175

Asp Asn Asp Arg His Lys Arg Gly Asn Glu Asp Ala Glu Tyr Thr Ser

180 185 190

Ser Val Asn Glu Gly Thr Glu Gly Val Leu Lys Gly Gln Lys Arg Arg

195 200 205

Ser Asn Ser Lys Asp Glu Asp Asp Gly Glu Pro Asp Ser Asp Asp Pro

210 215 220

Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln

225 230 235 240

Gln Phe Val Ser Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro

245 250 255

Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn

260 265 270

Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg Leu Ser

275 280 285

Gly Val Ala Gln Gln Gly Gly Ile Ser Ser Thr Phe Cys Gly Pro Met

290 295 300

Asp Ser Asn Val Lys Leu Asn Ser Leu Gly Arg Phe Asp Ile Gln Ala

305 310 315 320

Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr Leu Ala Ala Leu His

325 330 335

Ala Glu Leu Phe Gly Arg Pro Thr Gly Ser Leu Val Thr Thr Met Asp

340 345 350

Gln Pro Thr Leu Leu Gln Ala Ser Arg Gln Ser Pro Lys Cys Ile Pro

355 360 365

Val Glu His Gly Val Thr Phe Gly Gln Pro Ile Val Lys Cys Ser Ser

370 375 380

Gly Ile Ser Lys His Phe Pro Gln Asn Met Val Ser Val Glu Glu Val

385 390 395 400

Ser Ser Gly Tyr Gly Ala Trp Pro Ser Asn Ser Leu Gly Thr Val Gly

405 410 415

Pro Ser Thr Asn Leu Gly Gly Met Thr Thr Gln Asn Gly Asn Met Leu

420 425 430

Met Asp Ile Phe His Gln Gln Gln Lys Gln Gln Gln Pro Gln Gln Gln

435 440 445

Gln Ser Leu Ala Asp Pro Ser Arg Ser Ile Asn Val Gln Pro Ser Cys

450 455 460

Leu Val Val Pro Ser Gln Ser Ser Ala Cys Phe Gln Ala Gly Asn Ser

465 470 475 480

Pro Ala Ser Val Asn Gln Ser Asn Phe Asn Arg Asn Val Val Ile Asp

485 490 495

Tyr Ser Leu Leu Ser Ser Gln Ser Asn Asn Ser Ala Leu Asn Ile Gly

500 505 510

His Ile Pro Glu Gly Asp Leu Lys Thr Thr Gly Ala Val Asn Gly Tyr

515 520 525

Ser Ala Pro Gly Ser Leu Ser Pro Pro Ala Ser Ser Cys Ser Val Asn

530 535 540

Ala Asp Ser Gly Val Pro Arg Gln Val Gln Asn Pro Thr Leu Ala Phe

545 550 555 560

Gly Ala Val Arg Gln Leu Pro Ala Leu Ser Pro Asn Ile Phe Asn Ile

565 570 575

Gln Gly Ser Tyr Gly Val Arg Ser Asp Asp Ile Leu Asp Gln Gly Pro

580 585 590

Phe Phe Lys Asn Leu Gly Phe Val Gly Lys Gly Thr Cys Ile Pro Ser

595 600 605

Arg Phe Ala Val Asp Glu Phe Glu Thr Pro Ser Ser Asn Leu Ser His

610 615 620

Gly Lys Leu Tyr Val Glu Asn Asn Asp Asn Lys Val Lys Gln Glu Pro

625 630 635 640

Asn Ile Asp Phe Thr Asp Thr Ser Arg Val Gly Ile Pro Val Leu Gln

645 650 655

Gln Tyr Pro Pro Asn Asp Leu Met Ser Val Phe Thr Glu

660 665

<210> 29

<211> 654

<212> PRT

<213> 番茄

<400> 29

Met Val Ser Met Ser Gly Glu Val Ala Thr Cys Lys Ser Glu Ala Thr

1 5 10 15

Val Val Thr Asp His Phe Pro Val Gly Leu Arg Val Leu Val Val Asp

20 25 30

Asp Asp Val Val Cys Leu Arg Ile Ile Glu Gln Met Leu Arg Arg Cys

35 40 45

Lys Tyr Ser Val Thr Thr Cys Thr Gln Ala Met Val Ala Leu Asn Leu

50 55 60

Leu Arg Glu Lys Arg Gly Thr Phe Asp Ile Val Leu Ser Asp Val His

65 70 75 80

Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly Leu Glu

85 90 95

Met Asp Leu Pro Val Ile Met Met Ser Gly Asp Gly Arg Thr Asn Leu

100 105 110

Val Met Arg Gly Val Gln His Gly Ala Cys Asp Tyr Leu Ile Lys Pro

115 120 125

Ile Arg Asp Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg Lys

130 135 140

Arg Tyr Asn Ser Ser Lys Glu Pro Glu Cys Ser Gly Ser Leu Asp Asp

145 150 155 160

Asn Asp Arg Tyr Arg Arg Arg Ser Asp Asp Ala Glu Cys Ala Ser Ser

165 170 175

Val Ile Glu Gly Ala Asp Gly Val Leu Lys Pro Gln Lys Lys Lys Arg

180 185 190

Glu Ala Lys Glu Asp Asp Thr Glu Met Glu Asn Asp Asp Pro Ser Thr

195 200 205

Thr Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln Gln Phe

210 215 220

Val Ser Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg

225 230 235 240

Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn Val Ala

245 250 255

Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg Leu Ser Gly Val

260 265 270

Val Gln Gln Gln Gly Gly Leu Pro Ser Thr Phe Cys Gly Pro Ile Glu

275 280 285

Gln Asn Ser Glu Leu Gly Ser Leu Gly Arg Phe Asp Ile Gln Ala Leu

290 295 300

Ala Ala Ser Gly Gln Ile Pro Pro Glu Thr Leu Thr Ala Leu His Ala

305 310 315 320

Glu Leu Leu Gly Arg Ser Thr Ser Asn Leu Val Leu Pro Ala Val Glu

325 330 335

Gln Gln Asn Leu Val Gln Val Ser Leu Gln Gln Ala Lys Cys Ile Pro

340 345 350

Val Asp Gln Val Met Ala Tyr Gly Gln Pro Leu Leu Lys Cys Pro Ala

355 360 365

Ser Ile Ser Asn Ser Lys His Leu Ser Gln Ala Ile Leu Ser Ala Glu

370 375 380

Asp Val His Ser Gly Phe Gly Ser Gln Arg Ala Lys Asn Ile Cys Met

385 390 395 400

Val Pro Ser Ser Asn Pro Ile Ala Pro Asn Ser Asn Met Leu Thr Ala

405 410 415

Met Met Gln Gln Gln Gln Trp Gln Lys Gln Gln Gln Ile Glu Leu Gln

420 425 430

His Arg Gln Ser Gly Pro Pro Glu Val Asn Arg Ser Ile Asn Val Gln

435 440 445

Pro Ser Cys Leu Val Leu Pro Ser Gln Leu Pro Gly His Phe Gln Val

450 455 460

Gly Asp Ser Pro Ala Ser Ile Ser Arg Ala Gly Ser Leu Ser Lys Ser

465 470 475 480

Ser Val Ile Asp Tyr Gly Val Leu Ser Pro Gln Ser Asn Asn Ser Ser

485 490 495

Gly Val Val Gln Val Leu Asp Arg Glu Leu Lys Pro Glu Cys Gly Leu

500 505 510

Asn Arg Leu Pro Ser Gly Gly Ser Leu Ser Arg Ser Cys Ser Ile Asn

515 520 525

Ala Asp Asn Ser Val Asp Leu Gln Leu His Asn Ser Ser Ser Ala Phe

530 535 540

Gly Ser Ser Lys Gln Leu Pro Gly Leu Ile Pro Ser His Leu Gly Ser

545 550 555 560

Pro Val Pro Tyr Cys Ile Asn Ser Ser Leu Val Leu Asp Gln Gly Arg

565 570 575

Met Lys Gly Ala Ser Ile Pro Ser Arg Phe Ala Val Asp Glu Ser Asp

580 585 590

Ser Pro Met Cys Asn Phe Asn Thr Ala Lys Ile Tyr Leu Glu Glu Thr

595 600 605

Lys Val Lys Gln Glu Pro Asn Met Asn Val Met Glu Asn Ala Lys Val

610 615 620

Gly Pro Ala Ile Phe Gln Lys Phe Gln Pro Gly Asp Leu Met Ser Val

625 630 635 640

Phe Arg Leu Ser Phe Ala Arg Val Lys Val Ser Ser Ser Pro

645 650

<210> 30

<211> 653

<212> PRT

<213> 马铃薯

<400> 30

Met Ser Gly Asp Val Ala Thr Cys Lys Ser Glu Ala Thr Val Val Thr

1 5 10 15

Asp His Phe Pro Leu Gly Leu Arg Val Leu Val Val Asp Asp Asp Val

20 25 30

Val Cys Leu Arg Ile Ile Glu Gln Met Leu Arg Arg Cys Lys Tyr Ser

35 40 45

Val Thr Thr Cys Thr Gln Ala Met Val Ala Leu Asn Leu Leu Arg Glu

50 55 60

Lys Arg Gly Thr Phe Asp Ile Val Leu Ser Asp Val His Met Pro Asp

65 70 75 80

Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly Leu Glu Met Asp Leu

85 90 95

Pro Val Ile Met Met Ser Gly Asp Gly Arg Thr Asn Leu Val Met Arg

100 105 110

Gly Val Gln His Gly Ala Cys Asp Tyr Leu Ile Lys Pro Ile Arg Asp

115 120 125

Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg Lys Arg Tyr Asn

130 135 140

Ser Ser Lys Glu Leu Glu Cys Ser Gly Ser Leu Asp Asp Asn Asp Arg

145 150 155 160

Tyr Lys Arg Gly Ser Asp Asp Ala Glu Cys Ala Ser Ser Val Ile Glu

165 170 175

Gly Ala Asp Gly Val Leu Lys Pro Gln Lys Lys Lys Arg Glu Ala Lys

180 185 190

Glu Glu Asp Asp Thr Glu Met Glu Asn Asp Asp Pro Ser Thr Ser Lys

195 200 205

Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln Gln Phe Val Ser

210 215 220

Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu

225 230 235 240

Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His

245 250 255

Leu Gln Glu Asn Gln Lys Phe Arg Leu Tyr Leu Lys Arg Leu Ser Gly

260 265 270

Val Val Gln Gln Gln Gly Gly Leu Pro Ser Thr Phe Cys Gly Pro Ile

275 280 285

Glu Gln Asn Ser Glu Leu Gly Ser Leu Gly Arg Phe Asp Ile Gln Ala

290 295 300

Leu Ala Ala Ser Gly Gln Ile Pro Pro Glu Thr Leu Thr Ala Leu His

305 310 315 320

Ala Glu Leu Leu Gly Arg Ser Thr Ser Asn Leu Val Leu Pro Ala Val

325 330 335

Glu Ile Gln Asn Leu Leu Gln Ala Ser Leu Gln Gln Ala Lys Cys Ile

340 345 350

Pro Ala Asp Gln Val Met Ala Tyr Gly Gln Pro Leu Leu Lys Cys His

355 360 365

Pro Ser Ile Ser Asn Ser Lys His Leu Ser Gln Ser Ile Leu Ser Ala

370 375 380

Glu Asp Val His Ser Gly Phe Gly Ser Gln Arg Ala Lys Asn Ile Cys

385 390 395 400

Leu Val Pro Ser Ser Asn Pro Ile Gly Leu Ala Ala Pro Asn Ser Asn

405 410 415

Met Leu Met Ala Met Met Gln Gln Gln Gln Trp Gln Lys Gln Gln Gln

420 425 430

Met Glu Leu Gln His Arg Arg Ser Gly Pro Pro Glu Val Asn His Ser

435 440 445

Ile Asn Val Gln Pro Ser Cys Leu Val Leu Pro Ser Gln Leu Pro Gly

450 455 460

Asn Phe Gln Val Gly Asp Ser Pro Ala Ser Ile Ser Arg Ala Gly Ser

465 470 475 480

Leu Ser Lys Ser Ser Val Ile Asp Tyr Gly Val Leu Ser Pro Gln Ser

485 490 495

Asn Asn Ser Ser Gly Val Val Gln Val Leu Asp Arg Glu Leu Lys Pro

500 505 510

Glu Cys Gly Leu Asn Arg Leu Pro Ser Gly Gly Ser Leu Ser Arg Ser

515 520 525

Cys Ser Ile Asn Ala Asp Asn Ser Val Gly Leu Gln Leu His Asn Ser

530 535 540

Ser Ser Ala Phe Gly Ser Ser Lys Gln Leu Pro Ala Leu Ile Pro Asn

545 550 555 560

His Leu Gly Ser Pro Val Pro Tyr Tyr Ile Asn Ser Ser Gln Val Leu

565 570 575

Asp Gln Gly His Thr Arg Asn Pro Gly Val Gly Lys Cys Ala Ser Ile

580 585 590

Pro Ser Arg Phe Ala Val Asp Glu Ser Asp Ser Pro Met Cys Asn Phe

595 600 605

Asn Thr Ala Lys Asn Tyr Leu Glu Glu Thr Lys Val Lys Gln Glu Pro

610 615 620

Asn Met Asn Val Met Glu Asn Ala Lys Val Gly Pro Ala Ile Phe Gln

625 630 635 640

Lys Phe Gln Pro Gly Asp Leu Met Ser Val Phe Ser Asp

645 650

<210> 31

<211> 669

<212> PRT

<213> 陆地棉

<400> 31

Met Ala Thr Met His Arg Val Val Gln Ser Ser Val Ser Thr Ser Asp

1 5 10 15

Ala Thr Thr Thr Ser Tyr Asp Gly Leu Thr Ser Cys Lys Ala Ala Asp

20 25 30

Ile Val Ile Ser Asp Gln Phe Pro Ala Gly Leu Arg Val Leu Val Val

35 40 45

Asp Asp Asp Ile Thr Cys Leu Lys Ile Leu Glu Lys Met Leu His Arg

50 55 60

Cys Arg Tyr His Val Thr Thr Cys Pro Gln Ala Lys Val Ala Leu Asn

65 70 75 80

Leu Leu Arg Glu Arg Lys Gly Cys Phe Asp Val Ile Leu Ser Asp Val

85 90 95

Tyr Met Pro Asp Met Asp Gly Tyr Lys Leu Leu Glu His Val Gly Leu

100 105 110

Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Gly Ser Thr Arg

115 120 125

Ala Val Met Lys Gly Ile Arg His Gly Ala Cys Asp Tyr Leu Ile Lys

130 135 140

Pro Ile Arg Glu Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg

145 150 155 160

Lys Lys Trp Asn Glu Asn Lys Glu Leu Glu His Ser Gly Ser Leu Asp

165 170 175

Asp Thr Asp Gln His Lys Gln Arg His Asp Asp Ala Glu Tyr Ala Ser

180 185 190

Ser Val Asn Asp Ala Thr Glu Thr Ser Leu Lys Pro Leu Lys Lys Arg

195 200 205

Ser Asn Ser Lys Glu Glu Asp Asp Gly Glu Ile Asp Asn Asp Asp Pro

210 215 220

Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln

225 230 235 240

Gln Phe Val Ser Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro

245 250 255

Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn

260 265 270

Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg Ile Ser

275 280 285

Gly Val Ala Gln Gln Gly Gly Ile Ala Asn Pro Leu Cys Gly Pro Val

290 295 300

Glu Ala Asn Val Lys Ile Gly Ser Leu Gly Ser Phe Asn Ile Gln Ala

305 310 315 320

Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr Leu Ala Ala Val His

325 330 335

Ala Glu Leu Leu Gly Arg Ser Ala Gly Asn Leu Val Val Ala Thr Asp

340 345 350

Gln Pro Ala Leu Leu Gln Ala Thr Pro Gln Gly Ala Lys Cys Ile Gln

355 360 365

Val Asp Gln Gly Val Ala Phe Val Gln His Ser Val Lys Ser Glu Ser

370 375 380

Ser Ser Ser Lys His Phe Ser Gln Ser Phe Ala Pro Val Glu Asp Val

385 390 395 400

Ala Ser Gly Phe Arg Ser Trp Pro Ser Asn Asn Ile Gly Thr Ala Gly

405 410 415

Pro Ser Asn Ser Gly Gly Leu Ser Ser Gln Asn Gly Asn Met Leu Ile

420 425 430

Asp Leu Leu Gln Gln Gln Gln Gln Leu Gln Lys Pro Gln Gln Arg Ser

435 440 445

Thr Val Ser Glu Leu Arg Arg Ser Ile Asn Val Gln Pro Ser Cys His

450 455 460

Val Val Pro Ser Gln Ser Ser Ala Ser Phe Arg Ala Gly Asn Ser Pro

465 470 475 480

Val Ser Val Thr Gln Asn Gly Ser Tyr Ser Arg Thr Ala Val Ile Asp

485 490 495

Tyr Ser Leu Leu Ser Ser Gln Ser Asn Cys Pro Ser Leu Asn Ile Gly

500 505 510

Gln Val Ser Asp Val Asn Leu Gln Thr Thr Gly Val Leu Ser Gly Tyr

515 520 525

Ile Pro Pro Ala Ser Val Ser Pro Ser Val Ser Ser Cys Ser Val Asn

530 535 540

Ala Asp Asn Cys Ala Ser Gln Gln Val Gln Thr Ser Ser Met Thr Phe

545 550 555 560

Lys Ala Ser Arg His Leu Pro Gly Phe Val His Ser Thr Ser Asn Ile

565 570 575

Pro Asp Pro Tyr Gly Ser Thr Lys Ser Gly Asp Leu Leu Asn Gln Glu

580 585 590

Pro Phe Asn Asn Leu Gly Tyr Ile Asn Lys Gly Thr Cys Leu Pro Ala

595 600 605

Lys Phe Ala Val Asp Glu Phe Gln Ser His Leu Ser Ser Ser Ser His

610 615 620

Gly Lys Val Phe Ser Glu Asn Ile Gly Thr Arg Val Lys Gln Glu Pro

625 630 635 640

Ser Met Glu Phe Gly Asp Asn Ala Lys Val Gly Ile Pro Met Leu Gln

645 650 655

Gln Phe Arg Pro Asn Asp Leu Met Ser Val Phe Thr Glu

660 665

<210> 32

<211> 681

<212> PRT

<213> 可可

<400> 32

Met Asn Ser Ser Ser Gly Lys Gly Ser Met Ser Ala Ala Ser Ser Ser

1 5 10 15

Ala Ala Trp Lys Ala Gly Asp Val Val Pro Asp Gln Phe Pro Ala Gly

20 25 30

Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu

35 40 45

Glu Lys Met Leu Arg Thr Cys Leu Tyr Glu Val Thr Lys Cys Asn Arg

50 55 60

Ala Glu Thr Ala Leu Ser Leu Leu Arg Glu Asn Lys Asn Gly Phe Asp

65 70 75 80

Ile Val Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu

85 90 95

Leu Glu His Ile Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser

100 105 110

Ala Asp Asp Gly Lys His Val Val Met Lys Gly Val Thr His Gly Ala

115 120 125

Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu Lys Asn Ile

130 135 140

Trp Gln His Val Val Arg Lys Arg Lys Asn Glu Trp Lys Asp Phe Glu

145 150 155 160

Gln Ser Gly Ser Val Glu Glu Gly Asp Arg Gln Pro Lys Gln Ser Glu

165 170 175

Glu Ala Asp Tyr Ser Ser Ser Ala Asn Glu Gly Asn Trp Lys Ser Ser

180 185 190

Lys Lys Arg Lys Asp Asp Asp Asp Glu Ala Glu Glu Arg Asp Asp Thr

195 200 205

Ser Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln

210 215 220

Gln Phe Val Ala Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro

225 230 235 240

Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn

245 250 255

Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu Ser

260 265 270

Gly Val Ser Gln His Gln Ser Asn Leu Asn Asn Ser Phe Met Ser Pro

275 280 285

Gln Glu Ala Thr Phe Gly Pro Leu Ser Pro Leu Asn Gly Leu Asp Leu

290 295 300

Gln Thr Leu Ala Ala Thr Gly Gln Leu Pro Ala Gln Ser Leu Ala Thr

305 310 315 320

Phe Gln Ala Ala Gly Leu Gly Arg Ser Thr Ala Lys Ser Gly Ile Ala

325 330 335

Met Pro Leu Val Asp Gln Arg Asn Ile Phe Ser Phe Glu Asn Pro Lys

340 345 350

Leu Arg Phe Gly Glu Gly Gln Gln Gln His Met Asn Asn Asn Lys Gln

355 360 365

Leu Asn Leu Leu His Gly Ile Pro Thr Thr Met Glu Pro Lys Gln Leu

370 375 380

Ala Ser Leu His His Ser Ala Gln Ser Ile Gly Asn Ile Asn Met Gln

385 390 395 400

Val Thr Ser His Gly Val Gln Gly Ser Gln Asn Asn Ser Leu Leu Ile

405 410 415

Gln Met Ala Gln Pro Gln Pro Arg Gly Gln Ile Leu Asn Asp Ser Thr

420 425 430

Gly Ser His Ala Pro Arg Leu Pro Ser Thr Leu Gly Gln Pro Ile Leu

435 440 445

Ser Asn Gly Ile Ala Ala Asn Val Ser Thr Arg Asn Gly Ile Pro Glu

450 455 460

Asn Ile Arg Gly Pro Gly Tyr Asn Pro Val Ser Gln Thr Ser Ser Leu

465 470 475 480

Leu Asn Phe Pro Met Asn His Thr Ser Glu Leu Pro Gly Asn Ser Phe

485 490 495

Pro Leu Gly Thr Thr Pro Gly Ile Ser Ser Leu Thr Ser Lys Gly Ala

500 505 510

Phe Gln Glu Asp Ile Asn Ser Asp Val Lys Gly Ser Gly Gly Phe Met

515 520 525

Pro Ser Tyr Asp Ile Phe Asn Asp Leu Asn Gln His Lys Pro Gln Asn

530 535 540

Trp Glu Leu Gln Asn Val Gly Met Thr Phe Asp Ala Ser Gln His Ser

545 550 555 560

Asn Ser Leu Gln Gly Asn Leu Asp Leu Ala Gln Ser Ile Leu Val Gln

565 570 575

Gln Gly Phe Ser Ser Gly Gln Met Asn Gly Gln Asn Arg Ser Ala Ala

580 585 590

Val Val Ser Lys Ala Met Phe Ser Ala Gly Asp Cys Thr Glu Gln Gly

595 600 605

Asn Ala Gln Asn Val Asn His His Leu Asn Asn Leu Leu Val Asp Asn

610 615 620

Thr Ile Arg Ile Lys Ser Glu Arg Val Ala Asp Ala Gly Pro Ala Asn

625 630 635 640

Leu Phe Pro Asp His Phe Gly Gln Glu Asp Leu Met Ser Ala Leu Leu

645 650 655

Lys Gln Gln Asp Gly Ile Ala Pro Ala Glu Asn Glu Phe Asp Phe Asp

660 665 670

Gly Tyr Ser Met Asp Asn Ile Pro Val

675 680

<210> 33

<211> 579

<212> PRT

<213> 菜豆

<400> 33

Met Asn Leu Ser Asn Gly Lys Gly Ser Met Ser Thr Val Thr Thr Thr

1 5 10 15

Ala Val Met Lys Ser Gly Asp Ala Val Ser Asp Gln Phe Pro Ala Gly

20 25 30

Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu

35 40 45

Glu Lys Met Leu Arg Thr Cys Leu Tyr Glu Val Thr Lys Cys Asn Arg

50 55 60

Ala Glu Thr Ala Leu Ser Leu Leu Arg Glu Asn Lys Asn Gly Phe Asp

65 70 75 80

Ile Val Ser Ala Asn Glu Gly Ser Trp Arg Asn Ser Lys Lys Arg Arg

85 90 95

Asp Glu Glu Glu Glu Ala Glu Asp Arg Asp Asp Thr Ser Thr Leu Lys

100 105 110

Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln Gln Phe Val Ala

115 120 125

Ala Val Asp Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile Leu

130 135 140

Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His

145 150 155 160

Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu Ser Gly Val Ser Gln

165 170 175

His Gln Asn Asn Leu Asn Asn Ser Phe Leu Gly Ser Gln Glu Ala Thr

180 185 190

Phe Gly Thr Ile Ser Ser Ile Asn Gly Ile Asp Leu Gln Thr Leu Ala

195 200 205

Val Thr Gly Gln Leu Pro Ala Gln Ser Leu Ala Thr Leu Gln Ala Ala

210 215 220

Gly Leu Gly Arg Ser Thr Ala Lys Thr Gly Val Pro Met Pro Leu Met

225 230 235 240

Asp Gln Arg Asn Leu Phe Ser Phe Glu Asn Pro Arg Val Arg Phe Gly

245 250 255

Glu Gly Gln Gln Gln His Leu Ser Ser Ser Lys Pro Met Asn Leu Leu

260 265 270

Leu Gly Ile Pro Thr Asn Met Glu Pro Lys Gln Leu Ala Asn Leu His

275 280 285

Gln Ser Thr Gln Ser Ile Ala Ser Leu Asn Met Arg Val Asn Ala Ser

290 295 300

Ala Thr Gln Gly Asn Pro Leu Met Met Gln Met Pro Gln Ser Gln Pro

305 310 315 320

Arg Gly Gln Met Leu Ser Glu Asn Thr Gly Pro Arg Val Pro Arg Leu

325 330 335

Pro Ser Ser Leu Gly Gln Pro Thr Val Ser Asn Gly Ile Ser Asn Gly

340 345 350

Phe Leu Gly Arg Asn Gly Ile Ala Gly Asn Asn Arg Gly Pro Ala Tyr

355 360 365

Asn Pro Val Pro Pro Asn Ser Ser Leu Leu Ser Phe Pro Met Asn Gln

370 375 380

Ser Ser Glu Val Ser Val Asn Asn Ser Leu Pro Leu Gly Ser Ser Pro

385 390 395 400

Gly Ile Ser Ser Ile Thr Thr Lys Gly Ser Phe Gln Glu Glu Val Thr

405 410 415

Ser Gly Ile Lys Ala Thr Gly Gly Phe Pro Ser Tyr Asp Ile Phe Asn

420 425 430

Glu Leu His His Gln Lys Ser His Asp Trp Glu Ile Thr Asn Pro Ser

435 440 445

Leu Thr Tyr Ser Ala Ser His His Ala Asn Pro Leu Gln Gly Asn Ile

450 455 460

Asp Val Ser Pro Ser Val Leu Val His Gln Gly Phe Ser Ser Thr Gln

465 470 475 480

Gln Asn Gly Gln Ser Arg Asp Ala Thr Leu Ile Gly Lys Ala Met Phe

485 490 495

Ser Leu Gly Glu Gly Ser Glu Gln Asp Asn Leu Gln Asn Ala Val Gln

500 505 510

His Leu His Pro Leu Leu Val Asp Asn Ser Ile Arg Val Lys Ala Glu

515 520 525

Arg Ile Pro Asp Ala Ser Ser Gln Thr Asn Leu Phe Pro Asp His Tyr

530 535 540

Val Gln Glu Asp Leu Met Ser Ala Leu Leu Lys Gln Gln Glu Gly Met

545 550 555 560

Gly Pro Ala Glu Ser Glu Phe Glu Phe Asp Ala Tyr Ser Leu Asp Asn

565 570 575

Ile Pro Val

<210> 34

<211> 679

<212> PRT

<213> 大豆

<400> 34

Met Asn Leu Ser Asn Gly Lys Gly Ser Met Ser Thr Leu Thr Ala Ser

1 5 10 15

Val Val Met Lys Ser Gly Asp Ala Val Ser Asp Gln Phe Pro Ala Gly

20 25 30

Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu

35 40 45

Glu Lys Met Leu Arg Thr Cys Leu Tyr Glu Val Thr Lys Cys Asn Arg

50 55 60

Ala Glu Thr Ala Leu Ser Leu Leu Arg Glu Asn Lys Asn Gly Phe Asp

65 70 75 80

Ile Val Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu

85 90 95

Leu Glu His Ile Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser

100 105 110

Ala Asp Asp Gly Lys Ser Val Val Met Lys Gly Val Thr His Gly Ala

115 120 125

Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu Lys Asn Ile

130 135 140

Trp Gln His Val Val Arg Lys Arg Lys Asn Glu Trp Lys Asp Ala Glu

145 150 155 160

Gln Ser Gly Ser Ala Glu Glu Gly Asp Arg Gln Pro Lys Ala Ser Asp

165 170 175

Glu Ala Asp Tyr Ser Ser Ser Ala Asn Glu Gly Ser Trp Arg Asn Ser

180 185 190

Lys Lys Arg Arg Asp Glu Glu Glu Glu Ala Glu Asp Arg Asp Asp Thr

195 200 205

Ser Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln

210 215 220

Gln Phe Val Ala Ala Val Asp Gln Leu Gly Ile Asp Lys Ala Val Pro

225 230 235 240

Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn

245 250 255

Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu Ser

260 265 270

Gly Val Ser Gln His Gln Asn Asn Met Asn Asn Ser Phe Leu Ser Pro

275 280 285

Gln Glu Ala Thr Phe Gly Thr Ile Ser Ser Ile Asn Gly Ile Asp Leu

290 295 300

Gln Thr Leu Ala Val Ala Gly Gln Leu Pro Ala Gln Ser Leu Ala Thr

305 310 315 320

Leu Gln Ala Ala Gly Leu Gly Arg Pro Thr Gly Lys Ala Gly Val Pro

325 330 335

Met Pro Leu Met Asp Gln Arg Asn Leu Phe Ser Phe Glu Asn Pro Arg

340 345 350

Leu Arg Phe Gly Glu Gly Gln Gln Gln His Leu Ser Thr Ser Lys Pro

355 360 365

Met Asn Leu Leu His Gly Ile Pro Thr Asn Met Glu Pro Lys Gln Leu

370 375 380

Ala Asn Leu His Gln Ser Thr Gln Ser Ile Gly Ser Leu Asn Met Arg

385 390 395 400

Val Asn Ala Ser Ala Thr Gln Gly Ser Pro Leu Leu Met Gln Met Ala

405 410 415

Gln Ser Gln Pro Arg Gly Gln Met Leu Ser Glu Asn Ile Gly Pro Arg

420 425 430

Val Pro Arg Leu Pro Ser Ser Leu Gly Gln Pro Thr Val Ser Asn Gly

435 440 445

Ile Ser Asn Gly Leu Leu Gly Arg Asn Gly Ile Ala Gly Asn Asn Arg

450 455 460

Gly Pro Ala Tyr Asn Pro Val Pro Pro Ser Ser Ser Leu Leu Ser Phe

465 470 475 480

Pro Met Asn Gln Thr Ser Glu Met Ser Val Asn Asn Ser Phe Pro Leu

485 490 495

Gly Ser Thr Pro Gly Ile Ser Ser Ile Thr Thr Lys Gly Ser Phe Gln

500 505 510

Glu Glu Val Thr Ser Gly Ile Lys Gly Ser Gly Gly Phe Pro Ser Tyr

515 520 525

Asp Ile Phe Asn Glu Leu His His Gln Lys Pro His Asp Trp Glu Ile

530 535 540

Thr Asn Pro Asn Leu Thr Tyr Asn Ala Ser Gln His Ala Asn Pro Leu

545 550 555 560

Gln Gly Asn Ile Asp Val Thr Pro Ser Val Leu Val His Gln Gly Phe

565 570 575

Ser Ser Thr Gln Gln Thr Gly Gln Ser Arg Asp Ala Ala Leu Ile Gly

580 585 590

Lys Ala Met Phe Ser Met Gly Glu Gly Leu Glu Gln Asn Asn Phe Gln

595 600 605

Asn Ala Ser Gln Asn Leu Asn Ser Leu Leu Leu Asp Asn Ser Ile Arg

610 615 620

Val Lys Ala Glu Arg Ile Pro Asp Ala Ser Ser Gln Thr Asn Leu Phe

625 630 635 640

Pro Glu His Tyr Gly Gln Glu Asp Leu Met Ser Ala Leu Leu Lys Gln

645 650 655

Gln Glu Gly Met Gly Pro Ser Glu Asn Glu Phe Asp Phe Asp Gly Tyr

660 665 670

Ser Leu Asp Asn Ile Pro Val

675

<210> 35

<211> 668

<212> PRT

<213> 藜麦

<400> 35

Met Asn Leu Gly Gly Gly Leu Met Gly Ser Met Ala Met Pro Ser Ser

1 5 10 15

Thr Val Ser Arg Lys Ser Ser Glu Val Val Thr Ala Asp Gln Phe Pro

20 25 30

Val Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Thr

35 40 45

Ile Leu Glu Lys Met Leu Arg Thr Cys Arg Tyr Glu Val Thr Lys Thr

50 55 60

Asn Arg Ala Glu His Ala Leu Asn Met Leu Arg Glu Asn Lys Asn Gly

65 70 75 80

Phe Asp Val Val Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe

85 90 95

Lys Leu Leu Glu Gln Val Gly Leu Glu Met Asp Leu Pro Val Ile Met

100 105 110

Met Ser Ala Asp Asp Ser Lys Gln Val Val Met Lys Gly Val Thr His

115 120 125

Gly Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu Lys

130 135 140

Asn Ile Trp Gln His Val Val Arg Lys Lys Lys Tyr Glu Tyr Asn Lys

145 150 155 160

Asp Val Glu Gln Ser Gly Ser Trp Asp Glu Gly Asp Arg Gln Leu Lys

165 170 175

His Asp Asp Ala Val Ser Ser Pro Ala Asn Asp Gly Ser Trp Lys Asn

180 185 190

Ser Lys Arg Lys Ser Gly Glu Asp Asp Glu Ala Asp Asp Lys Asp Asp

195 200 205

Thr Thr Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His

210 215 220

Gln Gln Phe Val Ala Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val

225 230 235 240

Pro Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu

245 250 255

Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu

260 265 270

Ser Gly Val Ser Gln His Gln Gly Gly Leu Asn Ser Ser Phe Met Pro

275 280 285

Gln Asp Pro Ser Phe Ser Thr Met Ser Ser Leu Gly Gly Ile Asp Leu

290 295 300

Gln Thr Leu Ala Ala Thr Gly Gln Leu Ser Ala Gln Thr Leu Ala Ala

305 310 315 320

Tyr Thr Arg Leu Pro Pro Thr Ile Lys Pro Gly Ile Ser Met Pro Phe

325 330 335

Val Asp Gln Arg Asn Leu Phe Ser Phe Glu Asn Ser Lys Leu Arg Tyr

340 345 350

Gly Asp Gly Gln Gln Ser Gln Ile Ser Asn Val Ser Lys Gln Met Asn

355 360 365

Leu Leu His Gly Phe Pro Thr Thr Met Glu Pro Lys Gln Leu Ala Val

370 375 380

Leu Asn Gln Ser Ala Gln Thr Leu Gly Ser Met Asn Met Gln Ala Asn

385 390 395 400

Ala Ser Ser Ser His Gln Ser Ser Ser Leu Leu Met Gln Gln Met Val

405 410 415

Pro Gln Gln Arg Gly His Ile Ser Asn Glu Ser Ile Ser Ser Gln Val

420 425 430

Pro Arg Ile Gln Pro Ser Val Gly Gln Pro Leu Gln Ser Asn Gly Asn

435 440 445

Ala Asn Ala Val Leu Ser Arg Asn Gly Ile Pro Tyr Asp Pro Val Asn

450 455 460

Gln Ser Ala Ser Val Val Asp Phe Ser Val Asn His Ile Pro Glu Leu

465 470 475 480

Pro Gly Asn Ser Phe Pro Leu Gly Ser Thr Pro Gly Ile Thr Ser Ile

485 490 495

Thr Ser Lys Gly Phe Asn Gln Glu Glu Ile Gly Ser Asp Ile Lys Val

500 505 510

Ser Arg Gly Phe Val Gly Ser Tyr Asp Met Phe Ser Glu Leu Gln His

515 520 525

Lys Pro Gln Glu Trp Gln Met Gln Asn Pro Asn Met Gly Phe Ala Gly

530 535 540

Ser Ser Gln His Val Pro Ser Val Gln Ser Gly Val Asn Val Ala Pro

545 550 555 560

Ser Ile Met Val Asn Gln Ser Tyr Val Ser Gly Gln Lys Asn Glu Gln

565 570 575

Asn Gly His Ser Met Ala Gly Lys Pro Met Tyr Ser Ala Gly Leu Glu

580 585 590

Asn Gln His Met Gly Met Gln Asn Val Asn Gln Asn Tyr Asn Ser Ile

595 600 605

His Val Asn Asn Ser Ser Arg Val Lys Ala Glu Ser Val Ser Asp Val

610 615 620

Val Asn Leu Gly Ala Asn Leu Phe Asp Tyr Ser Pro Glu Asp Met Leu

625 630 635 640

Ser Thr Ile Met Leu Lys Gln Gln Glu Gly Ile Gly Ser Gly Asp Phe

645 650 655

Asp Phe Asp Gly Tyr Thr Leu Asp Asn Ile Pro Val

660 665

<210> 36

<211> 670

<212> PRT

<213> 苹果

<220>

<221> MOD_RES

<222> (195)..(195)

<223> 任何氨基酸

<220>

<221> MOD_RES

<222> (215)..(215)

<223> 任何氨基酸

<220>

<221> MOD_RES

<222> (530)..(530)

<223> 任何氨基酸

<220>

<221> MOD_RES

<222> (540)..(540)

<223> 任何氨基酸

<220>

<221> MOD_RES

<222> (579)..(579)

<223> 任何氨基酸

<400> 36

Met Ala Ala Leu Gln Arg Val Ala Gln Ser Ser Val Ser Thr Thr Ala

1 5 10 15

Ser Ser Tyr Gly Ser Cys Lys Val Gly Gly Gly Val Leu Ser Pro Ser

20 25 30

Ala Gly Ile Glu Met Ala Val Pro Asn Gln Phe Pro Ala Gly Leu Arg

35 40 45

Val Leu Val Val Asp Asp Asp Thr Thr Cys Leu Arg Ile Leu Glu Leu

50 55 60

Met Leu Leu Arg Cys Leu Tyr Gln Val Thr Thr Cys Ser Glu Ala Thr

65 70 75 80

Val Ala Leu Asn Leu Leu Arg Glu Arg Lys Asp Cys Phe Asp Val Val

85 90 95

Leu Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu

100 105 110

His Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp

115 120 125

Gly Arg Thr Ser Val Val Met Arg Gly Ile Arg His Gly Ala Cys Asp

130 135 140

Phe Leu Ile Lys Pro Ile Ser Glu Ala Glu Leu Lys Asn Ile Trp Gln

145 150 155 160

His Val Val Arg Lys Lys Trp Asn Gly Ser Lys Glu Leu Glu His Ser

165 170 175

Gly Ser Leu Glu Asp Asn Asp Pro His Lys Arg Gly Asn Asn Asp Phe

180 185 190

Glu Tyr Xaa Ser Ser Val Asn Glu Gly Thr Glu Val Ser Leu Lys Gly

195 200 205

His Lys Lys Arg Ile Asn Xaa Lys Glu Asp Asp Asp Gly Asp Thr Glu

210 215 220

Asn Asp Asp Leu Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser Val

225 230 235 240

Glu Leu His Gln Gln Phe Val Thr Ala Val Asn Gln Leu Gly Leu Asp

245 250 255

Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu

260 265 270

Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu

275 280 285

Lys Arg Leu Ser Gly Val Ala Gln Gln Gln Ser Gly Ile Ala Asn Pro

290 295 300

Leu Cys Gly Pro Val Asp Ser Asn Gly Lys Leu Gly Ser Leu Ser Arg

305 310 315 320

Phe Asp Phe Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr

325 330 335

Leu Ala Ala Leu Gln Ala Glu Leu Leu Gly Gln Pro Ala Gly Asn Leu

340 345 350

Val Pro Ala Met Asp Gln Pro Ala Leu Leu His Ala Ser Leu Gln Ala

355 360 365

Pro Lys Arg Pro Pro Val Glu His Gly Val Pro Phe Met Gln Pro Phe

370 375 380

Val Lys Ser Gln Ser Asn Val Ser Lys His Phe Pro Gln Ser Val Ile

385 390 395 400

Ser Ala Glu Asp Ala Ser Leu Gly Phe Gly Gln Trp Arg Ser Asn Ser

405 410 415

Arg Ser Thr Val Ala Pro Ser Asn Asp His Gly Gly Leu Ser Thr Gln

420 425 430

Asn Ser Asn Leu Leu Met Gly Ile Val Pro Gln Glu Gln Arg Gln His

435 440 445

Lys Arg Thr Gln Gln Gln Ser Val Leu Thr Glu Pro Ser Arg Ser Phe

450 455 460

Asn Val Gln Pro Ser Cys Leu Val Val Pro Ser Gln Ser Ser Thr Gly

465 470 475 480

Phe Gln Ala Gly Asn Ser Pro Ala Ser Val Asn Gln Ser Ser Ser Phe

485 490 495

Asn Arg Ser Thr Val Val Asp Tyr Ser Leu Pro Ser Asp Gln Ser Asn

500 505 510

Asn Ser Leu Asn Val Gly His Ile Pro Thr Gly Asn Pro Lys Thr Ser

515 520 525

Gly Xaa Leu Gly Gly Tyr Ser Gly Pro Gly Ser Xaa Cys Ala Thr Ser

530 535 540

Cys Leu Val Asn Ala Asp Asn Ser Thr Ser Tyr Gln Asn Ser Thr Ala

545 550 555 560

Thr Phe Ser Asp Ser Arg Glu Leu Pro Gly Phe Leu His Asn Thr Ala

565 570 575

Asn Ser Xaa Gly Phe Tyr Val Asp Lys Ser Gly Glu Met Leu Asp Gln

580 585 590

Gly Pro Leu Arg Asn Leu Gly Phe Val Gly Lys Glu Thr Cys Ile Pro

595 600 605

Ser Arg Phe Ala Val Asp Asp Phe Glu Ser Gln Met Ser Asn Leu Asn

610 615 620

Pro Gly Arg Ile His Val Glu Ser Ser Gly Thr Leu Val Lys Gln Glu

625 630 635 640

Pro Ser Glu Asp Tyr Val Asp Asn Ala Lys Leu Gly Ile Pro Ile Leu

645 650 655

His Gln Tyr Ser Ser Ser Asp Phe Met Ser Pro Phe Ala Asp

660 665 670

<210> 37

<211> 802

<212> PRT

<213> 玉米

<400> 37

Pro Tyr Pro Thr His Thr Leu Leu Pro Gln Pro His Leu Ser Leu Ser

1 5 10 15

Ala Cys Val Leu Leu Val Leu Leu Ser Leu Ser Ser Pro Ala Leu Thr

20 25 30

Ser Pro Pro Phe Pro Ala Val Ser Trp Ile Ser Arg Ile Gln Thr Thr

35 40 45

Ala Leu Val Ser Leu Pro Ser Cys Leu Leu Pro Ala Tyr Val Gln Glu

50 55 60

Gly Pro Cys Leu Gly Asp Pro Gly Ala Trp Phe Leu Gly Ser Ala Ala

65 70 75 80

Ser Ala Ala Val Gly Phe Ala Glu Pro Glu Pro Pro Glu Met Thr Val

85 90 95

Asp Glu Leu Lys Leu Gln Ala Arg Ala Ser Gly Gly His Gly Ala Lys

100 105 110

Asp Gln Phe Pro Val Gly Met Arg Val Leu Ala Val Asp Asp Asp Pro

115 120 125

Thr Cys Leu Lys Ile Leu Glu Asn Leu Leu Leu Arg Cys Gln Tyr His

130 135 140

Val Thr Thr Thr Gly Gln Ala Ala Thr Ala Leu Lys Leu Leu Arg Glu

145 150 155 160

Lys Lys Asp Gln Phe Asp Leu Val Ile Ser Asp Val His Met Pro Asp

165 170 175

Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly Leu Glu Met Asp Leu

180 185 190

Pro Val Ile Met Leu Ser Ala Asn Gly Glu Thr Gln Thr Val Met Lys

195 200 205

Gly Ile Thr His Gly Ala Cys Asp Tyr Leu Leu Lys Pro Val Arg Ile

210 215 220

Glu Gln Leu Arg Thr Ile Trp Gln His Val Val Arg Arg Arg Ser Cys

225 230 235 240

Asp Ala Lys Asn Ser Gly Asn Asp Asn Asp Asp Ser Gly Lys Lys Leu

245 250 255

Gln Val Val Ser Ala Glu Gly Asp Asn Gly Gly Val Asn Arg Asn Lys

260 265 270

Arg Ile Ser Arg Lys Gly Arg Asp Asp Asn Gly Asp Asp Gly Asp Asp

275 280 285

Ser Asp Asp Asn Ser Asn Glu Asn Gly Asp Ser Ser Ser Gln Lys Lys

290 295 300

Pro Arg Val Val Trp Ser Val Glu Leu His Arg Lys Phe Val Ala Ala

305 310 315 320

Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Asp

325 330 335

Leu Met Asn Val Glu Asn Ile Thr Arg Glu Asn Val Ala Ser His Leu

340 345 350

Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Ser Ala Asp Ala Ser Arg

355 360 365

Gln Ala Asn Leu Thr Ala Ala Phe Gly Gly Arg Asn Pro Ala Tyr Val

370 375 380

Asn Met Gly Leu Asp Ala Phe Arg Gln Tyr Asn Ala Tyr Gly Arg Tyr

385 390 395 400

Arg Pro Val Pro Thr Thr Asn His Ser Gln Pro Asn Asn Leu Leu Ala

405 410 415

Arg Met Asn Ser Pro Ala Phe Gly Met His Gly Leu Leu Pro Ser Gln

420 425 430

Pro Leu Gln Ile Gly His Asn Gln Asn Asn Leu Ser Thr Ser Leu Gly

435 440 445

Asn Val Gly Gly Met Asn Asn Gly Asn Leu Ile Arg Gly Ala His Met

450 455 460

Pro Leu Gln Asp Thr Ser Lys Cys Phe Pro Thr Gly Pro Ser Gly Asn

465 470 475 480

Ser Phe Ala Asn Ile Ser Asn Ser Thr Gln Leu Val Thr Thr Asn Asn

485 490 495

Leu Pro Leu Gln Ser Leu Glu Pro Ser Asn Gln Gln His Leu Gly Arg

500 505 510

Leu His Ser Ser Ala Asp Pro Phe Asn Ser Phe Val Gly Glu Pro Pro

515 520 525

Gln Phe Ala Asp Leu Gly Arg Cys Asn Thr Thr Trp Pro Thr Ala Val

530 535 540

Ser Ser Ser Asn Val Gln Glu Ile Gly Gln Lys Asp Arg Ile Val Asn

545 550 555 560

Arg Pro Lys Leu Glu Pro Leu Ser Ser Phe Thr Glu Ala Ser Ser Gln

565 570 575

Ile Pro Leu Leu Gly Asn Glu Met Gln Ser His Gln Val Ala Ser Leu

580 585 590

Ala Ser Asn Gly Leu Pro Met Pro Phe Thr Gln Glu Ala Val Pro Phe

595 600 605

Ala Tyr Gly Ser Ser Thr Asn Ser Arg Glu Met Leu Asn Asn Asn Leu

610 615 620

Ala Leu Ser Asn Ser Gly Val Asn Ser Thr Leu Pro Asn Leu Arg Ile

625 630 635 640

Asp Gly Ser Val Val Pro Gly Gln Thr Leu Gly Gly Ser Asn Ser Gly

645 650 655

Gly Cys Val Val Pro Pro Leu Gln Asp Gly Arg Ile Asp His Gln Ala

660 665 670

Val Ser Ser His Leu Asn Tyr Asn Asn Glu Leu Met Gly Thr Gly Arg

675 680 685

Leu Gln Arg Gly Leu Ser Gly Gly Leu Asp Asp Ile Val Val Asp Met

690 695 700

Phe Arg Pro Asp Arg Ala Asp Asp Gly Val Ser Phe Ile Asp Gly Asp

705 710 715 720

Trp Glu Leu Arg Pro Gly Ser Ser Val Thr Ser Glu Tyr Gln Leu Cys

725 730 735

Gly Ile Cys Tyr Leu Asn Ser Tyr Asp Tyr Val Phe Lys Ser Gly Val

740 745 750

Asn Cys Gly Tyr Arg Asp Ile Gln His Val Tyr Glu Pro Arg Asn Asp

755 760 765

Val Leu Phe Pro Leu Gly Asn Arg Phe Ala Val Pro Phe Val Asp Cys

770 775 780

His Cys Ile Val Ala Ser Leu Ala Glu Thr Glu Val Lys Gly Lys Asp

785 790 795 800

Gln Ala

<210> 38

<211> 591

<212> PRT

<213> 芜菁

<400> 38

Met Leu Asn Pro Gly Val Val Gly Gly Ser Ser Asn Ser Asp Pro Phe

1 5 10 15

Pro Ser Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu

20 25 30

Met Ile Leu Glu Arg Met Leu Lys Thr Cys Leu Tyr Arg Val Thr Lys

35 40 45

Cys Asn Arg Ala Glu Ile Ala Leu Ser Leu Leu Arg Lys Asn Lys Asn

50 55 60

Gly Phe Asp Ile Val Ile Ser Asp Val His Met Pro Asp Met Asn Gly

65 70 75 80

Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val Ile

85 90 95

Met Met Ser Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val Thr

100 105 110

His Gly Ala Val Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu

115 120 125

Lys Asn Ile Trp Gln His Val Val Arg Lys Lys Gln Asn Val Ser Glu

130 135 140

His Ser Gly Ser Val Glu Glu Thr Gly Gly Asp Arg Gln Gln Gln Gln

145 150 155 160

Arg Gly Asp Asp Asp Asp Asp Gly Asn Asn Ser Ser Ser Gly Asn Asn

165 170 175

Glu Gly Asn Leu Arg Lys Arg Lys Glu Glu Glu Gln Gly Asp Asp Lys

180 185 190

Glu Asp Thr Ser Ser Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu

195 200 205

Leu His Gln Gln Phe Val Ala Ala Val Asn His Leu Gly Val Asp Lys

210 215 220

Ala Val Pro Lys Lys Ile Leu Glu Met Met Asn Val Gln Gly Leu Thr

225 230 235 240

Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr Leu Lys

245 250 255

Arg Leu Gly Gly Val Ser Gln Gly Asn Met Asn His Ser Phe Leu Thr

260 265 270

Gly Gln Asp Pro Ser Tyr Gly Pro Leu Asn Gly Phe Asp Leu Gln Gly

275 280 285

Leu Ala Thr Ala Gly Gln Leu Gln Ala Gln Ser Leu Ala Gln Leu Gln

290 295 300

Ala Val Gly Leu Gly Gln Ser Ser Ser Pro Leu Ile Lys Pro Gly Ile

305 310 315 320

Thr Ser Val Asp Gln Arg Ser Phe Phe Thr Phe Gln Asn Ser Lys Ser

325 330 335

Arg Phe Gly Asp Gly His Gly Pro Met Met Met Asn Gly Gly Gly Gly

340 345 350

Asn Lys Gln Thr Ser Leu Leu His Gly Val Pro Thr Gly His Met Arg

355 360 365

Leu Gln Gln Gln Gln Met Ala Gly Met Arg Val Ala Gly Pro Ser Met

370 375 380

Gln Gln Gln Gln Gln Gln Ser Met Leu Ser Arg Arg Ser Val Pro Glu

385 390 395 400

Thr Arg Ser Ser Arg Val Leu Pro Ala Ala Thr His Ser Ala Leu Asn

405 410 415

Asn Ser Phe Pro Leu Ala Ser Ala Pro Gly Met Met Ser Val Ser Asp

420 425 430

Thr Lys Gly Val Asn Glu Phe Cys Asn Pro Ser Tyr Asp Ile Leu Asn

435 440 445

Asn Phe Pro Gln Gln Gln His His Asn Asn Asn Asn Asn Arg Val Asn

450 455 460

Glu Trp Asp Leu Arg Asn Val Gly Met Val Phe Asn Ser His Gln Asp

465 470 475 480

Asn Thr Thr Ser Ala Ala Phe Ser Thr Ser Glu Ala Tyr Ser Ser Ser

485 490 495

Ser Thr His Lys Arg Lys Arg Glu Ala Glu Leu Val Val Glu His Gly

500 505 510

Gln Asn Gln Gln Gln Pro Gln Ser Arg Ser Val Lys Pro Met Asn Gln

515 520 525

Thr Tyr Met Asp Gly Gly Gly Ser Val Arg Met Lys Thr Glu Thr Val

530 535 540

Thr Cys Pro Pro Gln Ala Thr Thr Met Phe His Glu Gln Tyr Ser Asn

545 550 555 560

Gln Asp Asp Leu Leu Ser Asp Leu Leu Lys Gln Glu Gly Leu Leu Asp

565 570 575

Thr Glu Phe Asp Phe Glu Gly Tyr Ser Phe Asp Ser Ile Leu Val

580 585 590

<210> 39

<211> 691

<212> PRT

<213> 稻

<400> 39

Met Ala Pro Val Glu Asp Gly Gly Gly Val Glu Phe Pro Val Gly Met

1 5 10 15

Lys Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Ala Val Leu Lys

20 25 30

Arg Met Leu Leu Glu Cys Arg Tyr Asp Ala Thr Thr Cys Ser Gln Ala

35 40 45

Thr Arg Ala Leu Thr Met Leu Arg Glu Asn Arg Arg Gly Phe Asp Val

50 55 60

Ile Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe Arg Leu Leu

65 70 75 80

Glu Leu Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala

85 90 95

Asp Ser Arg Thr Asp Ile Val Met Lys Gly Ile Lys His Gly Ala Cys

100 105 110

Asp Tyr Leu Ile Lys Pro Val Arg Met Glu Glu Leu Lys Asn Ile Trp

115 120 125

Gln His Val Ile Arg Lys Lys Phe Asn Glu Asn Lys Glu His Glu His

130 135 140

Ser Gly Ser Leu Asp Asp Thr Asp Arg Thr Arg Pro Thr Asn Asn Asp

145 150 155 160

Asn Glu Tyr Ala Ser Ser Ala Asn Asp Gly Ala Glu Gly Ser Trp Lys

165 170 175

Ser Gln Lys Lys Lys Arg Asp Lys Asp Asp Asp Asp Gly Glu Leu Glu

180 185 190

Ser Gly Asp Pro Ser Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser

195 200 205

Val Glu Leu His Gln Gln Phe Val Asn Ala Val Asn His Leu Gly Ile

210 215 220

Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly

225 230 235 240

Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr

245 250 255

Leu Lys Arg Ile Ala Gln His His Ala Gly Ile Ala Asn Pro Phe Cys

260 265 270

Pro Pro Ala Ser Ser Gly Lys Val Gly Ser Leu Gly Gly Leu Asp Phe

275 280 285

Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Ala Leu Ala Ala

290 295 300

Leu Gln Asp Glu Leu Leu Gly Arg Pro Thr Asn Ser Leu Val Leu Pro

305 310 315 320

Gly Arg Asp Gln Ser Ser Leu Arg Leu Ala Ala Val Lys Gly Asn Lys

325 330 335

Pro His Gly Glu Arg Glu Ile Ala Phe Gly Gln Pro Ile Tyr Lys Cys

340 345 350

Gln Asn Asn Ala Tyr Gly Ala Phe Pro Gln Ser Ser Pro Ala Val Gly

355 360 365

Gly Met Pro Ser Phe Ser Ala Trp Pro Asn Asn Lys Leu Gly Met Ala

370 375 380

Asp Ser Thr Gly Thr Leu Gly Gly Met Ser Asn Ser Gln Asn Ser Asn

385 390 395 400

Ile Val Leu His Glu Leu Gln Gln Gln Pro Asp Ala Met Leu Ser Gly

405 410 415

Thr Leu His Ser Leu Asp Val Lys Pro Ser Gly Ile Val Met Pro Ser

420 425 430

Gln Ser Leu Asn Thr Phe Ser Ala Ser Glu Gly Leu Ser Pro Asn Gln

435 440 445

Asn Thr Leu Met Ile Pro Ala Gln Ser Ser Gly Phe Leu Ala Ala Met

450 455 460

Pro Pro Ser Met Lys His Glu Pro Val Leu Ala Thr Ser Gln Pro Ser

465 470 475 480

Ser Ser Leu Leu Gly Gly Ile Asp Leu Val Asn Gln Ala Ser Thr Ser

485 490 495

Gln Pro Leu Ile Ser Ala His Gly Gly Gly Asn Leu Ser Gly Leu Val

500 505 510

Asn Arg Asn Pro Asn Val Val Pro Ser Gln Gly Ile Ser Thr Phe His

515 520 525

Thr Pro Asn Asn Pro Tyr Leu Val Ser Pro Asn Ser Met Gly Met Gly

530 535 540

Ser Lys Gln Pro Pro Gly Val Leu Lys Thr Glu Asn Ser Asp Ala Leu

545 550 555 560

Asn His Ser Tyr Gly Tyr Leu Gly Gly Ser Asn Pro Pro Met Asp Ser

565 570 575

Gly Leu Leu Ser Ser Gln Ser Lys Asn Thr Gln Phe Gly Leu Leu Gly

580 585 590

Gln Asp Asp Ile Thr Gly Ser Trp Ser Pro Leu Pro Asn Val Asp Ser

595 600 605

Tyr Gly Asn Thr Val Gly Leu Ser His Pro Gly Ser Ser Ser Ser Ser

610 615 620

Phe Gln Ser Ser Asn Val Ala Leu Gly Lys Leu Pro Asp Gln Gly Arg

625 630 635 640

Gly Lys Asn His Gly Phe Val Gly Lys Gly Thr Cys Ile Pro Ser Arg

645 650 655

Phe Ala Val Asp Glu Ile Glu Ser Pro Thr Asn Asn Leu Ser His Ser

660 665 670

Ile Gly Ser Ser Gly Asp Ile Met Ser Pro Asp Ile Phe Gly Phe Ser

675 680 685

Gly Gln Met

690

<210> 40

<211> 428

<212> PRT

<213> 卵囊藻属

<400> 40

Met Ala Leu Lys Arg Val Pro Ser Phe Ser Gly Arg Pro Asn Phe Pro

1 5 10 15

Ala Gly Leu Gln Ile Leu Val Val Asp Ser Asp Ser Ser Ser Arg Glu

20 25 30

Ala Val Glu Met Gln Leu Lys Ser His Ser Tyr Leu Ala Thr Cys Cys

35 40 45

Cys Thr Cys Gly Glu Ala Val Glu Gln Leu Gly Thr Ser Lys Tyr Asp

50 55 60

Ile Val Leu Ala Glu Ser Lys Leu Val Ala Ala Glu Cys Val Asp Ser

65 70 75 80

Thr Arg Leu Cys Glu Ala Ala Arg Ala Leu Pro Leu Val Leu Met Cys

85 90 95

Glu Asp Ser Thr Ala Asp Asp Val Leu Lys Gly Ile Arg Leu Gly Ala

100 105 110

Cys Asp Phe Leu Glu Lys Pro Leu Ser Pro Leu Lys Leu Lys Asn Ile

115 120 125

Trp Gln His Val Val Arg Lys Met Met Glu Gln Met His Val Arg Arg

130 135 140

Thr Asp Asp Ala Asp Thr Cys Thr Thr Lys Ser Ser Arg Asp Gln Ser

145 150 155 160

Cys Ala Ile Lys Gly Lys Ser Val Ala Ser Thr Pro Ser Cys Pro Lys

165 170 175

Thr Pro Ser Pro Ala Ala Ser Gly Ala Asp Ile Gly Cys Ser Ile Ala

180 185 190

Thr Ser Val Ser Lys Ala Gly Asp Val Val Gly Glu Ser Ser Ser Ser

195 200 205

Glu Thr Arg Lys Glu His Cys Ser Glu Thr Thr Glu Cys Ser Asp Leu

210 215 220

Lys Ser Cys Ala Ala Lys Ser Ala Val Ser Ala Gln Thr Pro Val Ser

225 230 235 240

Thr Ala Thr Val Ala Ala Thr Trp Gly Ala Ser Lys Lys Lys Ser Thr

245 250 255

Ala Ser Ala Thr Thr Ser Ser Val Ser Asn Arg Pro Pro Leu Ala Ile

260 265 270

Lys Met Pro Ala Pro Ala Val Ala Tyr Thr Ser Gly Leu Ala Pro Phe

275 280 285

Pro Pro Pro Met Phe Val Pro Gly Gly Trp Gly Gln Ser Ser Asn Pro

290 295 300

Cys Val Val Gly Thr Pro Met Met Pro Pro Pro Pro Gly Met Gly Met

305 310 315 320

Pro Pro His His His Ala Pro Tyr Gly Gln Val Pro Pro Pro Gly Tyr

325 330 335

Pro Val Ala Cys Met Pro Ser Ala Phe Val Pro Thr Pro Met Gly Pro

340 345 350

Gly Gly Val Ala Phe Ala Pro Pro Pro Gly Ala Ser Cys Thr Ser Ala

355 360 365

Ala Tyr Tyr Pro His Pro Ala Val Asp Ala Ser Ala Ser Ala Thr Ala

370 375 380

Thr Phe Thr Gly His Val Gln Ile Asp Leu Thr Asn Val Ser Ala Glu

385 390 395 400

Glu Pro Ala Pro Ile Gly Leu Ala Leu Arg Lys Thr Ala Ser Leu Leu

405 410 415

Asp Leu Val Ser Asp Arg Leu Gly Gln Arg Ala Cys

420 425

<210> 41

<211> 341

<212> PRT

<213> 四爿藻属

<400> 41

Met Leu Cys Pro Ala Val Gln Val Ala Thr Met Ala Thr Val Leu Ala

1 5 10 15

Ser Thr His Phe Ser Glu Arg Pro Ser Phe Pro Ala Asp Leu Glu Val

20 25 30

Leu Leu Leu Asp Ser Ala Thr Gln Gly Ala Glu Thr Ala Ser Lys Leu

35 40 45

Leu Leu Ser Cys Ser Tyr Arg Val Thr Val Cys Arg Ser Val Ser Glu

50 55 60

Ala Leu Ser His Met Ala Cys Lys Ala Phe Asp Val Val Leu Val Glu

65 70 75 80

Gln Lys Leu Phe Ser Gly Arg Asp Ala Ala Ala Ala Gln Leu Lys Ala

85 90 95

Leu Ala Gly Val Ile Pro Thr Val Val Leu Ser Asp Ser Gly Ser Ala

100 105 110

Lys Asp Thr Trp Ala Ala Ile Val Gly Gln Ala Ala Asp Val Leu Ile

115 120 125

Arg Pro Leu Thr Lys Gln Lys Leu Gln Thr Leu Trp Gln His Thr Val

130 135 140

Arg Met Gln Arg Ala Ala Ser Ser Ala Ser Ala Ala Thr Ser Met Val

145 150 155 160

Ala Lys Pro Val Ala Val Leu Ser Ser Ala Leu Lys Pro Ala Ala Ser

165 170 175

Ser Ala Ser Leu Asp Lys Gly Gln Lys Arg Lys Leu Lys Asp His Met

180 185 190

Met Gly Pro Ile Met Ala His Pro Gln Val Ser Asn Pro Gly Phe Ile

195 200 205

Trp Gly Ala Pro Val Met Gly Val Pro Ala Gly Gln Gln Ala Pro Gln

210 215 220

Lys Ser Glu Ala Pro Val Thr Pro Gln Lys Pro Gly Ser Glu Met His

225 230 235 240

Pro Glu Leu Asp Ala Thr Ser His Ile Ala Met Gly Ser Ser Asp Asn

245 250 255

Phe Asn Val Pro Val Tyr Glu Ser Gly Thr Asp Ser Gln Glu Ser Gln

260 265 270

Pro Thr Cys Asp Pro Thr Ser Leu Asp Asp Ile Asn Glu Asp Asp Tyr

275 280 285

Ala Phe Ile Asp Phe Ala Leu Ser Asp Ser Phe Pro Thr Val Glu Glu

290 295 300

Asp Glu Ile Leu Pro Pro Ile Gly Leu Ser Leu Lys Lys Ser Ser Ser

305 310 315 320

Leu Leu Asn Met Leu Asn Gly Val Leu Leu Ser Ala His Ser Val Pro

325 330 335

Leu Gln Leu Pro Gln

340

<210> 42

<211> 558

<212> PRT

<213> 拟南芥

<400> 42

Met Ser Ser Ser Glu Glu Val Val Glu Val Thr Val Val Lys Ala Pro

1 5 10 15

Glu Ala Gly Gly Gly Lys Leu Ser Arg Arg Lys Ile Arg Lys Lys Asp

20 25 30

Ala Gly Val Asp Gly Leu Val Lys Trp Glu Arg Phe Leu Pro Lys Ile

35 40 45

Ala Leu Arg Val Leu Leu Val Glu Ala Asp Asp Ser Thr Arg Gln Ile

50 55 60

Ile Ala Ala Leu Leu Arg Lys Cys Ser Tyr Arg Val Ala Ala Val Pro

65 70 75 80

Asp Gly Leu Lys Ala Trp Glu Met Leu Lys Gly Lys Pro Glu Ser Val

85 90 95

Asp Leu Ile Leu Thr Glu Val Asp Leu Pro Ser Ile Ser Gly Tyr Ala

100 105 110

Leu Leu Thr Leu Ile Met Glu His Asp Ile Cys Lys Asn Ile Pro Val

115 120 125

Ile Met Met Ser Thr Gln Asp Ser Val Asn Thr Val Tyr Lys Cys Met

130 135 140

Leu Lys Gly Ala Ala Asp Tyr Leu Val Lys Pro Leu Arg Arg Asn Glu

145 150 155 160

Leu Arg Asn Leu Trp Gln His Val Trp Arg Arg Gln Thr Ser Leu Ala

165 170 175

Pro Asp Ser Phe Pro Trp Asn Glu Ser Val Gly Gln Gln Lys Ala Glu

180 185 190

Gly Ala Ser Ala Asn Asn Ser Asn Gly Lys Arg Asp Asp His Val Val

195 200 205

Ile Gly Asn Gly Gly Asp Ala Gln Ser Ser Cys Thr Arg Pro Glu Met

210 215 220

Glu Gly Glu Ser Ala Asp Val Glu Val Ser Ala Arg Asp Ala Val Gln

225 230 235 240

Met Glu Cys Ala Lys Ser Gln Phe Asn Glu Thr Gln Leu Leu Ala Asn

245 250 255

Glu Leu Gln Ser Lys Gln Ala Glu Ala Ile Asp Phe Met Gly Ala Ser

260 265 270

Phe Arg Arg Thr Gly Arg Arg Asn Arg Glu Glu Ser Val Ala Gln Tyr

275 280 285

Glu Ser Arg Ile Glu Leu Asp Leu Ser Leu Arg Arg Pro Asn Ala Ser

290 295 300

Glu Asn Gln Ser Ser Gly Asp Arg Pro Ser Leu His Pro Ser Ser Ala

305 310 315 320

Ser Ala Phe Thr Arg Tyr Val His Arg Pro Leu Gln Thr Gln Cys Ser

325 330 335

Ala Ser Pro Val Val Pro Asp Gln Arg Lys Asn Val Ala Ala Ser Gln

340 345 350

Asp Asp Asn Ile Val Leu Met Asn Gln Tyr Asn Thr Ser Glu Pro Pro

355 360 365

Pro Asn Ala Pro Arg Arg Asn Asp Thr Ser Phe Tyr Thr Gly Thr Asp

370 375 380

Ser Pro Gly Pro Pro Phe Ser Asn Gln Met Asn Ser Trp Pro Gly Gln

385 390 395 400

Gly Ser Tyr Pro Thr Pro Thr Pro Ile Asn Asn Ile Gln Phe Arg Gly

405 410 415

Pro Asn Thr Ala Tyr Thr Ser Ala Met Ala Pro Ala Ser Leu Ser Pro

420 425 430

Ser Pro Ser Ser Val Ser Pro His Glu Tyr Ser Ser Met Phe His Pro

435 440 445

Phe Asn Ser Lys Pro Glu Gly Leu Gln Asp Arg Asp Cys Ser Met Asp

450 455 460

Val Asp Asp Arg Arg Tyr Val Ser Ser Ala Thr Glu His Ser Ala Ile

465 470 475 480

Gly Asn His Ile Asp Gln Leu Ile Glu Lys Lys Asn Glu Asp Gly Tyr

485 490 495

Ser Ser Ser Val Gly Lys Ile Gln Gln Ser Leu Gln Arg Glu Ala Ala

500 505 510

Leu Thr Lys Phe Arg Met Lys Arg Lys Asp Arg Cys Phe Glu Lys Lys

515 520 525

Val Arg Tyr Glu Ser Arg Lys Lys Leu Ala Glu Gln Arg Pro Arg Ile

530 535 540

Lys Gly Gln Phe Val Arg Gln Val Gln Ser Thr Gln Ala Pro

545 550 555

<210> 43

<211> 186

<212> PRT

<213> 拟南芥

<400> 43

Met Ala Glu Val Met Leu Pro Arg Lys Met Glu Ile Leu Asn His Ser

1 5 10 15

Ser Lys Phe Gly Ser Pro Asp Pro Leu His Val Leu Ala Val Asp Asp

20 25 30

Ser His Val Asp Arg Lys Phe Ile Glu Arg Leu Leu Arg Val Ser Ser

35 40 45

Cys Lys Val Thr Val Val Asp Ser Ala Thr Arg Ala Leu Gln Tyr Leu

50 55 60

Gly Leu Asp Val Glu Glu Lys Ser Val Gly Phe Glu Asp Leu Lys Val

65 70 75 80

Asn Leu Ile Met Thr Asp Tyr Ser Met Pro Gly Met Thr Gly Tyr Glu

85 90 95

Leu Leu Lys Lys Ile Lys Glu Ser Ser Ala Phe Arg Glu Val Pro Val

100 105 110

Val Ile Met Ser Ser Glu Asn Ile Leu Pro Arg Ile Asp Arg Cys Leu

115 120 125

Glu Glu Gly Ala Glu Asp Phe Leu Leu Lys Pro Val Lys Leu Ser Asp

130 135 140

Val Lys Arg Leu Arg Asp Ser Leu Met Lys Val Glu Asp Leu Ser Phe

145 150 155 160

Thr Lys Ser Ile Gln Lys Arg Glu Leu Glu Thr Glu Asn Val Tyr Pro

165 170 175

Val His Ser Gln Leu Lys Arg Ala Lys Ile

180 185

<210> 44

<211> 727

<212> PRT

<213> 拟南芥

<400> 44

Met Asn Ala Asn Glu Glu Gly Glu Gly Ser Arg Tyr Pro Ile Thr Asp

1 5 10 15

Arg Lys Thr Gly Glu Thr Lys Phe Asp Arg Val Glu Ser Arg Thr Glu

20 25 30

Lys His Ser Glu Glu Glu Lys Thr Asn Gly Ile Thr Met Asp Val Arg

35 40 45

Asn Gly Ser Ser Gly Gly Leu Gln Ile Pro Leu Ser Gln Gln Thr Ala

50 55 60

Ala Thr Val Cys Trp Glu Arg Phe Leu His Val Arg Thr Ile Arg Val

65 70 75 80

Leu Leu Val Glu Asn Asp Asp Cys Thr Arg Tyr Ile Val Thr Ala Leu

85 90 95

Leu Arg Asn Cys Ser Tyr Glu Val Val Glu Ala Ser Asn Gly Ile Gln

100 105 110

Ala Trp Lys Val Leu Glu Asp Leu Asn Asn His Ile Asp Ile Val Leu

115 120 125

Thr Glu Val Ile Met Pro Tyr Leu Ser Gly Ile Gly Leu Leu Cys Lys

130 135 140

Ile Leu Asn His Lys Ser Arg Arg Asn Ile Pro Val Ile Met Met Ser

145 150 155 160

Ser His Asp Ser Met Gly Leu Val Phe Lys Cys Leu Ser Lys Gly Ala

165 170 175

Val Asp Phe Leu Val Lys Pro Ile Arg Lys Asn Glu Leu Lys Ile Leu

180 185 190

Trp Gln His Val Trp Arg Arg Cys Gln Ser Ser Ser Gly Ser Gly Ser

195 200 205

Glu Ser Gly Thr His Gln Thr Gln Lys Ser Val Lys Ser Lys Ser Ile

210 215 220

Lys Lys Ser Asp Gln Asp Ser Gly Ser Ser Asp Glu Asn Glu Asn Gly

225 230 235 240

Ser Ile Gly Leu Asn Ala Ser Asp Gly Ser Ser Asp Gly Ser Gly Ala

245 250 255

Gln Ser Ser Trp Thr Lys Lys Ala Val Asp Val Asp Asp Ser Pro Arg

260 265 270

Ala Val Ser Leu Trp Asp Arg Val Asp Ser Thr Cys Ala Gln Val Val

275 280 285

His Ser Asn Pro Glu Phe Pro Ser Asn Gln Leu Val Ala Pro Pro Ala

290 295 300

Glu Lys Glu Thr Gln Glu His Asp Asp Lys Phe Glu Asp Val Thr Met

305 310 315 320

Gly Arg Asp Leu Glu Ile Ser Ile Arg Arg Asn Cys Asp Leu Ala Leu

325 330 335

Glu Pro Lys Asp Glu Pro Leu Ser Lys Thr Thr Gly Ile Met Arg Gln

340 345 350

Asp Asn Ser Phe Glu Lys Ser Ser Ser Lys Trp Lys Met Lys Val Gly

355 360 365

Lys Gly Pro Leu Asp Leu Ser Ser Glu Ser Pro Ser Ser Lys Gln Met

370 375 380

His Glu Asp Gly Gly Ser Ser Phe Lys Ala Met Ser Ser His Leu Gln

385 390 395 400

Asp Asn Arg Glu Pro Glu Ala Pro Asn Thr His Leu Lys Thr Leu Asp

405 410 415

Thr Asn Glu Ala Ser Val Lys Ile Ser Glu Glu Leu Met His Val Glu

420 425 430

His Ser Ser Lys Arg His Arg Gly Thr Lys Asp Asp Gly Thr Leu Val

435 440 445

Arg Asp Asp Arg Asn Val Leu Arg Arg Ser Glu Gly Ser Ala Phe Ser

450 455 460

Arg Tyr Asn Pro Ala Ser Asn Ala Asn Lys Ile Ser Gly Gly Asn Leu

465 470 475 480

Gly Ser Thr Ser Leu Gln Asp Asn Asn Ser Gln Asp Leu Ile Lys Lys

485 490 495

Thr Glu Ala Ala Tyr Asp Cys His Ser Asn Met Asn Glu Ser Leu Pro

500 505 510

His Asn His Arg Ser His Val Gly Ser Asn Asn Phe Asp Met Ser Ser

515 520 525

Thr Thr Glu Asn Asn Ala Phe Thr Lys Pro Gly Ala Pro Lys Val Ser

530 535 540

Ser Ala Gly Ser Ser Ser Val Lys His Ser Ser Phe Gln Pro Leu Pro

545 550 555 560

Cys Asp His His Asn Asn His Ala Ser Tyr Asn Leu Val His Val Ala

565 570 575

Glu Arg Lys Lys Leu Pro Pro Gln Cys Gly Ser Ser Asn Val Tyr Asn

580 585 590

Glu Thr Ile Glu Gly Asn Asn Asn Thr Val Asn Tyr Ser Val Asn Gly

595 600 605

Ser Val Ser Gly Ser Gly His Gly Ser Asn Gly Pro Tyr Gly Ser Ser

610 615 620

Asn Gly Met Asn Ala Gly Gly Met Asn Met Gly Ser Asp Asn Gly Ala

625 630 635 640

Gly Lys Asn Gly Asn Gly Asp Gly Ser Gly Ser Gly Ser Gly Ser Gly

645 650 655

Ser Gly Asn Leu Ala Asp Glu Asn Lys Ile Ser Gln Arg Glu Ala Ala

660 665 670

Leu Thr Lys Phe Arg Gln Lys Arg Lys Glu Arg Cys Phe Arg Lys Lys

675 680 685

Val Arg Tyr Gln Ser Arg Lys Lys Leu Ala Glu Gln Arg Pro Arg Val

690 695 700

Arg Gly Gln Phe Val Arg Lys Thr Ala Ala Ala Thr Asp Asp Asn Asp

705 710 715 720

Ile Lys Asn Ile Glu Asp Ser

725

<210> 45

<211> 444

<212> PRT

<213> 拟南芥

<400> 45

Met Gly Glu Ile Val Val Leu Ser Ser Asp Asp Gly Met Glu Thr Ile

1 5 10 15

Lys Asn Arg Val Lys Ser Ser Glu Val Val Gln Trp Glu Lys Tyr Leu

20 25 30

Pro Lys Thr Val Leu Arg Val Leu Leu Val Glu Ser Asp Tyr Ser Thr

35 40 45

Arg Gln Ile Ile Thr Ala Leu Leu Arg Lys Cys Cys Tyr Lys Val Val

50 55 60

Ala Val Ser Asp Gly Leu Ala Ala Trp Glu Val Leu Lys Glu Lys Ser

65 70 75 80

His Asn Ile Asp Leu Ile Leu Thr Glu Leu Asp Leu Pro Ser Ile Ser

85 90 95

Gly Phe Ala Leu Leu Ala Leu Val Met Glu His Glu Ala Cys Lys Asn

100 105 110

Ile Pro Val Ile Met Met Ser Ser Gln Asp Ser Ile Lys Met Val Leu

115 120 125

Lys Cys Met Leu Arg Gly Ala Ala Asp Tyr Leu Ile Lys Pro Met Arg

130 135 140

Lys Asn Glu Leu Lys Asn Leu Trp Gln His Val Trp Arg Arg Leu Thr

145 150 155 160

Leu Arg Asp Asp Pro Thr Ala His Ala Gln Ser Leu Pro Ala Ser Gln

165 170 175

His Asn Leu Glu Asp Thr Asp Glu Thr Cys Glu Asp Ser Arg Tyr His

180 185 190

Ser Asp Gln Gly Ser Gly Ala Gln Ala Ile Asn Tyr Asn Gly His Asn

195 200 205

Lys Leu Met Glu Asn Gly Lys Ser Val Asp Glu Arg Asp Glu Phe Lys

210 215 220

Glu Thr Phe Asp Val Thr Met Asp Leu Ile Gly Gly Ile Asp Lys Arg

225 230 235 240

Pro Asp Ser Ile Tyr Lys Asp Lys Ser Arg Asp Glu Cys Val Gly Pro

245 250 255

Glu Leu Gly Leu Ser Leu Lys Arg Ser Cys Ser Val Ser Phe Glu Asn

260 265 270

Gln Asp Glu Ser Lys His Gln Lys Leu Ser Leu Ser Asp Ala Ser Ala

275 280 285

Phe Ser Arg Phe Glu Glu Ser Lys Ser Ala Glu Lys Ala Val Val Ala

290 295 300

Leu Glu Glu Ser Thr Ser Gly Glu Pro Lys Thr Pro Thr Glu Ser His

305 310 315 320

Glu Lys Leu Arg Lys Val Thr Ser Asp Gln Gly Ser Ala Thr Thr Ser

325 330 335

Ser Asn Gln Glu Asn Ile Gly Ser Ser Ser Val Ser Phe Arg Asn Gln

340 345 350

Val Leu Gln Ser Thr Val Thr Asn Gln Lys Gln Asp Ser Pro Ile Pro

355 360 365

Val Glu Ser Asn Arg Glu Lys Ala Ala Ser Lys Glu Val Glu Ala Gly

370 375 380

Ser Gln Ser Thr Asn Glu Gly Ile Ala Gly Gln Ser Ser Ser Thr Glu

385 390 395 400

Lys Pro Lys Glu Glu Glu Ser Ala Lys Gln Arg Trp Ser Arg Ser Gln

405 410 415

Arg Glu Ala Ala Leu Met Lys Phe Arg Leu Lys Arg Lys Asp Arg Cys

420 425 430

Phe Asp Lys Lys Val Arg Asp Thr Gln Ala Ser Ser

435 440

<210> 46

<211> 204

<212> PRT

<213> 拟南芥

<400> 46

Met Ala Leu Arg Asp Leu Ser Ser Ser Ser Ser Ser Pro Glu Leu His

1 5 10 15

Val Leu Ala Val Asp Asp Ser Phe Val Asp Arg Lys Val Leu Glu Arg

20 25 30

Leu Leu Lys Ile Ser Ala Cys Lys Val Thr Thr Val Glu Ser Gly Thr

35 40 45

Arg Ala Leu Gln Tyr Leu Gly Leu Asp Gly Asp Asn Gly Ser Ser Gly

50 55 60

Leu Lys Asp Leu Lys Val Asn Leu Ile Val Thr Asp Tyr Ser Met Pro

65 70 75 80

Gly Leu Thr Gly Tyr Glu Leu Leu Lys Lys Ile Lys Glu Ser Ser Ala

85 90 95

Leu Arg Glu Ile Pro Val Val Ile Met Ser Ser Glu Asn Ile Gln Pro

100 105 110

Arg Ile Glu Gln Cys Met Ile Glu Gly Ala Glu Glu Phe Leu Leu Lys

115 120 125

Pro Val Lys Leu Ala Asp Val Lys Arg Leu Lys Glu Leu Ile Met Arg

130 135 140

Gly Gly Glu Ala Glu Glu Gly Lys Thr Lys Lys Leu Ser Pro Lys Arg

145 150 155 160

Ile Leu Gln Asn Asp Ile Asp Ser Ser Pro Ser Ser Ser Ser Ser Thr

165 170 175

Ser Ser Ser Ser Ser Ser His Asp Val Ser Ser Leu Asp Asp Asp Thr

180 185 190

Pro Ser Ser Lys Arg Ile Lys Leu Glu Ser Arg Gly

195 200

<210> 47

<211> 691

<212> PRT

<213> 大豆

<400> 47

Met Gly Glu Val Val Ile Met Ser Gly Glu Lys Lys Ser Val Arg Val

1 5 10 15

Glu Gly Val Glu Lys Glu Asp Ser Gly Gly Ser Gly Ser Lys Ala Gly

20 25 30

Glu Phe Lys Gly Leu Met Arg Trp Glu Lys Phe Leu Pro Lys Met Val

35 40 45

Leu Arg Val Leu Leu Val Glu Ala Asp Asp Ser Thr Arg Gln Ile Ile

50 55 60

Ala Ala Leu Leu Arg Lys Cys Ser Tyr Lys Val Val Ala Val Pro Asp

65 70 75 80

Gly Leu Lys Ala Trp Glu Leu Leu Lys Gly Arg Pro His Asn Val Asp

85 90 95

Leu Ile Leu Thr Glu Val Asp Leu Pro Ser Ile Ser Gly Tyr Ala Leu

100 105 110

Leu Thr Leu Ile Met Glu His Glu Ile Cys Lys Asn Ile Pro Val Ile

115 120 125

Met Met Ser Ser Gln Asp Ser Ile Ser Thr Val Tyr Lys Cys Met Leu

130 135 140

Arg Gly Ala Ala Asp Tyr Leu Val Lys Pro Ile Arg Lys Asn Glu Leu

145 150 155 160

Arg Asn Leu Trp Gln His Val Trp Arg Arg Gln Ser Ser Thr Thr Gly

165 170 175

Ile Asn Gly Leu Gln Asp Glu Ser Val Ala Gln Gln Lys Val Glu Ala

180 185 190

Thr Ala Glu Asn Asn Ala Ala Ser Asn Arg Ser Ser Gly Asp Ala Ala

195 200 205

Cys Ile Gln Arg Asn Ile Glu Leu Ile Glu Lys Gly Ser Asp Ala Gln

210 215 220

Ser Ser Cys Thr Lys Pro Asp Cys Glu Ala Glu Ser Asp Pro Val Gly

225 230 235 240

Asn Met Gln Glu Phe Ser Leu Leu Lys Cys Gly Glu Ala Tyr Pro Ser

245 250 255

Gly Thr Glu Thr Gln Gln Val Glu Thr Ser Phe Arg Leu Gly Gln Thr

260 265 270

Leu Met Met His Asp Cys His Ala Gly Gly Leu Asn Val Ser Ile Arg

275 280 285

Lys Asn Gly Glu Ala Ser Thr Thr Asn Asp Lys Asp Thr Asp Thr Glu

290 295 300

His Phe Gly Asn Ala Ser Ile Ser Gly Glu Ala His Asp Asn Pro Tyr

305 310 315 320

Val Gln Ile Asn Ser Ser Lys Glu Ala Met Asp Leu Ile Gly Ala Phe

325 330 335

His Thr His Pro Asn Cys Ser Leu Lys Asn Ser Thr Val Asn Cys Thr

340 345 350

Gly Asn Phe Asp His Ser Pro Gln Leu Asp Leu Ser Leu Arg Arg Ser

355 360 365

Cys Pro Gly Ser Phe Glu Asn Lys Leu Thr Glu Glu Arg His Thr Leu

370 375 380

Met His Ser Asn Ala Ser Ala Phe Lys Arg Tyr Thr Thr Arg Gln Leu

385 390 395 400

Gln Ile Ser Met Pro Ala Val Leu Ile Asn Phe Ser Asp Gln Gln Arg

405 410 415

Glu Gln Ile Thr Asn Cys Glu Lys Asn Ile Ser His Ile Ala Thr Gly

420 425 430

Ser Asn Ser Asp Ser Ser Thr Pro Met Gln Arg Cys Ile Val Ser Pro

435 440 445

Thr Thr Val Gln Ser Lys Glu Ser Glu Leu Ala Thr Ser His Pro Pro

450 455 460

Gln Gly His Ser Leu Pro Ile Pro Val Lys Gly Val Arg Phe Asn Asp

465 470 475 480

Leu Cys Thr Ala Tyr Gly Ser Val Leu Pro Ser Val Phe His Thr Gln

485 490 495

Ser Gly Pro Pro Ala Met Pro Ser Pro Asn Ser Val Val Leu Leu Glu

500 505 510

Pro Asn Phe Gln Val Asn Ala Phe Tyr Gln Ser Asn Met Lys Glu Ser

515 520 525

Ser Ser Glu Gln Leu Tyr Glu Ser Arg Gly Pro Asn Gly Asn Thr Thr

530 535 540

Gln Asn His Ile Val Tyr Thr Gln Glu His Lys Ser Glu His Ala Glu

545 550 555 560

Asp Arg Gly His Ile Ser Pro Thr Thr Asp Gln Ser Val Ser Ser Ser

565 570 575

Phe Cys Asn Gly Asn Ala Ser His Leu Asn Ser Ile Gly Tyr Gly Ser

580 585 590

Asn Cys Gly Ser Ser Ser Asn Val Asp Gln Val Asn Thr Val Trp Ala

595 600 605

Ala Ser Glu Gly Lys His Glu Asp Leu Thr Asn Asn Ala Asn Ser His

610 615 620

Arg Ser Ile Gln Arg Glu Ala Ala Leu Asn Lys Phe Arg Leu Lys Arg

625 630 635 640

Lys Glu Arg Cys Tyr Glu Lys Lys Val Arg Tyr Glu Ser Arg Lys Lys

645 650 655

Leu Ala Glu Gln Arg Pro Arg Val Lys Gly Gln Phe Val Arg Gln Val

660 665 670

His Pro Asp Pro Leu Val Ala Glu Lys Asp Gly Lys Glu Tyr Asp His

675 680 685

Ser Asp Phe

690

<210> 48

<211> 747

<212> PRT

<213> 葡萄

<400> 48

Met Gly Glu Val Val Val Ser Ser Glu Ala Gly Gly Gly Gly Met Glu

1 5 10 15

Gly Glu Val Glu Lys Lys Glu Val Gly Ser Gly Val Val Arg Trp Glu

20 25 30

Arg Phe Leu Pro Arg Met Val Leu Arg Val Leu Leu Val Glu Ala Asp

35 40 45

Asp Ser Thr Arg Gln Ile Ile Ala Ala Leu Leu Arg Lys Cys Ser Tyr

50 55 60

Lys Val Ala Ala Val Pro Asp Gly Leu Lys Ala Trp Glu Val Leu Lys

65 70 75 80

Ala Arg Pro His Asn Ile Asp Leu Ile Leu Thr Glu Val Glu Leu Pro

85 90 95

Ser Ile Ser Gly Phe Ala Leu Leu Thr Leu Val Met Glu His Glu Ile

100 105 110

Cys Lys Asn Ile Pro Val Ile Met Met Ser Ser His Gly Ser Ile Asn

115 120 125

Thr Val Tyr Lys Cys Met Leu Arg Gly Ala Ala Asp Phe Leu Val Lys

130 135 140

Pro Val Arg Arg Asn Glu Leu Lys Asn Leu Trp Gln His Val Trp Arg

145 150 155 160

Arg Gln Ser Ser Thr Val Ser Gly Asn Gly Pro Gln Asp Glu Ser Val

165 170 175

Ala Gln Gln Lys Val Glu Ala Thr Ser Glu Asn Asn Pro Thr Ser Asn

180 185 190

His Ser Ser Asp His Val Ala Cys Ile Gln Lys Asn Lys Glu Ala Leu

195 200 205

Asn Lys Val Ser Asp Ala Gln Ser Ser Cys Ser Lys Pro Asp Leu Glu

210 215 220

Ala Glu Ser Ala Tyr Met Glu Thr Met Gln Asp Phe Ser Asn Pro Thr

225 230 235 240

Trp Ser Arg Ser Leu Val Ser Asp Thr Lys Met Gln Lys Asn Glu Glu

245 250 255

Cys Ala Lys Leu Gly Pro Lys Phe Leu Met His Asn Lys Glu Ala Gly

260 265 270

Gly Thr Leu Glu Ala Ala Cys Arg Asp Val Asn Thr Met Thr Gln Pro

275 280 285

Glu Ala Val Glu Pro Glu Asn Asp Gly Gln Gly Ala Asn Ala Pro Ser

290 295 300

Glu Ala Cys Gly Asn Asn Ala Ile Leu Gly Ser Ser Ser Arg Glu Ala

305 310 315 320

Ile Asp Leu Ile Gly Val Phe Asp Asn Ser Lys Lys Cys Thr Tyr Gly

325 330 335

Asn Ser Ser Ser Asn Asn Gly Thr Lys Lys Ser Asp Ser Ile Pro Gln

340 345 350

Leu Asp Leu Ser Leu Arg Arg Ser His Pro Ser Ser Pro Glu Asn Gln

355 360 365

Val Ala Asp Glu Arg His Thr Leu Asn His Ser Asn Gly Ser Ala Phe

370 375 380

Ser Arg Tyr Ile Asn Arg Ser Leu Gln Pro Pro His Leu Pro Ser Thr

385 390 395 400

Gly Val Phe Asn Gln Gln Lys Asn Phe Gly Ala Asp Ser Asp Lys Arg

405 410 415

Leu Ser Gln Leu Val Thr Gly Tyr Asn Ser Asp Ile Thr Ser Pro Thr

420 425 430

Leu Ser Thr Gln Arg Ser Val Ile Ser Leu Ala Thr Ser Pro Ser Gly

435 440 445

Arg Val Glu Ile Ala Leu Cys Gly Pro Gln Gln Arg Ala Phe Pro Ala

450 455 460

Pro Val Pro Gln Asn Ala Asn Asn Ser Thr Ser Gln Thr Asn His Lys

465 470 475 480

Pro Glu His Lys Leu Asp Ser Leu Glu Gly Gln Gly His Phe Ser Pro

485 490 495

Ala Thr Asp Gln Asn Ser Ser Ser Ser Phe Gly Asn Gly Gly Ala Ser

500 505 510

Asn Leu Asn Ser Phe Gly Cys Gly Ser Ile Cys Gly Ser Asn Gly Asn

515 520 525

Ala Asn Thr Val Ala Val Val Gln Ala Ala Ala Glu Gly Lys Asn Glu

530 535 540

Glu Gly Ile Phe Ser His Glu Gly His Ser Gln Arg Ser Ile Gln Arg

545 550 555 560

Glu Ala Ala Leu Thr Lys Phe Arg Leu Lys Arg Lys Asp Arg Cys Phe

565 570 575

Glu Lys Lys Val Arg Tyr Glu Ser Arg Lys Lys Leu Ala Glu Gln Arg

580 585 590

Pro Arg Val Lys Gly Gln Phe Val Arg Gln Val His Thr Ile Pro Pro

595 600 605

Pro Ala Glu Pro Asp Thr Tyr Tyr Gly Ser Ser Phe Asp Val Gln Pro

610 615 620

Gln Arg Ser Arg Tyr Leu Ser Ala Gln Pro Leu Arg Ala Ser Ser Ser

625 630 635 640

Gln Leu Leu Tyr Pro Thr His Thr Pro Leu Gln Glu Ser Lys Tyr Glu

645 650 655

Gly His Glu Glu Ser Asn Leu Leu Thr Ala Ser Leu Val Gly Thr Ala

660 665 670

Leu Pro Val Ala Pro Ser Phe Gly Tyr Glu Val Gly Arg Asp Gln Thr

675 680 685

Ala Gly Lys Leu Val Leu Ser Leu Lys Leu Asp Gly Arg Val Arg Trp

690 695 700

Lys Val Gly Thr Trp Val Ser Gly Arg Tyr Arg Leu Asn Val Asn Cys

705 710 715 720

Val Ala Val Met Ala Phe Gly Pro Ser Ile Pro Ser Gly Pro Leu Ser

725 730 735

Ser Lys Glu Gly Thr Gln Cys Ser Thr Thr Val

740 745

<210> 49

<211> 799

<212> PRT

<213> 可可

<400> 49

Met Gly Ile Val Gln Met Asn Asn Asn Gly Pro Val Ala Asn Gly Leu

1 5 10 15

Val Glu Leu Asn Thr His Ile His Asp Glu His Lys Lys Ile Arg Gly

20 25 30

Gly Val Ile Gly Glu Gly Gln Gly Leu Ser Val Glu Glu Glu Ser Trp

35 40 45

Ile Asn Glu Asp Val Glu Asp Arg Asn Asp Gly Lys Thr Glu Leu Val

50 55 60

Gln Val Gln Gly His Ala His Gly Glu Gln Glu Arg Ser Gln Gln Gln

65 70 75 80

Pro Gln Gly Pro Leu Val His Trp Glu Arg Phe Leu Pro Leu Arg Ser

85 90 95

Leu Lys Val Leu Leu Val Glu Asn Asp Asp Ser Thr Arg His Val Val

100 105 110

Cys Ala Leu Leu Arg Asn Cys Gly Phe Glu Val Thr Ala Val Ser Asn

115 120 125

Gly Leu Gln Ala Trp Lys Ile Leu Glu Asp Leu Thr Asn His Ile Asp

130 135 140

Leu Val Leu Thr Glu Val Val Met Pro Cys Leu Ser Gly Ile Gly Leu

145 150 155 160

Leu Cys Lys Ile Met Ser His Lys Thr Arg Met Asn Ile Pro Val Ile

165 170 175

Met Met Ser Ser His Asp Ser Met Ser Thr Val Phe Arg Cys Leu Ser

180 185 190

Lys Gly Ala Val Asp Phe Leu Val Lys Pro Ile Arg Lys Asn Glu Leu

195 200 205

Lys Asn Leu Trp Gln His Val Trp Arg Lys Cys His Ser Ser Ser Ser

210 215 220

Ser Gly Gly Gln Ser Gly Thr Gln Thr Gln Lys Ser Ser Lys Ser Lys

225 230 235 240

Gly Thr Asp Ser Asp Asn Asn Thr Gly Ser Asn Asp Glu Asp Asp Asn

245 250 255

Gly Ser Val Gly Leu Asn Val Gln Asp Gly Ser Asp Asn Gly Ser Gly

260 265 270

Thr Gln Ser Ser Trp Thr Lys Arg Ala Val Glu Val Asp Ser Ser Gln

275 280 285

Pro Ile Ser Pro Trp Asp Gln Leu Ala Asp Pro Pro His Ser Thr Cys

290 295 300

Ala Gln Val Ile His Ser Arg His Glu Val Leu Gly Asp Ser Trp Val

305 310 315 320

Pro Val Thr Ala Thr Arg Glu Tyr Asp Glu Leu Asp Asn Glu Leu Glu

325 330 335

Asn Val Val Met Gly Lys Asp Leu Glu Ile Gly Val Pro Lys Ile Thr

340 345 350

Ala Ser Gln Leu Glu Asp Pro Ser Glu Lys Val Met Thr Asn Ile Ala

355 360 365

Gly Val Asn Lys Asp Lys Leu Ser Ala Ile Asn Pro Lys Lys Asp Asp

370 375 380

Glu Lys Leu Glu Lys Ala Gln Leu Glu Leu Asn Ser Glu Lys Ser Gly

385 390 395 400

Gly Asp Leu Arg Asn Gln Ala Ala Asp Leu Ile Gly Val Ile Thr Asn

405 410 415

Asn Thr Glu Pro His Ile Glu Ser Ala Val Phe Asp Ile Pro Asn Gly

420 425 430

Leu Pro Lys Val Ser Asp Ala Lys Glu Lys Val Asn Tyr Asp Thr Lys

435 440 445

Glu Met Pro Phe Leu Glu Leu Ser Leu Lys Arg Leu Arg Asp Val Gly

450 455 460

Asp Thr Gly Thr Ser Ala His Glu Arg Asn Val Leu Arg His Ser Asp

465 470 475 480

Leu Ser Ala Phe Ser Arg Tyr Asn Ser Gly Ser Thr Ala Asn Gln Ala

485 490 495

Pro Thr Gly Asn Val Gly Ser Cys Ser Pro Leu Asp Asn Ser Ser Glu

500 505 510

Ala Val Lys Thr Asp Ser Met Lys Asn Phe Gln Ser Thr Ser Asn Ser

515 520 525

Ile Pro Pro Lys Gln Gln Ser Asn Gly Ser Ser Asn Asn Asn Asp Met

530 535 540

Gly Ser Thr Thr Asn Asn Ala Phe Ser Lys Pro Ala Val Leu Ser Asp

545 550 555 560

Lys Pro Ala Pro Lys Thr Ser Ala Lys Ser Phe His Pro Ser Ser Ala

565 570 575

Phe Gln Pro Val Gln Ser Gly His Gly Ser Ala Leu Gln Pro Val Ala

580 585 590

Gln Gly Lys Ala Asp Ala Ala Leu Gly Asn Met Ile Leu Val Lys Ala

595 600 605

Arg Gly Thr Asp Gln Gln Gly Lys Val Gln His His His His His Tyr

610 615 620

His His His His His His His Val His Asn Met Leu Pro Asn Gln Lys

625 630 635 640

Leu Gly Asn His Asp Asp Leu Ser Leu Glu Asn Met Ala Ala Ala Ala

645 650 655

Pro Gln Cys Gly Ser Ser Asn Leu Ser Ser Leu Pro His Val Glu Gly

660 665 670

Asn Ala Ala Asn His Ser Leu Thr Arg Ser Ala Ser Gly Ser Asn His

675 680 685

Gly Ser Asn Gly Gln Asn Gly Ser Ser Thr Val Leu Asn Thr Arg Gly

690 695 700

Met Asn Leu Glu Ser Glu Asn Gly Val Pro Gly Lys Gly Gly Ala Gly

705 710 715 720

Gly Gly Ile Gly Ser Gly Gly Arg Asn Val Val Asp Gln Asn Arg Phe

725 730 735

Ala Gln Arg Glu Ala Ala Leu Asn Lys Phe Arg Gln Lys Arg Lys Glu

740 745 750

Arg Cys Phe Glu Lys Lys Val Arg Tyr Gln Ser Arg Lys Lys Leu Ala

755 760 765

Glu Gln Arg Pro Arg Ile Arg Gly Gln Phe Val Arg Gln Ile Ser Thr

770 775 780

Thr Gly Lys Glu Ala Phe Arg Phe Arg Gly Ala Gly Leu Cys Thr

785 790 795

<210> 50

<211> 742

<212> PRT

<213> 稻

<400> 50

Met Met Gly Thr Ala His His Asn Gln Thr Ala Gly Ser Ala Leu Gly

1 5 10 15

Val Gly Val Gly Asp Ala Asn Asp Ala Val Pro Gly Ala Gly Gly Gly

20 25 30

Gly Tyr Ser Asp Pro Asp Gly Gly Pro Ile Ser Gly Val Gln Arg Pro

35 40 45

Pro Gln Val Cys Trp Glu Arg Phe Ile Gln Lys Lys Thr Ile Lys Val

50 55 60

Leu Leu Val Asp Ser Asp Asp Ser Thr Arg Gln Val Val Ser Ala Leu

65 70 75 80

Leu Arg His Cys Met Tyr Glu Val Ile Pro Ala Glu Asn Gly Gln Gln

85 90 95

Ala Trp Thr Tyr Leu Glu Asp Met Gln Asn Ser Ile Asp Leu Val Leu

100 105 110

Thr Glu Val Val Met Pro Gly Val Ser Gly Ile Ser Leu Leu Ser Arg

115 120 125

Ile Met Asn His Asn Ile Cys Lys Asn Ile Pro Val Ile Met Met Ser

130 135 140

Ser Asn Asp Ala Met Gly Thr Val Phe Lys Cys Leu Ser Lys Gly Ala

145 150 155 160

Val Asp Phe Leu Val Lys Pro Ile Arg Lys Asn Glu Leu Lys Asn Leu

165 170 175

Trp Gln His Val Trp Arg Arg Cys His Ser Ser Ser Gly Ser Gly Ser

180 185 190

Glu Ser Gly Ile Gln Thr Gln Lys Cys Ala Lys Ser Lys Ser Gly Asp

195 200 205

Glu Ser Asn Asn Asn Asn Gly Ser Asn Asp Asp Asp Asp Asp Asp Gly

210 215 220

Val Ile Met Gly Leu Asn Ala Arg Asp Gly Ser Asp Asn Gly Ser Gly

225 230 235 240

Thr Gln Ala Gln Ser Ser Trp Thr Lys Arg Ala Val Glu Ile Asp Ser

245 250 255

Pro Gln Ala Met Ser Pro Asp Gln Leu Ala Asp Pro Pro Asp Ser Thr

260 265 270

Cys Ala Gln Val Ile His Leu Lys Ser Asp Ile Cys Ser Asn Arg Trp

275 280 285

Leu Pro Cys Thr Ser Asn Lys Asn Ser Lys Lys Gln Lys Glu Thr Asn

290 295 300

Asp Asp Phe Lys Gly Lys Asp Leu Glu Ile Gly Ser Pro Arg Asn Leu

305 310 315 320

Asn Thr Ala Tyr Gln Ser Ser Pro Asn Glu Arg Ser Ile Lys Pro Thr

325 330 335

Asp Arg Arg Asn Glu Tyr Pro Leu Gln Asn Asn Ser Lys Glu Ala Ala

340 345 350

Met Glu Asn Leu Glu Glu Ser Ser Val Arg Ala Ala Asp Leu Ile Gly

355 360 365

Ser Met Ala Lys Asn Met Asp Ala Gln Gln Ala Ala Arg Ala Ala Asn

370 375 380

Ala Pro Asn Cys Ser Ser Lys Val Pro Glu Gly Lys Asp Lys Asn Arg

385 390 395 400

Asp Asn Ile Met Pro Ser Leu Glu Leu Ser Leu Lys Arg Ser Arg Ser

405 410 415

Thr Gly Asp Gly Ala Asn Ala Ile Gln Glu Glu Gln Arg Asn Val Leu

420 425 430

Arg Arg Ser Asp Leu Ser Ala Phe Thr Arg Tyr His Thr Pro Val Ala

435 440 445

Ser Asn Gln Gly Gly Thr Gly Phe Met Gly Ser Cys Ser Leu His Asp

450 455 460

Asn Ser Ser Glu Ala Met Lys Thr Asp Ser Ala Tyr Asn Met Lys Ser

465 470 475 480

Asn Ser Asp Ala Ala Pro Ile Lys Gln Gly Ser Asn Gly Ser Ser Asn

485 490 495

Asn Asn Asp Met Gly Ser Thr Thr Lys Asn Val Val Thr Lys Pro Ser

500 505 510

Thr Asn Lys Glu Arg Val Met Ser Pro Ser Ala Val Lys Ala Asn Gly

515 520 525

His Thr Ser Ala Phe His Pro Ala Gln His Trp Thr Ser Pro Ala Asn

530 535 540

Thr Thr Gly Lys Glu Lys Thr Asp Glu Val Ala Asn Asn Ala Ala Lys

545 550 555 560

Arg Ala Gln Pro Gly Glu Val Gln Ser Asn Leu Val Gln His Pro Arg

565 570 575

Pro Ile Leu His Tyr Val His Phe Asp Val Ser Arg Glu Asn Gly Gly

580 585 590

Ser Gly Ala Pro Gln Cys Gly Ser Ser Asn Val Phe Asp Pro Pro Val

595 600 605

Glu Gly His Ala Ala Asn Tyr Gly Val Asn Gly Ser Asn Ser Gly Ser

610 615 620

Asn Asn Gly Ser Asn Gly Gln Asn Gly Ser Thr Thr Ala Val Asn Ala

625 630 635 640

Glu Arg Pro Asn Met Glu Ile Ala Asn Gly Thr Ile Asn Lys Ser Gly

645 650 655

Pro Gly Gly Gly Asn Gly Ser Gly Ser Gly Ser Gly Asn Asp Met Tyr

660 665 670

Leu Lys Arg Phe Thr Gln Arg Glu His Arg Val Ala Ala Val Ile Lys

675 680 685

Phe Arg Gln Lys Arg Lys Glu Arg Asn Phe Gly Lys Lys Val Arg Tyr

690 695 700

Gln Ser Arg Lys Arg Leu Ala Glu Gln Arg Pro Arg Val Arg Gly Gln

705 710 715 720

Phe Val Arg Gln Ala Val Gln Asp Gln Gln Gln Gln Gly Gly Gly Arg

725 730 735

Glu Ala Ala Ala Asp Arg

740

<210> 51

<211> 766

<212> PRT

<213> 玉米

<400> 51

Met Gly Ser Ala Cys Gln Ala Gly Thr Asp Gly Pro Ser Arg Lys Asp

1 5 10 15

Val Leu Gly Ile Gly Asn Ala Ala Leu Glu Asn Gly His His Gln Ala

20 25 30

Glu Ala Asp Ala Asp Glu Trp Arg Glu Lys Glu Glu Asp Leu Ala Asn

35 40 45

Asn Gly His Ser Ala Pro Pro Pro Gly Met Gln Gln Val Asp Glu His

50 55 60

Lys Glu Glu Gln Arg Gln Ser Ile His Trp Glu Arg Phe Leu Pro Val

65 70 75 80

Lys Thr Leu Arg Val Leu Leu Val Glu Asn Asp Asp Ser Thr Arg Gln

85 90 95

Val Val Ser Ala Leu Leu Arg Lys Cys Cys Tyr Glu Val Ile Pro Ala

100 105 110

Glu Asn Gly Leu His Ala Trp Arg Tyr Leu Glu Asp Leu Gln Asn Asn

115 120 125

Ile Asp Leu Val Leu Thr Glu Val Phe Met Pro Cys Leu Ser Gly Ile

130 135 140

Gly Leu Leu Ser Lys Ile Thr Ser His Lys Ile Cys Lys Asp Ile Pro

145 150 155 160

Val Ile Met Met Ser Thr Asn Asp Ser Met Ser Met Val Phe Lys Cys

165 170 175

Leu Ser Lys Gly Ala Val Asp Phe Leu Val Lys Pro Leu Arg Lys Asn

180 185 190

Glu Leu Lys Asn Leu Trp Gln His Val Trp Arg Arg Cys His Ser Ser

195 200 205

Ser Gly Ser Glu Ser Gly Ile Gln Thr Gln Lys Cys Ala Lys Leu Asn

210 215 220

Thr Gly Asp Glu Tyr Glu Asn Gly Ser Asp Ser Asn His Asp Asp Glu

225 230 235 240

Glu Asn Asp Asp Gly Asp Asp Asp Asp Phe Ser Val Gly Leu Asn Ala

245 250 255

Arg Asp Gly Ser Asp Asn Gly Ser Gly Thr Gln Ser Ser Trp Thr Lys

260 265 270

Arg Ala Val Glu Ile Asp Ser Pro Gln Pro Ile Ser Pro Asp Gln Leu

275 280 285

Val Asp Pro Pro Asp Ser Thr Cys Ala Gln Val Ile His Pro Arg Ser

290 295 300

Glu Ile Cys Ser Asn Lys Trp Leu Pro Thr Ala Asn Lys Arg Asn Val

305 310 315 320

Lys Lys Gln Lys Glu Asn Lys Asp Glu Ser Met Gly Arg Tyr Leu Gly

325 330 335

Ile Gly Ala Pro Arg Asn Ser Ser Ala Glu Tyr Gln Ser Ser Leu Asn

340 345 350

Asp Val Ser Val Asn Pro Ile Glu Lys Gly His Glu Asn His Met Ser

355 360 365

Lys Cys Lys Ser Lys Lys Glu Thr Met Ala Glu Asp Asp Cys Thr Asn

370 375 380

Met Pro Ser Ala Thr Asn Ala Glu Thr Ala Asp Leu Ile Ser Ser Ile

385 390 395 400

Ala Arg Asn Thr Glu Gly Gln Gln Ala Val Gln Ala Val Asp Ala Pro

405 410 415

Asp Gly Pro Ser Lys Met Ala Asn Gly Asn Asp Lys Asn His Asp Ser

420 425 430

His Ile Glu Val Thr Pro His Glu Leu Gly Leu Lys Arg Ser Arg Thr

435 440 445

Asn Gly Ala Thr Ala Glu Ile His Asp Glu Arg Asn Ile Leu Lys Arg

450 455 460

Ser Asp Gln Ser Ala Phe Thr Arg Tyr His Thr Ser Val Ala Ser Asn

465 470 475 480

Gln Gly Gly Ala Arg Tyr Gly Glu Ser Ser Ser Pro Gln Asp Asn Ser

485 490 495

Ser Glu Ala Met Lys Thr Asp Ser Thr Cys Lys Met Lys Ser Asn Ser

500 505 510

Asp Ala Ala Pro Ile Lys Gln Gly Ser Asn Gly Ser Ser Asn Asn Asp

515 520 525

Val Gly Ser Ser Thr Lys Asn Val Ala Ala Arg Pro Ser Gly Asp Arg

530 535 540

Glu Arg Val Ala Ser Pro Leu Ala Ile Lys Ser Thr Gln His Ala Ser

545 550 555 560

Ala Phe His Thr Ile Gln Asn Gln Thr Ser Pro Ala Asn Leu Ile Gly

565 570 575

Glu Asp Lys Ala Asp Glu Gly Ile Ser Asn Thr Val Lys Met Ser His

580 585 590

Pro Thr Glu Val Pro Gln Gly Cys Val Gln His His His His Val His

595 600 605

Tyr Tyr Leu His Val Met Thr Gln Lys Gln Pro Ser Thr Asp Arg Gly

610 615 620

Ser Ser Asp Val His Cys Gly Ser Ser Asn Val Phe Asp Pro Pro Val

625 630 635 640

Glu Gly His Ala Ala Asn Tyr Ser Val Asn Gly Gly Val Ser Val Gly

645 650 655

His Asn Gly Cys Asn Gly Gln Asn Gly Ser Ser Ala Val Pro Asn Ile

660 665 670

Ala Arg Pro Asn Ile Glu Ser Ile Asn Gly Thr Met Ser Gln Asn Ile

675 680 685

Ala Gly Gly Gly Ile Val Ser Gly Ser Gly Ser Gly Asn Asp Met Tyr

690 695 700

Gln Asn Arg Phe Leu Gln Arg Glu Ala Ala Leu Asn Lys Phe Arg Leu

705 710 715 720

Lys Arg Lys Asp Arg Asn Phe Gly Lys Lys Val Arg Tyr Gln Ser Arg

725 730 735

Lys Arg Leu Ala Glu Gln Arg Pro Arg Val Arg Gly Gln Phe Val Arg

740 745 750

Gln Ser Glu Gln Glu Asp Gln Thr Ala Gln Gly Ser Glu Arg

755 760 765

<210> 52

<211> 917

<212> PRT

<213> 小立碗藓

<400> 52

Met Thr Ala Asp Leu Cys Glu Phe Glu Ser Glu Ser Asp Pro Leu Gln

1 5 10 15

Pro Leu Ser Ala Val Gly Arg Ala Trp Val Glu Pro Ile Val Gly Thr

20 25 30

Pro Val Gly Ala Glu Trp Arg Ile Lys Gly Gly Phe Lys Ala His Lys

35 40 45

Glu Val Asp Arg Ser Arg Glu Gln Val Gly Ser Lys Arg Val Asp Asp

50 55 60

Arg Glu Lys Asn Ser Gly Arg Leu Glu Asn Gly Cys Arg Phe Ala Asp

65 70 75 80

Arg Thr Gly Gly Ala Val Leu Lys Ala Arg Glu Asp Pro Lys Asp Ile

85 90 95

Ala Glu Gln Ile Arg Arg Glu Leu Asp His Gln Phe Pro Val Asn Asp

100 105 110

Val Leu Arg Thr Ser Glu Ser Asp Glu Asp Gly Arg Arg Glu Asp Ser

115 120 125

Ala Glu Asp His Tyr Glu Glu Gly Asp Ala Val Ala Ala Val Val Phe

130 135 140

Glu Lys Gln Arg Pro Arg Glu Ile Ala Gln Thr Arg Glu Gln Gln Gln

145 150 155 160

Gly Gly Asn Ala Ala Ala Ala Ala Ala Gly Thr Gln Gly Gly Gly Gly

165 170 175

Trp Glu Ser Phe Leu Leu Lys Arg Asn Leu Lys Val Leu Leu Val Glu

180 185 190

Asp Asp Asp Ala Thr Arg His Val Val Gly Ala Leu Leu Arg Asn Cys

195 200 205

Asn Tyr Glu Val Thr Pro Val Ala Asn Gly Ser Leu Ala Trp Gly Leu

210 215 220

Leu Glu Glu Ala Asn Ser Asn Phe Asp Leu Val Leu Thr Asp Val Val

225 230 235 240

Met Pro Tyr Leu Ser Gly Val Gly Leu Leu Ser Lys Met Met Lys Arg

245 250 255

Glu Ala Cys Lys Arg Val Pro Ile Val Ile Met Ser Ser Tyr Asp Ser

260 265 270

Leu Gly Ile Val Phe Arg Cys Leu Ser Lys Gly Ala Cys Asp Tyr Leu

275 280 285

Val Lys Pro Val Arg Lys Asn Glu Leu Lys Asn Leu Trp Gln His Val

290 295 300

Trp Arg Lys Cys His Ser Ser Ser Gly Ser Arg Ser Gly Ser Gly Ser

305 310 315 320

Gln Thr Gly Glu Val Ala Lys Pro Arg Ser Arg Gly Val Ala Ala Ala

325 330 335

Asp Asn Pro Ser Gly Ser Asn Asp Gly Asn Gly Ser Ser Asp Gly Ser

340 345 350

Asp Asn Gly Ser Ser Arg Val Asn Ala Gln Gly Gly Ser Asp Asn Gly

355 360 365

Ser Gly Asn Gln Ala Cys Met Gln Pro Val Gln Val Leu Arg Asn Ser

370 375 380

Ala Ile Pro Glu Ala Val Asp Gly Asp Glu Glu Gly Gln Ala Thr Ser

385 390 395 400

Gln Asp Lys Gly Ala Asp Leu Asp Gly Glu Met Gly His Asp Leu Glu

405 410 415

Met Ala Thr Arg Arg Ser Ala Cys Val Thr Thr Gly Lys Asp Gln Gln

420 425 430

Pro Glu Asp Ala Gln Lys Gln Asp Glu Asp Ala Val Cys Ile Leu Gln

435 440 445

Asp Ala Gly Pro Ser Pro Asp Gly Ala Asn Ala Glu Ser Pro Ser Ser

450 455 460

Ser Gly Arg Asn Asp Ala Ala Glu Glu Ser Ser Pro Lys Ile Ile Asp

465 470 475 480

Leu Ile Asn Val Ile Ala Cys Gln Pro Gln Thr Gln Asp Ala Glu Pro

485 490 495

Gln Glu Ser Glu Asn Asp Asp Glu Glu Leu Asp Pro Arg Gly Arg Ser

500 505 510

Ser Pro Lys Asn Asn Ser Ala Ser Asp Ser Gly Thr Ser Leu Glu Leu

515 520 525

Ser Leu Lys Arg Pro Arg Ser Ala Val Gly Asn Gly Gly Glu Leu Glu

530 535 540

Glu Arg Gln Pro Leu Arg His Ser Gly Gly Ser Ala Phe Ser Arg Tyr

545 550 555 560

Gly Ser Gly Gly Thr Ile Ile Gln Gln Tyr His Gln Thr Gly Gly Ser

565 570 575

Leu Pro Leu Ser Gly Tyr Pro Val Ser Gly Gly Tyr Gly Val Tyr Gly

580 585 590

Met Ser Gly Gly Ser Pro Gly Gly Ser Leu Arg Leu Gly Met Gly Met

595 600 605

Asp Arg Ser Gly Ser Ser Lys Gly Ser Val Glu Gly Thr Thr Pro Pro

610 615 620

Pro Ser His Pro Gln Ser Met Glu Lys Val Gly Gly Gln Asp Gly Tyr

625 630 635 640

Gly Asn Ala Arg Gln Thr Thr Glu Asp Ala Met Ile Val Pro Gly Met

645 650 655

Pro Met Ala Ile Pro Leu Pro Pro Pro Gly Met Leu Ala Tyr Asp Gly

660 665 670

Val Ile Gly Thr Tyr Gly Pro Ala Met His Pro Met Tyr Tyr Ala His

675 680 685

Pro Ser Ala Trp Met Ala Ala Pro Ser Arg His Met Gly Glu Arg Gly

690 695 700

Asp Val Tyr Asn Gln Ser Pro Ala Phe Gln Glu Gln Asp Ser Gly Ser

705 710 715 720

Gly Asn His Ser Gln Ala Gly Gln Thr His Gln His Met His His His

725 730 735

Gln Gly Asn Gln His His His His His His His His His His Gly Ser

740 745 750

Gly Ala Gln Pro Ser Gly Asn Ala Gly Val Gln Asp Glu Gln Gln Gln

755 760 765

Ser Val Val Pro Pro Gly Ser Ser Ala Pro Arg Cys Gly Ser Thr Gly

770 775 780

Val Asp Gly Arg Ser Gly Ser Ser Asn Gly Tyr Gly Ser Thr Gly Asn

785 790 795 800

Gly Asn Gly Ser Met Asn Gly Ser Ala Ser Gly Ser Asn Thr Gly Val

805 810 815

Asn Asn Gly Gln Ser Gly Phe Gly Ala Thr Pro Met Leu Thr Asp Asn

820 825 830

Ser Gly Ser Asn Gly Val Gly Gly Thr Asp Ala Ala Met Asp Gly Val

835 840 845

Ser Gly Gly Asn Gly Leu Cys Thr Glu Gln Met Arg Phe Ala Arg Arg

850 855 860

Glu Ala Ala Leu Asn Lys Phe Arg Gln Lys Arg Lys Glu Arg Cys Phe

865 870 875 880

Glu Lys Lys Val Arg Tyr Gln Ser Arg Lys Arg Leu Ala Glu Gln Arg

885 890 895

Pro Arg Val Arg Gly Gln Phe Val Arg Gln Ala Val His Asp Pro Ser

900 905 910

Ala Gly Asp Ala Glu

915

<210> 53

<211> 1359

<212> PRT

<213> 卡特氏团藻

<400> 53

Met Glu Phe His Val Leu Leu Val Glu Asp Asp Arg Val Thr Leu Lys

1 5 10 15

Thr Val Glu Gln Leu Leu Arg Lys Cys Asn Tyr Lys Val Thr Cys Ala

20 25 30

Ala Asn Gly Arg Glu Ala Ile Lys Val Leu Thr Ala Cys Arg His Ser

35 40 45

Gly Val Lys Val Asp Leu Ile Leu Thr Asp Ile Leu Met Pro Glu Val

50 55 60

Thr Gly Phe Asp Leu Ile Asn Glu Val Val His Gly Asp Thr Phe Cys

65 70 75 80

Asp Val Pro Val Val Val Met Ser Ser Gln Asp Ser Gln Glu Asn Val

85 90 95

Leu Gln Ala Phe Gln Ala Gly Ala Ala Asp Tyr Leu Ile Lys Pro Ile

100 105 110

Arg Lys Asn Glu Leu Ala Thr Leu Trp Gln His Val Trp Arg Ala Asn

115 120 125

Lys Ala Lys Gly Ser Gly Ser Gly Thr Thr Thr Asn Val Thr Gly Gln

130 135 140

Pro Leu Ser Gly Arg Glu Asp Leu Glu Ala Gly Glu Ala Val Ala Val

145 150 155 160

Ala Ala Ala Ala Ala Ala Ala Ser Gly Lys Ala Cys Ala Ala Thr His

165 170 175

Gly His Leu Lys Asp Ser Ser Gly Gly Ser Ser Gly Ala Ala Ala Ser

180 185 190

Val Leu Gln Ser Thr Gly Gly Thr Leu Leu Pro Asp Arg Ala Ala Thr

195 200 205

Val Arg Tyr Pro Ala Ala Ala Ala Ala Pro Pro Pro Pro Gly Ala Ser

210 215 220

Glu Leu Ser Gly Asn Val Thr Ala Gly Glu Ala Gln Gly Ser Arg Thr

225 230 235 240

Gln His Leu Arg His Leu Ser Gly Leu Ala Gly Met Glu Ser Thr Ala

245 250 255

Ala Thr Ser Ala Ala Ala Gln Gly Ser Ser Ala Ala Gly Pro Leu Arg

260 265 270

Gly Cys Gly Gly Ala Gly Thr Ala Ile Ala Gly Gly Pro Arg Ala Pro

275 280 285

Leu Gly Pro Leu Ser Phe Ala Pro Phe Gly Thr Ser Val Ala Val His

290 295 300

Phe Asp Leu Asn Pro Ala Ser Gly Ala Ala Arg Arg Leu Val Asn Ser

305 310 315 320

Ser Gly Ala Ile Asp Ala Ser Thr Gly Ser Gly Thr Ala Gly Val Ala

325 330 335

Ala Ser Ser Arg Cys Ala Ala Gly Thr Ser Ala Thr Val Ile Ser Trp

340 345 350

Ser His Val Asp Pro Thr Glu Thr Asp Pro Ala Glu Ala Glu Pro Met

355 360 365

Tyr Asp Thr Asn Ala Asp Ala Thr Ala Ala Lys Ala Ala Ala Asp Gly

370 375 380

Val Ala Glu Ala Asp Asp Asp Asp Val Gly Asp Asp Gly Gly Ala Gly

385 390 395 400

Pro Asn His Asn Asp Asp Asp Asp Glu Gly Gly Gly Asp Asp Asp Val

405 410 415

Ser Gly Asp Gly Asp Glu Asp Gly Asn Arg Pro Arg Lys Arg Pro Arg

420 425 430

Leu Leu Gln Gly Ser Ser His His His Ser His Gln His Arg Leu His

435 440 445

Ser Leu Gly Gly Thr Thr Thr Asn Thr Thr Thr Thr Thr Thr Ala Ala

450 455 460

Lys Pro Lys Ser Thr Ala Gly Glu Arg Gly Gly Ala Ala Ala Leu Leu

465 470 475 480

Ala Cys Arg Thr Ala Ala Ala Ala Pro Leu Arg Gly Ser Gly Cys Ala

485 490 495

Thr Ala Gly Ala Thr Gly Ala Cys Arg Leu Ala Ala Ala Ala Ala Ala

500 505 510

Ala Glu Gly Ser Gln Gly Ser Arg Ala Ala Ser Ala Ser Ala Gly Pro

515 520 525

Asp Gly Gly Ala Arg Glu Ser Thr Ala Thr Pro Ser Gly Asp Thr Phe

530 535 540

Ala Glu Ser Pro Ser Ala Tyr Thr Ala Thr Ala Thr Thr Thr Ser Thr

545 550 555 560

Ala Thr Thr Ser Thr Thr Thr Gly Ser Gly Ile Glu Met Gln Asp Asp

565 570 575

Glu Gln Gln Gln Arg Gln Gln Pro Lys Gln Arg Pro Pro Ala Ser Gln

580 585 590

Pro Glu Leu Glu Gly His His His Gln Gln Gln Tyr His His Tyr Tyr

595 600 605

Arg Arg Thr Ser Leu Glu Gly Gly Cys Ala Asn Ala Pro Pro Leu Pro

610 615 620

Val Pro Ser Ser Ala Arg Gly Ala Ser Pro Ala Gly Thr Gly Pro Thr

625 630 635 640

Glu Ser Gly Ser Gly Arg Asp Ser Gly Cys Ala Arg Ile Thr Asn Gly

645 650 655

Thr Ala Ala Gly Ala Thr Ala Ala Met Pro Pro Ser His Val Ser Ser

660 665 670

Ala Ser Pro Pro Arg Cys Thr Ala Thr Ser Ala Ala Ala Thr Arg Gly

675 680 685

Ser Ser Gly Ala Ala Thr Ala Ala Ala Gly Ala Met Thr Thr Ala Leu

690 695 700

Ala Thr Ala Gly Ser Tyr Pro Arg Gly Val Asp Ala Ser Pro Pro Pro

705 710 715 720

Asn Arg Ser Met Gly Ser Ser Gly Gly Asp Gly Gly Gly Thr Ala Ala

725 730 735

Ala Ala Ala Gly Thr Ala Arg Gly Ser Ser Pro Ala Ala Ala Thr Pro

740 745 750

Pro Leu Pro Pro Ser Thr Gln Gln His Gly Leu Pro His Pro Ala Ala

755 760 765

Ala Pro Pro Pro Gly Ala Ala Ser Pro Gly Gly Ala Val Thr Leu Pro

770 775 780

Pro Ala Leu Gln Glu Leu Ala Ala Leu Gly Ala Ala Arg His Ala Gly

785 790 795 800

Leu Trp Thr Gln Arg Ala Leu Leu His Gln Gln Gln Leu Leu Leu Gln

805 810 815

Gln Gln Lys Gln Gln Lys Gln Gln Gln His Gln Gln Asp Gln Val Val

820 825 830

Gly Ala Glu Lys Ile His Gly Gly Ser Thr Ser Ala Val Ala Asn Ala

835 840 845

Ala Glu Gln Gln Gln Gln Gln Pro Leu Gly Ala Ala Ala Ala Arg Arg

850 855 860

Pro Ser Lys Ala Gly Val Asp Gly Thr Glu Ala Gly Ser Gly Ala Val

865 870 875 880

Gly Gly Cys Ala Ser Ala Thr Ala Ala Val Met Ala Met Glu Ala Ser

885 890 895

Glu Pro His Gly Ala Val Gly Ser Ser Phe Thr Ala Ala Asp Arg Gln

900 905 910

Glu Thr Pro Leu Gln Pro Leu His Ala Glu Ser Ala Ala Ala Gly Gly

915 920 925

Asp Met Asp Gly Asn Arg Ser Thr Pro Ala Thr Met Pro Ser Gly Pro

930 935 940

Thr Ala Ala Ala Ser Gly Pro Ser Gln Thr Ser Asn Ser Leu Thr Val

945 950 955 960

Leu Arg His Ser Asp Arg Ser Ala Phe Thr Ala Phe Thr Val Phe Leu

965 970 975

Pro Ser Arg Val Ala Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala

980 985 990

Ala Arg Pro Pro Pro Pro Pro Ala Pro Val Gln Pro Pro Ala Pro Ile

995 1000 1005

Phe Thr His Pro Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala

1010 1015 1020

Ala Gly Ser Gly Gly Ala Ala Ser Val Trp Tyr Pro His Leu His

1025 1030 1035

His His His His Tyr Leu Gln Gln Gln Gln Thr His Met Gly Pro

1040 1045 1050

Leu Pro Pro Leu Pro Gly Ala Val His Val Leu Pro Ser Ile Met

1055 1060 1065

Gln Leu His Met Gly Val Leu Ala Pro Gly Pro Pro Pro Gln Gln

1070 1075 1080

Gln Gln Gln Gln His Leu Gln Ala Lys Ala Pro Gln Lys Pro His

1085 1090 1095

Asp Ser Ala Ala Ala Ala Gly Gly Ala Asn Gly Ser Leu Gly Pro

1100 1105 1110

Ala Thr Ser Ala Ala Ala Ala Thr His Met Ser Tyr Thr Gly Met

1115 1120 1125

Gln Gln Arg Pro Gly Ala Ser Ser Ala Thr Thr Thr Ser Ala Gly

1130 1135 1140

Ala Val Ala Phe Gly Gln Ser Pro Pro His Gly Leu Ala Ala Ala

1145 1150 1155

Ala Ala Ala Ala Ser Thr Pro Pro Pro Pro Pro Pro Pro Pro Val

1160 1165 1170

Cys Ile Pro Glu Ser Val Leu Gln Leu Ile Ala His Leu Ser Gly

1175 1180 1185

Arg Ala Ala Ala Glu Leu Pro Val Pro Glu Thr Val Thr Thr Ala

1190 1195 1200

Pro Leu Val Val Gln Lys Ala Pro Ser Ala Ala Arg Leu Ala Ala

1205 1210 1215

Val Ala Lys Tyr Leu Glu Lys Arg Lys His Arg Asn Phe Gln Lys

1220 1225 1230

Lys Val Arg Tyr Glu Ser Arg Lys Arg Leu Ala Glu Ala Arg Pro

1235 1240 1245

Arg Val Arg Gly Gln Phe Val Lys Ala Ser Thr Ser Ala Val Ala

1250 1255 1260

Ala Thr Thr Pro Ala Ala Thr Gly Ala Thr Val Thr Ser Thr Ser

1265 1270 1275

Leu Arg Gln Pro Val Tyr Thr Ala Ala Gly Pro Ala Gly Leu Ala

1280 1285 1290

Leu Pro Pro Ala Ala Ala Ala Ala Ala Ala Ser Ala Ala Ala Ala

1295 1300 1305

Arg Gly Val Pro Pro Pro Ser Ser Arg Ile Gly Ala Val Glu Leu

1310 1315 1320

Ala Glu Leu Val Pro Asp His Asp Ala Asp Ile Glu Asp Glu Gly

1325 1330 1335

Cys Asp Glu Pro Ala Ala Ala Glu Asp Ser Asp Gly Ser Val Ala

1340 1345 1350

Val Glu Leu Ala Glu Val

1355

<210> 54

<211> 1102

<212> PRT

<213> 莱茵衣藻

<400> 54

Met Glu Ala Asn Gly Phe His Val Val Leu Val Glu Asp Asp Asn Ile

1 5 10 15

Cys Leu Lys Val Val Glu Gln Leu Leu Arg Lys Leu Ser Tyr Arg Val

20 25 30

Ser Thr Ala Ser Asp Gly Ala Ala Ala Leu Lys Val Leu Ala Asp Cys

35 40 45

Lys Gln Arg Gly Asp Lys Val Asp Leu Ile Leu Thr Asp Ile Leu Met

50 55 60

Pro Glu Val Thr Gly Phe Asp Leu Ile Asn Glu Val Val His Gly Glu

65 70 75 80

Thr Phe Ala Asp Ile Pro Val Val Val Met Ser Ser Gln Asp Ser Gln

85 90 95

Glu Ser Val Leu Gln Ala Phe Gln Ala Gly Ala Ala Asp Tyr Leu Ile

100 105 110

Lys Pro Ile Arg Lys Asn Glu Leu Ala Thr Leu Trp Gln His Val Trp

115 120 125

Arg Ala Asn Arg Ala Lys Gly Gly Gln Thr Ser Ser Gly Ala Ala His

130 135 140

Val Gly Ala Gly Gly Arg Gly Gly Thr Ser Ser Arg Asp Gly Gly Gly

145 150 155 160

Val Ala Gly Thr Arg Cys Gly Pro Gly Asp Arg Gly Gly Ser Gly Gly

165 170 175

Asp Ala Glu Gly Ser Gly Leu Gly Gly Gly Ala Gly Ala Val Lys Asp

180 185 190

Ser Ser Gly Gly Ser Thr Gly Ala Ala Thr Ser Val Leu His Ser Thr

195 200 205

Gly Gly Thr Thr Leu Pro Ser Arg Ala Ala Thr Gly Arg His Ala Ser

210 215 220

Thr Ser Ala Gly His Gly Val Thr Ser Ala Asp Pro Ser Asn Asn Gln

225 230 235 240

Thr Ser His Ala His Ala His Ala His Ala His Ala His Gly Asn Ala

245 250 255

His Ala His Ala His Leu His Met His Gly Ala Thr Asp Arg Ala Ala

260 265 270

Gln Gly Ser Ser Ala Asn Gly Pro Ala Asn His Gly Ala Ala Gly Thr

275 280 285

Gly Leu Gln Ser Ala Gly Met Ala Gly Ser Thr Ala Ala Gly Ala Ala

290 295 300

Ala Pro Ala Gly Glu Ser Leu Ala Lys Pro Pro Phe Ala Ser Leu Ala

305 310 315 320

Val His Phe Asp Leu His Ser Val Leu Ala Gly Ala Gly Ala Ala Ala

325 330 335

Ala Asn Gly Gly Ala Asn Ala Ala Ala His Thr Ala Gly Ala Thr Gly

340 345 350

Arg Glu Ser Gly Gln Ala Ala Gly Ala Ala Thr Gly Gly Ile Ala Ala

355 360 365

Ala Gly Thr Val Ile Gly Trp Ser His Ala Asp Met Asp Val Asp Gly

370 375 380

Gly Glu Ala Gly Ala Gln Asp Glu Asp Asp Glu Asp Glu Asp Asp Gly

385 390 395 400

Val Glu Ala Pro Ala Gly Thr Gln Asn Arg Lys Arg Ala Ala Asp Asp

405 410 415

Ser Gly Cys Asp Gly Ala Ala Ala Asn Asn Asn Gly Asn Thr Ala Ala

420 425 430

Lys Ala Gly Ala Ala Ala Ile Ala Ala Gly Gly Pro Gly Ser Ser Gly

435 440 445

Arg Ala Lys Ala Thr Asp Gly Ala Arg Ala Glu Ile Arg His Asn Gly

450 455 460

Gly Pro Met Ala Ala Arg Met Ala Ala Ala Glu Gly Ser Gln Gly Ser

465 470 475 480

Arg Ala Ala Ser Gly Ser Ala Ala Thr Gly Pro Gly Gly Ala Arg Glu

485 490 495

Gly Thr Ala Thr Pro Ser Gly Asp Thr Phe Ala Glu Ser Pro Ser Thr

500 505 510

Phe Thr Ser Ile Ile Asn Thr Thr Gly Ser Gly Ser Glu Ala Asp Glu

515 520 525

Gln Pro Val Pro Leu Lys His Gln Glu Gln Gln Gln Gln Gln Gln Gln

530 535 540

Gln Arg Val Gly Glu Gly Asp Arg Ala Lys Pro Glu Pro His Pro Gln

545 550 555 560

Asn Pro Ala Gln Ala Ala His Leu Pro His Pro Ser Ala Ala Pro Cys

565 570 575

Ser Gly Gly Gly Gly Ile Ala Gln Ala Ala Leu Pro Leu Gly Leu Gln

580 585 590

Glu Leu Ala Ala Leu Gly Ala Ala Arg His Lys Glu Leu Trp Thr Gln

595 600 605

Arg His Leu Met His Gln Arg Gln Ala Ala Ala Ala Ala Thr Ala Ala

610 615 620

Ala Ala Ser Ala Ala Ala Ala Ala Ala Met Pro Thr Ala Gly Ala Ser

625 630 635 640

Ala Ala Ala Pro Ala Gly Pro Pro Ser Ala Arg Pro Ser Ala Ser Leu

645 650 655

Ala Asp Thr Gly Gly Asp Gly Pro Ala Ala Ala Thr Ala Pro Glu Thr

660 665 670

Arg Ala Asp Gly Pro Ser Gly Pro Ala Thr Thr Gln Gly Pro Lys Arg

675 680 685

Asp Ala Val Ala Gly Ala Ala Ala Val Gly Ser Ser Ala Arg Ser Asp

690 695 700

Ser Pro Leu Pro Ala Ala Ala Ala Ala Thr Ala Gly Ala Asn Gly Ala

705 710 715 720

Ser Gly Ala Ala Ser Asp Val Leu Ala Gly Ala Gly Ser Leu Ala Leu

725 730 735

Leu Arg His Ser Asp Arg Ser Ala Phe Thr Ala Phe Thr Val Phe Leu

740 745 750

Pro Gly Arg Val Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala

755 760 765

Ala Ala Thr Ser Ala Gly Ala Ser Thr Gly Thr Ala Asn Gly Ala Pro

770 775 780

Pro Ala Pro Gly Thr Ala Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala

785 790 795 800

Ala Ala Ser Ala Val Pro Leu Pro His Pro His Thr Ala Pro Pro Ala

805 810 815

Leu Phe Gly Val Pro Pro Pro Ser Ser Val Pro Pro Ser Ser Leu Ser

820 825 830

Val Leu Pro Pro Val Met Pro Leu His Pro Ala Ala Ala Ala Ala Ala

835 840 845

Ala Thr Ala Gly Gly Gly Lys Pro Ser Asp Ala Ala Thr Tyr Ala Ala

850 855 860

Ala Ala Ala Ala Gly Leu Val Pro Tyr Pro Gly Phe Ala Pro Ala Arg

865 870 875 880

Pro Gly Pro Phe Pro Pro Pro Pro Gly Ser Gly Gly Pro Gly Ala Pro

885 890 895

Pro Val Tyr Ile Pro Glu Ser Val Leu Gln Leu Ile Ala His Leu Ser

900 905 910

Gly Arg Ala Ala Ala Glu Ile Pro Ala Val Pro Ala Glu Ser Val Thr

915 920 925

Ala Ala Pro Val Val Val Gln Lys Ser Gly Gly Pro Ala Ser Ala Ala

930 935 940

Arg Leu Ala Ala Val Ala Lys Tyr Leu Glu Lys Arg Lys His Arg Asn

945 950 955 960

Phe Gln Lys Lys Val Arg Tyr Glu Ser Arg Lys Arg Leu Ala Glu Ala

965 970 975

Arg Pro Arg Val Arg Gly Gln Phe Val Lys Ala Gly Thr Ala Gly Ala

980 985 990

Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Gly Thr Ala

995 1000 1005

Ala Thr Ala Ala Gly Thr Gly Thr Ala Arg Gly Ala Ala Thr Ala

1010 1015 1020

Ser Gly Ala Ala Gly Lys Pro Glu Leu Gln Gly Pro Asp Thr Ala

1025 1030 1035

Glu Glu Ala Ala Ala Ala Thr Leu Leu Ser Ala Ala Ala Ala Met

1040 1045 1050

Ala Ala Ala Ala Ala Gly Thr Ser Gly Pro Ser Gly Ser Gly Ser

1055 1060 1065

Gly Ala Met Asp Val Asp Gly Ala Asp Pro Glu Ala Asp Ala Asp

1070 1075 1080

Val Met Asp Glu Asp Asp Gly Glu Asp Asp Gly Ser Asp Glu Ser

1085 1090 1095

Ala Gly Glu Pro

1100

<210> 55

<211> 936

<212> PRT

<213> 佐夫色绿藻

<400> 55

Met Ser Ala Asp Ala Gly Gly Gln Lys Pro Gly Val Ala Glu Pro Gly

1 5 10 15

Ala Arg Thr Gly Pro Gly Phe Ser Val Asn Ser Ser Phe His Val Leu

20 25 30

Leu Val Asp Asp Asp Ala Val Thr Leu Lys Tyr Val Glu Gln Leu Leu

35 40 45

Arg Lys Cys Ser Tyr Glu Val Thr Thr Ala Thr Asn Gly Arg Glu Ala

50 55 60

Ile Glu Val Leu Glu Gly Arg Arg Gly Gln Val His Ile Asp Leu Ile

65 70 75 80

Leu Thr Asp Ile Ser Met Pro Glu Val Asn Gly Val Gln Leu Ile Glu

85 90 95

Glu Val Val Asn Gly Gly Lys Trp Lys Asn Leu Pro Val Ile Val Met

100 105 110

Ser Ser His Glu Ala Gln Ala Asn Val Leu Glu Ala Phe Gln Ala Gly

115 120 125

Ala Ser Asp Tyr Leu Ile Lys Pro Leu Arg Arg Asn Glu Leu Ser Thr

130 135 140

Leu Trp Gln His Val Trp Lys Ala Ser Lys Leu His Gln Pro His Leu

145 150 155 160

His Gly Glu Asp Asp Glu Asp Glu Asp Asp Thr Ala Leu Asp Asn Gly

165 170 175

Lys Phe Asp Ser Ser Ala Gly Asn Asn Lys Gly Ser Ser Gly Ala Ser

180 185 190

Thr Ser Ala Ala Gly Asp Ala Thr Ala Leu Ser Met Ala Asp Ala Ser

195 200 205

Arg Ala Leu Tyr Glu His His Pro Ser His His His Asn His Ile Gly

210 215 220

Glu Pro Ser Ile Asp Thr Gln Ala Ser Gly Gln Val Gly Ser Asn Asp

225 230 235 240

Pro Ser Leu Val Ile His Pro Leu Asp Ile Ser Pro Leu Pro Ala Ala

245 250 255

Ala Pro Pro Leu Ala Val Ala Gly Asp Pro Ala Leu Ala Ala Ala Pro

260 265 270

Leu Gly Thr Gly Gly Gln Asp Thr Pro Gly Ser Gly Asp Glu Gln Ala

275 280 285

Thr Ala Gly Thr Ser Val Gln Gln His Gln His Ser Gln Ala His His

290 295 300

His Ser Arg His Ile Pro Ala Ser Gly Ser Gly Thr Thr Glu His Ala

305 310 315 320

Pro Gln Ser Leu Ser Gln His Pro His His His Asn His Gln His His

325 330 335

His Asn Ser His His His His His Gln His Asp Leu Ala Gln Gln Arg

340 345 350

Gln Gln His His His His His Ser Asn Gly Val Asn Gln Asp His Ser

355 360 365

Gln Pro Asn Pro Asp Leu Thr Gln Met Pro Ser Ala Asp Gln Gln Ser

370 375 380

Leu Leu Thr Leu Pro His Ser Pro Asn Gly Ala Met Pro Leu Phe Lys

385 390 395 400

Pro Ser Thr Ser Ser Ala Ala Met Asp Cys Ser Thr Gln Gln Pro Leu

405 410 415

Gln Gln Gln Gln Gln His Glu His Gly Ser Ser Ser Pro Ala Leu Ser

420 425 430

Arg Pro His Ala Glu Lys Ser Pro Phe Gly Val Arg Tyr Gly Gly Gly

435 440 445

Asn Gly Gly Tyr Ser Ser Ser Met Ser Gly Ala Ser Leu Pro Pro Gly

450 455 460

Leu Gln Glu Leu Ala Val Leu Gly Gln Gln Arg Gln Ala Ala Arg Glu

465 470 475 480

Lys Asp Leu Gln Gln Arg Gln Gln Gln Gln Gln Lys Gln Gln Gln Gln

485 490 495

Gln Gln Gln Thr Ser Ala Leu Arg His Ser Asp Ser Ser Ala Phe Thr

500 505 510

Ala Phe Thr Val Phe Leu Pro Lys Gly Ser Asn Gly Leu Asn Arg Ser

515 520 525

Ser Gly Val Gly Val His Gly Ser Asn Ser Gln Thr Ser Gly Gly Gly

530 535 540

Ala Ala Asp Leu Gly Arg Ser Ala Ser Ser Met Glu Ile Leu Ser Thr

545 550 555 560

Ala Glu Thr Leu Val Gly Gln Thr Ala Gly Gly Ala Gly Val Asn Gly

565 570 575

Val Gly Ser Ala Lys Pro Gly Gly Asp Cys Leu Lys Glu Glu Ser Pro

580 585 590

Asn Asp Ser Thr Pro Ser Ala Glu Glu Gly Asp Glu Gln Asp Val Lys

595 600 605

Pro Pro Gln Ser Thr Ser Gly Ala Ala Ala Ala Glu Pro Ala Val Ala

610 615 620

Thr Ala Ser Gly Arg Ala Ala Thr Ala Ala Ile Ala Val Val Ala Asp

625 630 635 640

Ala Thr Val Ala Lys Pro Asp Ala Pro Val Ala Thr Ser Asp Gln Lys

645 650 655

Gln Val Leu Pro Phe Pro Gly Val Asn Gly Ala Ala His Leu Thr Gly

660 665 670

Met Asn Asn Gly Val Ser His Ser Gly Thr Ala Gly Ser Tyr Ser Glu

675 680 685

Leu Thr Gln Met Leu Tyr Ala Gln Leu Pro His Gln Gly Gln Pro Leu

690 695 700

Pro Asp His Val Met His Phe Leu His Asn Phe Tyr Arg Thr Met Met

705 710 715 720

Glu His Gln His Gln Gln Gln Ser Gln Gln Met Asp Gln Leu His His

725 730 735

His Val Gln Gln Gln Gln Gln Gln Gln Val Gln Gln Gln Gln Arg His

740 745 750

Leu Gln Gln Phe Ala Thr Ala Pro Asn Gly Gln Ala Pro Pro Asn His

755 760 765

Ser Asn Thr Asn Gln His Leu Gln Gln Gln Gln Gln Ala Cys Gly Asn

770 775 780

Gln Pro Leu Gln His Thr Ser Gln Pro His Cys Asn Gly Ala Ala His

785 790 795 800

Leu Gln His Leu Gln Gln Ser His Ser Ala Pro Ser Leu His Thr Pro

805 810 815

Gly Phe Thr Cys Thr Thr Thr Ala Thr Gln Ser Asn Thr Glu Pro Ser

820 825 830

Cys Met Leu Thr Gln Ser Glu Gly Ala Pro Cys Ala Ser Ser Tyr Arg

835 840 845

Ala Ala Ala Val Ala Lys Tyr Arg Glu Lys Arg Lys Asn Arg Asn Tyr

850 855 860

Asp Lys Lys Val Arg Tyr Glu Ser Arg Lys Arg Leu Ala Glu Ser Arg

865 870 875 880

Pro Arg Val Lys Gly Gln Phe Val Lys Gln Glu Val Leu Ala Ala Ala

885 890 895

Gly Leu Thr Ala Leu Ala Glu Leu Ala Thr Ala Asn Lys Arg Ala Arg

900 905 910

Leu Asp Val Asp Tyr Val Thr Ala Thr Gly Met Thr Asp Ala Asp His

915 920 925

Met Asp Thr Ala Glu Glu Ser Ser

930 935

<210> 56

<211> 444

<212> PRT

<213> 胶球藻

<400> 56

Met Ala Ala Gly Leu Lys Arg Ile Pro Ser Phe Ser Gly Arg Pro Gly

1 5 10 15

Phe Pro Asn Gly Leu Gln Val Leu Val Val Asp Gly Asp Thr Ser Ser

20 25 30

Ser Gln Cys Leu Arg Gln Lys Leu Glu Glu Leu Ala Tyr Glu Val Ser

35 40 45

Cys Cys Ser Ser Gly Ser Asp Ala Ser Ala Leu Leu Arg Lys Glu Asp

50 55 60

Ser Ser Tyr Asp Ile Leu Leu Val Glu Ala Lys Ala Leu Ala Lys Asp

65 70 75 80

Ala Thr Asp Gly Gly Ser Leu Arg Asp Ser Ala Ala His Leu Pro Leu

85 90 95

Val Leu Met Ser Glu Lys Ser Ser Ser Thr Asp Ala Val Trp Arg Gly

100 105 110

Ile Glu Leu Gly Ala Ala Asp Val Leu Glu Lys Pro Leu Ser Ser Leu

115 120 125

Lys Leu Arg Asn Ile Trp Gln His Val Val Arg Lys Met Met Ser Ser

130 135 140

Ser Gln Asp Ser Ser Arg Glu Ala Val Pro Cys Lys Met Glu Pro Lys

145 150 155 160

Ser Lys Gly Lys Gly Val Ser Ala Pro Ser Ser Pro Arg Thr Pro Ser

165 170 175

Pro Ala Ala Ser Leu Leu Thr Ile Ser Ser Gly Thr Met Thr Glu Lys

180 185 190

Ser Cys Lys Gly Gly Gly Asp Glu Ala Ser Phe Ser Gly Val Gly Asp

195 200 205

Val Lys Met Ser Cys Ser Ala Glu Ala Pro Glu Pro Cys Asp Ser Arg

210 215 220

Ala Thr Ala Glu Ser Pro Ala Ser Thr Gln Thr Lys Val Thr Phe Pro

225 230 235 240

Gly Cys Leu Asn Ser Gly Gly Thr Ala Leu Ala Ala Ser Lys Asn Cys

245 250 255

Ser Arg Lys Arg Lys Ala Lys Ala Pro Asp Thr Pro Ala Ser Val Ala

260 265 270

Ser Arg Pro Pro Leu Ala Ile Arg Pro Pro Ala Trp Ala Ser Pro Phe

275 280 285

Gly Pro Pro His Gln Gly Asn Thr His Val Val Gly Met Ala Pro Pro

290 295 300

Gln Cys Tyr Met Gln Gly Val Asp Pro Thr Asn Gly Cys Val Trp Gly

305 310 315 320

Thr Pro Ala Gly Gly Val Ser Gln Ala Pro Ala Tyr Met Pro Gly Trp

325 330 335

Gly Phe Ser Pro Gln Pro Met Leu Ser Gly Ser Phe Leu Gln His Pro

340 345 350

Ser Thr Ser Asp Leu His Lys Cys Pro Ser Val Gly Ala Ser Ser Leu

355 360 365

Ala Ser Ser Leu Asp Ser Ser Leu Thr Leu Cys Gly Phe Gly Ala Asp

370 375 380

Leu Pro Asp Asp Asp Leu Leu Leu Glu Asp Val Leu Leu Pro Asp Glu

385 390 395 400

Asp Leu Leu Asp Leu Ala Pro Asp Glu Pro Ala Thr Met Lys Ala Pro

405 410 415

Glu Gln Pro Pro Ile Gly Leu Lys Leu Lys Lys Ser Ala Ser Leu Ile

420 425 430

Asp Leu Ile Asn Ala Gln Leu Ser Ala Ala Thr Ala

435 440

<210> 57

<211> 1284

<212> DNA

<213> 卵囊藻属

<400> 57

atggcgctga agcgcgttcc tagcttttcc ggtcggccga actttcccgc cggtctgcag 60

atcctggtgg tggacagcga ttcttcctca agggaggctg tagagatgca actcaaatcg 120

cactcctatc tagcaacctg ttgttgcacc tgcggcgagg ctgtggagca gctcggcacg 180

tcaaagtatg acatcgtgct ggcagagtcc aagctggttg ctgcggagtg cgttgactcg 240

acacggttgt gcgaggccgc aagggctctg cctctggttt tgatgtgcga ggactcgacg 300

gcggacgacg tgttgaaggg aatcaggctc ggcgcttgcg actttctgga gaagccgctg 360

tccccactga agctcaagaa catatggcag cacgttgttc gcaagatgat ggagcagatg 420

cacgtccgcc gcacggacga cgcggatacg tgcactacta agagcagccg cgaccaaagc 480

tgcgcgatca agggcaagtc ggtggcttcc acgccctcgt gtcccaagac accttctccc 540

gcggcttctg gcgcagacat cggctgcagc atagccacgt cggtcagcaa ggccggggac 600

gtggtcggcg agtccagcag ttccgagacg cgcaaggagc attgcagcga gaccacggag 660

tgctccgacc tcaagagctg cgccgcaaag tcagctgtgt cggcgcaaac gccggtatcc 720

accgcgaccg ttgcagctac ctggggtgcg tcgaagaaga agtcgacagc atcagccact 780

accagcagtg tcagcaaccg gccgccgctg gcgatcaaga tgccggcgcc agctgtggca 840

tacacgtcag ggcttgcgcc ctttccgccg ccgatgtttg tacctggcgg ctggggccag 900

tcaagcaacc catgcgtggt gggcacgcca atgatgccac cgccgcccgg catgggcatg 960

ccgccccacc accacgcgcc ctatggccag gtgccgccgc cgggctatcc agtcgcatgc 1020

atgcccagcg cctttgtgcc gacgccgatg ggccctggcg gcgtggcgtt tgcgccgcca 1080

cctggcgcca gctgcacatc tgctgcgtac tacccccatc ctgctgtgga tgcaagcgcg 1140

tctgcaactg ccaccttcac gggccatgtg cagatcgacc tgactaacgt gtctgctgaa 1200

gagccggcgc ccattggttt ggcgctgcgc aagaccgcgt cgctgctcga cctggtcagc 1260

gatcgcctgg gccagcgtgc gtgc 1284

<210> 58

<211> 1026

<212> DNA

<213> 四爿藻属

<400> 58

atgctgtgcc ctgctgtcca ggttgccacc atggccactg tcctggcttc cacgcatttt 60

tcggagcgcc ccagcttccc ggctgatctg gaggtgctgc ttctggattc agcaacgcag 120

ggcgcagaaa ctgcctcgaa gctgttgctg tcgtgttcct atcgtgtcac cgtgtgccga 180

tccgtgtctg aggctctgag ccacatggca tgcaaggctt tcgacgtggt cctggtggag 240

cagaaacttt tcagcggcag ggatgcggcc gctgcgcagc tcaaggccct ggcaggcgtc 300

atccccaccg tggtcctgag tgacagcggc agtgcgaagg atacctgggc tgccatcgtt 360

gggcaggcag ccgatgtcct catccgcccg ctgaccaagc agaagctgca gacgctgtgg 420

cagcacactg tccgtatgca gcgcgcagca tcttcggctt cggcggctac tagcatggtt 480

gccaagcctg ttgccgtgct ctcctcggct ctgaagcccg ctgcttccag tgcttcactg 540

gacaaggggc agaagcgcaa gttgaaggat catatgatgg ggcccatcat ggcacacccg 600

caagtgtcca accctggctt tatctggggc gcaccagtga tgggcgttcc ggctggacag 660

caggctcccc agaagtcaga ggccccggtc accccccaga agccaggctc agagatgcac 720

cccgagctgg atgccacaag ccacatcgcc atgggctcca gcgacaactt caacgtacct 780

gtgtatgaaa gcggcactga cagccaggag tcgcagccaa cctgcgaccc cacctctctt 840

gatgacatca atgaggatga ctacgcgttt atcgatttcg cgctcagcga ttcttttccc 900

actgtggagg aggatgagat ccttccaccc attggccttt cgctgaagaa gtccagctcc 960

ctcctgaaca tgctgaacgg tgtgcttctc tcggctcact ctgtaccgct gcagctgccc 1020

cagtag 1026

<210> 59

<211> 2076

<212> DNA

<213> 大豆

<400> 59

atgggagagg tggtcatcat gagtggagag aagaagtcag ttagagtgga gggggtggag 60

aaggaagata gtggtggaag tgggagcaag gctggtgaat ttaaggggtt gatgaggtgg 120

gagaagttct tgcccaagat ggttttgagg gtgctgttgg ttgaagcaga tgattccaca 180

agacaaatta ttgccgcgct tctcagaaaa tgcagctaca aagtggttgc tgttcctgat 240

ggcttgaagg catgggaatt actcaaggga agaccgcaca atgttgatct aattctgaca 300

gaagtggatt tgccatccat atctggctat gcacttctca cattaattat ggagcacgag 360

atttgcaaaa acatccctgt tataatgatg tcttcccaag attcaattag cacagtatac 420

aaatgcatgt tgagaggtgc tgctgattat cttgttaagc ctattagaaa aaatgaactg 480

aggaacttgt ggcaacatgt ttggagaaga caatcatcaa ccactggtat taatggcctc 540

caagatgaga gtgttgcaca acagaaggtt gaagccactg cagaaaataa tgctgctagt 600

aatcgttcaa gtggtgatgc tgcttgcatt cagagaaata tagaactaat tgagaaggga 660

agtgatgcac agagctcttg taccaagcct gactgtgaag ctgagagtga ccctgtcggt 720

aacatgcagg aattttctct gctgaaatgt ggggaagcat atccaagtgg aacagagaca 780

caacaggttg aaacaagctt tcgcttaggc cagacattaa tgatgcatga ctgtcatgct 840

ggaggattaa atgtgagtat ccgcaaaaat ggtgaggcaa gcacgactaa tgacaaggat 900

actgatacag agcattttgg gaatgctagc atcagtggtg aggctcatga caatccctat 960

gttcaaatta actcttccaa ggaagctatg gacttgattg gagcatttca tactcatcca 1020

aactgttccc tgaaaaattc cacagttaat tgcacaggca actttgacca ttctcctcaa 1080

ttggatcttt ctttgagaag atcttgtccc ggaagctttg agaataaact cactgaagaa 1140

aggcacaccc tgatgcattc taatgcttca gctttcaagc ggtatactac caggcaattg 1200

caaatatcaa tgcctgcagt gttaattaac ttctctgatc aacaaagaga acagataaca 1260

aattgtgaga aaaacatctc acacatcgct actggcagca actcagatag ttcaacacct 1320

atgcaaagat gtattgtgtc tccaactaca gtccaatcaa aagaatctga acttgcaacc 1380

tcacaccccc cgcaaggaca ttctctccca attccagtaa agggtgtaag gttcaatgat 1440

ctatgcacag cctatggttc tgtacttcct tcagtgtttc atacacagtc aggtccacca 1500

gcaatgccaa gtccaaattc agttgtgctc cttgaaccaa actttcaagt aaatgcattt 1560

tatcagtcaa atatgaaaga gagtagttca gagcagcttt atgaatctcg tggtccaaat 1620

ggaaacacca cccaaaacca cattgtgtac acacaggagc acaaatcaga acatgcagaa 1680

gatcgaggac atatctctcc tacaactgat caaagtgtgt caagtagttt ctgcaatgga 1740

aatgcaagcc atcttaacag cattggttat ggaagcaact gtggaagtag cagcaatgtt 1800

gatcaagtta acactgtttg ggcagcttca gagggaaagc atgaagacct cacaaataat 1860

gcaaactctc atcgatctat ccaaagagaa gcagctctaa acaaatttcg cttgaaaagg 1920

aaagagagat gctatgagaa gaaggttcga tacgagagca gaaaaaaact agcagagcag 1980

cgtcccagag ttaaaggaca atttgttcgt caagtgcatc ctgatcctct tgttgcagaa 2040

aaagatggca aagaatatga tcattcagat ttctga 2076

<210> 60

<211> 2244

<212> DNA

<213> 葡萄

<400> 60

atgggtgagg ttgtggtgag cagtgaggca ggaggaggag gcatggaggg tgaggtggag 60

aagaaggagg tgggcagtgg ggttgtgagg tgggagaggt ttcttcccag aatggttctc 120

agggttttgt tggttgaagc ggacgattcc accaggcaaa ttatcgctgc gcttctcagg 180

aaatgcagtt acaaagttgc tgctgttcct gatggcttaa aggcatggga ggtactgaag 240

gctagacccc acaacattga cctcatattg acagaagtgg agttgccatc aatatctggc 300

tttgctctcc tcaccttggt tatggaacat gagatctgca aaaacattcc tgttataatg 360

atgtcctcac atggttcgat aaacacggtt tataaatgca tgttgagagg tgcagctgac 420

tttcttgtta agcctgttag aagaaatgag ctgaagaatt tgtggcaaca tgtctggaga 480

agacaatcgt caactgttag cggaaatggc ccccaagatg agagtgttgc acaacagaag 540

gtcgaagcca cttctgaaaa caaccccaca agtaatcact caagtgatca tgttgcttgt 600

attcagaaaa ataaggaagc actcaataaa gtgagtgatg ctcagagctc ttgttcaaag 660

ccagacttgg aagctgagag tgcctacatg gaaactatgc aggatttctc aaatccgaca 720

tggagcagat ctcttgtgag tgacacaaaa atgcagaaga atgaagaatg tgccaaattg 780

ggcccgaaat ttcttatgca caataaagaa gctgggggaa cactggaggc tgcctgcagg 840

gatgtgaaca caatgactca gcctgaagca gtggaaccag aaaatgatgg gcaaggtgct 900

aacgctccta gtgaggcttg tggtaacaat gccatattgg gcagctcatc tagagaagcc 960

atcgacttga ttggagtatt tgataattct aaaaaatgca cttatggaaa ttcttcttca 1020

aataatggca ccaaaaagag tgattctatt ccacagttgg acctttcctt gagaagatct 1080

catcctagta gccctgagaa tcaagttgct gatgaaaggc atacactgaa ccattctaat 1140

ggctcggcct tttcacgcta cataaacagg tcattgcagc caccacatct accatcaaca 1200

ggtgttttca atcagcagaa aaactttgga gctgattctg ataaacgttt atctcagctg 1260

gttactggtt ataactctga tattactagt cccacactga gtactcaaag aagtgtgatc 1320

tctctagcta ctagtccatc tggacgagtt gaaattgcac tttgtggccc tcaacagaga 1380

gcttttcctg ctccagttcc acaaaatgcc aacaattcca ccagccagac taatcacaag 1440

ccagagcaca aattggactc actggagggt caagggcact tctctcctgc cactgatcag 1500

aattcaagta gtagttttgg taatggtggt gcaagtaatc tgaatagctt tgggtgtgga 1560

agcatttgtg gaagtaatgg gaatgccaat acagttgctg ttgttcaggc cgctgcagag 1620

ggcaagaatg aagaaggtat cttcagtcat gaaggacact ctcaacgatc tatccaaaga 1680

gaagctgctc taaccaagtt tcgcttgaag cggaaagaca gatgctttga gaagaaggtt 1740

cgttatgaaa gcagaaagaa gcttgcagag cagcgacccc gagtaaaagg acagtttgtt 1800

cgacaagtgc ataccatccc cccacctgca gagcctgata catactatgg cagttcgttt 1860

gatgttcagc ctcaaagaag ccgatatcta tcagctcaac ctctcagggc ctcatcttct 1920

caactcctct atccaactca cactcctctc caagaatcca aatacgaagg tcatgaagaa 1980

agcaatctct tgacggcgtc cttggttgga actgccctac cggtggctcc atcttttggt 2040

tatgaagttg gacgtgatca gacggcagga aaacttgttc tgagtttaaa gctcgatggc 2100

cgggttcgat ggaaggtggg gacttgggtt tctggccgat accgacttaa cgttaattgt 2160

gttgctgtga tggcatttgg accctccatc ccatctggtc cactgagttc aaaagaagga 2220

actcagtgct ctactactgt ttga 2244

<210> 61

<211> 2400

<212> DNA

<213> 可可

<400> 61

atggggatag ttcaaatgaa taataatggt cctgtggcca atgggttggt tgaattgaat 60

acacatattc atgatgagca caagaaaata aggggtgggg tcataggtga ggggcagggc 120

ctctcagtgg aagaagagtc atggattaat gaggatgtgg aagacaggaa tgatgggaag 180

acagagttgg ttcaggttca gggccatgcg catggtgagc aagagaggtc acagcaacag 240

cctcaaggtc ctttggttca ctgggagagg tttttacctc taaggtcttt gaaggttcta 300

ctggtggaaa atgatgactc aactcgccat gttgtctgtg cattgctgcg aaattgtgga 360

tttgaagtta ctgctgtgtc aaatggactg caagcttgga agatcttgga agatctaacc 420

aatcatattg atcttgtttt aactgaggta gtgatgcctt gtttgtcagg cattggcctt 480

ttatgcaaga taatgagcca caaaactcgc atgaatattc cagtgattat gatgtcatct 540

catgattcta tgagtacagt ctttaggtgt ttgtccaagg gtgcagttga ctttttagtg 600

aagcctatac gaaagaatga gcttaaaaat ctttggcagc atgtttggag gaaatgccac 660

agctctagca gtagtggagg ccaaagtggt acacagaccc aaaaatcctc aaaatcaaaa 720

ggtactgatt cagacaacaa tactggaagt aatgatgagg atgacaacgg cagtgttggt 780

ttgaatgttc aggatggaag tgacaatgga agtggcactc agagctcatg gacaaagaga 840

gcagtagaag tcgacagctc ccagccaata tcaccatggg accagttagc tgatcctcct 900

catagcactt gtgcccaggt tatccattct agacatgaag tgttaggtga cagctgggtt 960

ccagtaacag cgacgaggga gtatgatgag ctggataatg aactagaaaa tgttgttatg 1020

ggcaaagact tggagatagg ggtacctaaa attacagctt cgcagcttga agacccaagt 1080

gaaaaagtaa tgaccaacat agctggtgtt aataaagaca aattatctgc aataaaccct 1140

aagaaagatg atgagaaact agagaaagcg caattggaac ttaacagtga gaaatcaggt 1200

ggtgatttga gaaatcaagc tgctgacctg ataggtgtca tcaccaataa tactgaacct 1260

catatagaaa gcgcagtctt tgacatccca aatggcctcc ctaaggtctc tgatgcaaaa 1320

gagaaggtga actacgacac gaaggaaatg ccttttcttg agctcagttt aaagagactg 1380

agagatgtag gagacactgg aacaagtgcc catgaacgaa atgtattgag acattcagac 1440

ctttcagcct tctcaagata caattctggt tcaactgcca atcaggctcc aacaggaaat 1500

gttggtagtt gttctccact tgataatagc tcggaggcag ttaaaacaga ttctatgaag 1560

aattttcagt ctacctcaaa tagcatacct ccaaagcaac agtccaatgg aagtagtaac 1620

aataatgaca tgggttccac cactaataat gccttcagca aaccagcggt actcagtgac 1680

aagccagcac ctaaaacttc agctaaatct ttccatccct cttctgcctt ccaaccagtg 1740

cagagtggcc atggttctgc cctgcaacct gtagcacaag gtaaggctga tgctgcacta 1800

ggtaacatga ttttagttaa agcaaggggc acagaccaac aggggaaagt gcagcatcac 1860

catcatcatt atcaccacca ccaccaccac catgtccata acatgctccc aaatcaaaag 1920

ttaggtaacc atgatgattt atctttggaa aatatggcag cagcagctcc ccagtgtggg 1980

tcatccaatc tgtcaagttt accacatgtt gaaggcaatg ctgctaacca cagtttgact 2040

agaagtgcat caggaagtaa ccatggaagc aatggacaga acgggagcag cactgtgtta 2100

aataccagag gaatgaatct tgaaagtgaa aatggggtgc ctgggaaagg tggagctggc 2160

ggtggaattg gatctggagg caggaatgta gttgatcaaa accgttttgc tcaaagagaa 2220

gctgctttga acaaattccg ccagaaaagg aaagaaagat gctttgagaa gaaggttcga 2280

tatcagagca gaaagaaact ggctgagcag agaccacgca ttcgaggaca gtttgtgcga 2340

cagattagca ctactgggaa ggaagcattc agatttcgtg gtgcaggatt gtgcacttag 2400

<210> 62

<211> 2229

<212> DNA

<213> 稻

<400> 62

atgatgggaa ccgctcatca caaccaaacc gccggctctg ccctcggagt cggagtcgga 60

gatgccaacg acgccgtgcc tggggctggg ggtgggggct acagcgaccc ggatggcgga 120

ccaatctccg gtgtgcagcg gccaccgcag gtctgctggg agcgcttcat ccagaagaag 180

actatcaaag tcttgctagt tgatagcgat gactccacca ggcaggtggt cagtgccctg 240

cttcgtcact gcatgtatga agtcatccct gctgaaaatg gccagcaagc atggacatat 300

ctagaagata tgcaaaacag cattgatctt gttttgacag aggttgttat gcctggtgta 360

tctggaattt ctctattgag taggatcatg aaccacaata tttgcaagaa tattccagtg 420

attatgatgt cttcaaatga tgctatgggt acagttttta agtgtttgtc aaagggcgct 480

gttgacttct tagtcaagcc catacgtaag aatgaactta agaacctatg gcagcatgtg 540

tggagacggt gccacagctc cagtggcagt ggaagtgaaa gtggcattca gacacaaaag 600

tgtgccaaat caaaaagtgg ggatgaatcc aataataaca atggcagcaa tgacgatgat 660

gacgacgatg gtgtaatcat gggacttaat gcaagagatg gcagtgataa cggcagtggc 720

actcaagcgc agagctcatg gacaaagcgc gctgttgaga ttgacagtcc acaggctatg 780

tctccagatc aattagctga tccacctgat agcacttgtg cacaagtgat ccacctgaag 840

tcagatatat gcagcaatag atggttacca tgtacaagca acaaaaattc caagaaacaa 900

aaagaaacta atgatgactt caaggggaag gacttggaaa taggttctcc tagaaattta 960

aacacagctt atcaatcctc tccgaatgag agatccatca aaccaacaga tagacggaat 1020

gaatatccac tgcaaaacaa ttcaaaggag gcagcgatgg aaaatctgga ggagtcaagt 1080

gttcgagctg ctgacttaat tggttcgatg gccaaaaaca tggatgcaca acaggcagca 1140

agagccgcaa atgcccctaa ttgctcctcc aaagtgccag aagggaaaga taagaaccgt 1200

gataatatta tgccatcact tgaattaagt ttgaaaaggt caagatcgac tggggatggt 1260

gcaaacgcaa tccaagagga acaacggaat gttttgagac gatcagatct ctcggcattt 1320

acgaggtacc atacacctgt ggcttccaat caaggtggga caggattcat gggaagctgt 1380

tcgctgcatg ataatagctc agaggctatg aaaacggatt ctgcttacaa catgaagtca 1440

aactcagatg ctgcaccaat aaaacaaggt tctaatggta gtagcaataa caatgacatg 1500

ggttccacta caaagaacgt tgtgacaaag cctagtacaa ataaggagag agtaatgtca 1560

ccctcagctg ttaaggctaa tggacacaca tcagcatttc atcctgcaca gcactggacg 1620

tctccagcta atacaacagg aaaagaaaag actgatgaag tggctaacaa tgcagcaaag 1680

agggctcagc ctggtgaagt acagagcaac ctcgtacaac accctcgccc aatacttcat 1740

tatgttcatt tcgatgtgtc acgtgagaat ggtggatccg gggcccctca atgtggttca 1800

tccaatgtat ttgatcctcc tgtcgaaggt catgctgcca actatggtgt caatggaagc 1860

aactcaggca gtaacaatgg aagcaatggg cagaatggga gtacgactgc tgtaaatgct 1920

gaacggccaa atatggagat cgctaatggc accatcaaca aaagtggacc tggaggtggc 1980

aatggaagtg gaagcggcag tggcaatgac atgtatctga aacgcttcac tcaacgagag 2040

catagagtgg ctgcagtgat caagtttaga cagaaaagga aagagcgcaa cttcggaaaa 2100

aaggtgcggt accagagcag aaagaggctg gccgagcagc ggccaagggt ccgcggacag 2160

ttcgtgcggc aagctgtgca agaccaacaa cagcagggtg gtgggcgcga agcggcagcg 2220

gacagatga 2229

<210> 63

<211> 2301

<212> DNA

<213> 玉米

<400> 63

atgggcagtg cttgccaagc tggcacagac gggccttccc gcaaggatgt gttagggata 60

gggaatgccg ccttagagaa tggccaccat caggctgaag ctgacgcaga tgaatggagg 120

gaaaaggaag aggacttggc caacaacggg cacagtgcgc caccgccagg catgcagcag 180

gtggatgagc ataaggagga acaaagacaa agcattcact gggagaggtt cctacctgtg 240

aagacactga gagtcttgct ggtggagaat gatgactcta ctcgtcaggt ggtcagtgcc 300

ctgctccgta agtgctgcta tgaagttatt cctgctgaaa atggtttgca tgcatggcga 360

tatcttgaag atctgcagaa caacatcgac cttgtattga ctgaggtttt catgccttgt 420

ctatctggta tcggtctgct tagcaaaatc acaagtcaca aaatttgcaa agacattcct 480

gtgattatga tgtctacgaa tgattctatg agtatggtgt ttaagtgttt gtcgaaggga 540

gcagttgatt tcttggtaaa accactacgt aagaatgagc ttaagaacct ttggcagcat 600

gtttggaggc gatgccacag ttccagtgga agtgaaagtg gcatccagac acagaagtgt 660

gccaaactaa atactggcga cgagtatgag aacggcagtg acagcaatca tgatgatgaa 720

gaaaatgatg acggcgacga tgacgacttc agtgttggac tcaatgctag ggatggaagt 780

gacaatggca gtggtactca aagctcatgg acaaagcgtg ctgtggagat tgacagccca 840

caacctatat ctcccgatca actagttgat ccacctgata gtacatgtgc acaagtaatt 900

caccctagat cagagatatg cagtaacaag tggttaccga cagcaaacaa aaggaatgtc 960

aagaaacaga aggagaataa agatgaatct atgggaagat acttaggaat aggtgctcct 1020

aggaactcaa gtgcagaata tcaatcatct ctcaatgatg tatctgttaa tccaatagaa 1080

aaaggacatg agaatcacat gtccaaatgc aaatctaaaa aggaaacaat ggcagaagat 1140

gattgtacaa acatgcctag tgcaacaaat gctgaaactg ctgatttgat tagctcaata 1200

gccagaaaca cagaaggcca acaagcagta caagccgttg acgcaccaga tggcccttcc 1260

aaaatggcta atggaaatga taagaatcat gattctcata tcgaagtgac accccatgag 1320

ttgggtttga agagatcgag aacaaatgga gctacagcgg aaatccatga tgagcgaaat 1380

attctgaaaa gatcagatca gtcagccttc accaggtacc atacatctgt ggcttccaat 1440

caaggtggag caagatatgg ggaaagctct tcaccacaag ataacagttc tgaggccatg 1500

aaaacggact ctacatgcaa gatgaagtca aattcagatg ctgctccaat aaagcagggc 1560

tccaatggca gtagcaataa cgatgtggga tccagtacaa agaatgttgc tgcaaggcct 1620

tcgggtgaca gggagagagt agcgtcacca ttagccatca aatctaccca gcatgcctca 1680

gcatttcata ctatacagaa tcaaacgtca ccagctaatc tgattgggga agacaaagct 1740

gatgaaggaa tttccaatac agtgaaaatg agccacccaa cagaggttcc acaaggctgc 1800

gtccagcatc atcatcatgt gcattattac ctccatgtta tgacacagaa acagccatca 1860

acagaccgtg gatcatcaga tgttcactgt ggttcgtcaa atgtgtttga tcctcctgtt 1920

gaaggacatg ctgcaaacta cagtgtgaat gggggtgtct cagttggtca taatgggtgc 1980

aatgggcaga atggaagtag cgctgtcccc aatattgcaa gaccaaacat agagagtatt 2040

aatggtacca tgagccaaaa tattgccgga ggtggcattg taagtgggag tgggagtggc 2100

aatgacatgt atcagaatcg gttcctgcaa cgagaagctg cattgaacaa attcagactg 2160

aagcggaaag atcggaactt tggtaaaaag gttcgctacc aaagcaggaa gaggcttgct 2220

gagcagcggc cacgggtccg aggacagttt gtgcgacaat ctgagcaaga agatcaaaca 2280

gcgcaaggtt cagaaagatg a 2301

<210> 64

<211> 2034

<212> DNA

<213> 小立碗藓

<400> 64

atgccatatc tgtccggagt tgggcttctg tcgaagatga tgaagcggga agcatgcaag 60

agagtgccta ttgtcatcat gtcatcgtac gacagtcttg gcatcgtgtt ccgctgcctc 120

tcgaaaggag cttgcgacta tctcgtgaaa ccagttagga aaaacgagtt gaagaatctg 180

tggcagcacg tatggaggaa gtgccacagt tcgagtggga gcagaagtgg aagcggaagc 240

cagactgggg aagtagctaa gcctcggagt cgtggtgtag cagccgctga caatcctagt 300

ggaagcaatg atgggaatgg cagcagtgat gggagtgata atgggagcag ccgggtaaat 360

gcccagggtg gaagcgacaa tggtagtggc aatcaagctt gcatgcaacc tgtacaggtt 420

ctgaggaaca gcgcaattcc agaagcagta gacggggatg aggaggggca ggcgacatcg 480

caagataagg gtgctgactt ggatggagag atggggcatg atctggagat ggcaactcga 540

aggtctgctt gtgttaccac cggaaaagat cagcaaccag aggatgccca gaagcaagat 600

gaggatgctg tatgtatctt gcaagatgcg gggccatcac ctgatggggc taatgccgag 660

agcccatcat ctagcggtcg gaatgatgcc gcagaggagt cttctccaaa gatcattgac 720

ctgataaacg tcatagcgtg tcagccacag acccaggatg cagaacctca agaaagtgag 780

aacgatgacg aagaattgga tccgcgggga aggagcagcc ctaaaaacaa ctccgcttca 840

gattccggta cttcgctgga gttaagtttg aaacggccac gatcggcggt tggtaacggc 900

ggagaattag aagagcgtca accactgcga cattcaggag gctcggcctt ttctaggtat 960

ggcagcggag gaaccattat acagcaatac catcagactg gaggttcact ccctctcagt 1020

ggttatcctg tgtctggtgg atatggtgta tatggcatgt ccggcggtag ccctggagga 1080

tctcttcgtc tgggaatggg aatggatcga agtgggtcat cgaaaggaag tgtagagggg 1140

actacacccc caccctcgca tcctcagagc atggagaaag tgggtgggca agatgggtac 1200

ggcaatgcaa gacagactac ggaggatgca atgatcgtac ctggaatgcc catggctatt 1260

cctctcccac cacctgggat gcttgcatat gatggcgtta ttggaacgta tggtccggcg 1320

atgcacccga tgtattatgc tcaccctagc gcgtggatgg cagctccgtc tcgtcacatg 1380

ggagagcggg gagatgtcta caatcaatct cctgcatttc aagagcagga ttctgggtct 1440

gggaatcatt ctcaagcggg gcagactcac cagcacatgc accaccacca aggcaaccag 1500

caccaccatc atcatcacca tcaccaccat gggagtggcg cccagccttc tggaaatgca 1560

ggggtgcaag atgaacaaca gcaatcagtg gtaccgcctg ggtcgagtgc tcctcgctgc 1620

ggctcgaccg gtgtggatgg tcgaagtggt agcagcaacg gctacgggag caccgggaat 1680

gggaatgggt ccatgaacgg aagtgcttcg ggaagtaata ctggcgtgaa caacggtcag 1740

agtggatttg gtgcgacgcc gatgttaact gacaacagtg ggagtaacgg cgtcggtgga 1800

acggatgcag ccatggatgg ggtgagtggg ggcaatgggc tgtgcacaga gcaaatgcgt 1860

ttcgccagac gagaggctgc cttgaataag tttaggcaga agagaaagga gcgatgcttt 1920

gagaagaagg tgcgatacca aagcaggaaa cggcttgcag aacaaagacc acgagtccgc 1980

ggtcagtttg tgcggcaagc ggtacatgat ccgtctgctg gtgacgccga atag 2034

<210> 65

<211> 4080

<212> DNA

<213> 卡特氏团藻

<400> 65

atggagttcc acgtactgct ggtcgaagac gacagggtga cgctgaagac agttgagcag 60

ctactccgga aatgcaatta caaagttacc tgtgcagcaa atggacggga ggcaataaag 120

gtccttactg cctgccggca cagcggcgtc aaagtggacc ttattttgac cgatatactg 180

atgccggagg ttaccggctt tgacttaatc aatgaagtgg tacatgggga caccttttgc 240

gatgtgccag tggtcgtcat gtcctctcaa gactcgcagg agaacgtgtt acaggcattc 300

caagcaggcg ctgccgacta ccttataaag cccattcgca aaaatgagct ggctacgctc 360

tggcagcatg tctggcgcgc caacaaggcc aaggggtccg gcagcggcac caccactaac 420

gtcaccgggc agcccctttc cggtcgggag gatctggagg caggcgaagc cgtcgctgtc 480

gccgccgccg ccgccgctgc cagcggcaag gcctgtgcag caacgcatgg gcatttgaag 540

gacagcagcg gcggcagcag cggcgccgcc gcttctgtat tgcagtccac gggcggaaca 600

ctactgccgg accgtgctgc cactgtacgg tatccagctg cggcggcagc gccaccgcca 660

cctggcgcat ccgagctatc agggaacgtg acggcgggcg aagctcaagg gagccgtacg 720

cagcatctgc gccatctgtc cggcttggcg gggatggaaa gcacagcggc gacgtcagcg 780

gcggcgcaag gcagtagcgc agcagggccg ctgcggggct gcggcggtgc tggtactgct 840

atagctggtg ggccgcgcgc gcccttgggc ccactttcat tcgcgccctt cggcacttcc 900

gttgccgtac actttgacct gaaccccgca tccggcgcag ctcgacggct ggtcaactcc 960

agcggcgcca tcgatgcgtc gacgggcagc ggcactgctg gcgtcgccgc ttcatcgcgt 1020

tgcgccgccg gcacctccgc caccgtcatc agttggtcgc acgtcgatcc gacggagacg 1080

gacccagcgg aggcggagcc catgtacgac acgaacgcgg acgccaccgc ggcgaaggca 1140

gcggctgacg gtgtggcgga agctgacgac gacgatgttg gcgacgacgg cggtgctggg 1200

cccaaccaca atgacgatga tgacgagggt ggcggcgacg acgacgtcag cggcgacggt 1260

gacgaggacg gaaaccggcc tcgcaagcgt ccgcggctgc ttcagggatc ctcgcatcac 1320

cacagccacc agcatcgcct tcacagccta ggcggtacga ctaccaacac caccaccact 1380

acgacagccg cgaagcctaa gtcgacagcg ggagaacgcg gcggcgcggc ggcgctactc 1440

gcatgccgta ctgcggcggc cgcaccccta cgcggcagtg gctgcgccac cgctggcgcc 1500

accggagcat gtcgactggc ggcggcggca gcggcggcgg agggctccca gggttctcgc 1560

gccgcgtcgg cgtcggcagg ccctgacggc ggcgcgcgtg agagtacggc tacccccagt 1620

ggtgacacct ttgcagagag cccgtccgcg tacactgcaa ccgccacaac gaccagtacg 1680

gcaacaacca gtacgacaac gggatccggg attgagatgc aggacgacga gcaacagcag 1740

cgacagcagc ctaagcagcg tccgccggca tctcagccgg aactggaggg tcatcatcac 1800

caacaacaat atcaccatta ttatcgacgc accagcctgg agggcggttg cgccaacgca 1860

ccccctctcc ctgtcccttc atctgcacgg ggtgcttccc cggcaggcac gggtccgacg 1920

gaaagcggct ccgggaggga tagcggctgt gccaggatta caaatggtac ggcggcgggg 1980

gcgacggcgg caatgccgcc atctcacgtc agctcggcaa gccccccccg ctgtaccgcc 2040

acttccgcgg cggcgactcg cgggtcctct ggtgctgcta ctgcggcagc gggtgccatg 2100

acaacagcct tggcgacggc cggcagctat ccgcgaggag tggacgccag cccgccgccg 2160

aatagaagta tggggtccag cggcggtgat ggcggcggaa ccgccgctgc agctgccggt 2220

acggcacgag ggagctcgcc tgcggctgct acgccgccgc tgccaccttc tacgcagcag 2280

cacgggttgc cgcatcccgc ggcggcgccg ccgccgggcg ctgcatcgcc tggcggcgcc 2340

gtgacgctgc cgccagcgct tcaggagctg gcggcactgg gggcggcccg ccatgcgggg 2400

ctatggaccc agcgggcctt attgcatcag cagcaattgt tgctgcagca gcagaagcag 2460

cagaagcaac aacagcacca acaagaccag gtagtggggg cagagaagat tcatggtggg 2520

tcgacgtcgg ctgtagccaa cgccgccgag cagcagcagc agcagccgct gggggcggcg 2580

gcggcacgtc gtcccagcaa agcgggcgtg gacggaactg aggcgggaag tggcgcggtc 2640

ggcggatgcg catcggcgac agcggcggtc atggcgatgg aggcgtcgga gccgcatggc 2700

gcggttggca gctcctttac ggcggcagat cggcaggaga cgccgttgca gcctctgcat 2760

gctgaatctg cggcggcagg cggcgacatg gacggcaacc gcagtacacc cgcaactatg 2820

ccgtcggggc ctacggcagc cgcatcgggc ccttcgcaga cgtcgaacag cttgacggtg 2880

ctgcgacata gcgacagatc cgctttcacc gcattcaccg ttttcttgcc aagcagggtt 2940

gccggcgccg cggcggcggc ggcggcggca gcagctgctc ggccgccacc accgccggcg 3000

ccggtgcagc cgccggcgcc aatcttcacg caccctgctg ctgctgctgc agccgcggcg 3060

gcggctgccg ctggcagcgg cggtgcagcc tcagtgtggt atcctcacct ccatcatcac 3120

caccactact tgcagcagca gcagacgcac atgggtccct tgccgccact gccaggtgcc 3180

gtacatgttc tgccgtcgat catgcagctt cacatgggag tactggcgcc agggccgccg 3240

ccacagcagc agcagcagca gcaccttcag gccaaggcgc ctcagaagcc tcatgattcc 3300

gccgccgccg ccggcggagc taacggctcg ctaggtcccg cgacatcggc tgcagcggcc 3360

acgcacatgt cgtacactgg catgcaacag cgcccgggcg cctcatccgc caccaccacc 3420

agcgccggcg ctgtagcgtt cggtcaatct ccacctcacg ggctggcggc ggcggcggcc 3480

gccgctagca cgcctccgcc gcctccaccg ccgcctgttt gtattcccga atcggtacta 3540

cagctcattg cgcatctgtc tggtcgggcg gcggcggagc tgcccgtacc ggaaaccgtc 3600

acgacggcac cgttggtcgt acagaaggcg ccgtcggcag cgcgattggc tgctgtagcg 3660

aagtaccttg aaaagcggaa gcaccgaaac ttccaaaaga aggttcggta cgagagccgt 3720

aaacggctgg cggaggccag gcctcgcgta cgcggccaat tcgtcaaggc aagtacttcc 3780

gcggtggcgg caaccacccc tgccgccacg ggcgccaccg tcacctctac gtcgctccgt 3840

cagcccgttt atacggcggc cggcccggct ggcctggcgc tgccgccggc ggcggcagca 3900

gcggcggcca gcgccgccgc cgcgaggggg gttccgccgc cgtcatcccg catcggagcg 3960

gtggagctgg cggagttggt gcccgaccac gacgccgaca ttgaggacga ggggtgtgac 4020

gagcccgccg ccgccgagga ctccgacggg tccgtcgcgg tggagctggc ggaggtgtag 4080

<210> 66

<211> 3309

<212> DNA

<213> 莱茵衣藻

<400> 66

atggaggcta acggcttcca cgtcgtatta gtcgaggatg ataacatttg cctgaaagtg 60

gtggagcagc tgctgcggaa gctttcgtac agagtcagca ccgcatccga tggtgccgca 120

gcgctcaaag tcctggctga ctgcaagcag aggggcgaca aagtagacct cattctcacg 180

gacatcctga tgccagaggt taccgggttt gacctcatca acgaggtcgt gcatggagag 240

acctttgccg atattccggt cgtggttatg tcgtctcaag actcgcagga aagtgtcttg 300

caggcatttc aggcgggcgc agcggactac ctcatcaagc ccattcggaa aaatgagctt 360

gcaacgctct ggcagcacgt ctggcgtgca aaccgcgcca agggtggaca gaccagcagc 420

ggcgccgcgc atgtgggcgc aggcggcagg gggggcacca gcagccgcga tggcggtggc 480

gttgccggga cgcggtgcgg cccaggcgac cgcggcggca gcggcggcga cgctgagggt 540

agtgggctag gcggcggcgc gggtgcagtc aaggacagca gcggcggcag taccggcgcc 600

gccacttcag tgctgcactc cactggtggc acgacgctgc cctcacgtgc ggccaccggt 660

cggcacgcta gcacctcagc tggacacggc gtcaccagcg ctgaccccag caacaaccaa 720

acctcgcacg cgcacgcgca tgcgcatgcg cacgctcacg ggaacgcgca cgcgcacgcg 780

caccttcata tgcacggcgc aacagatcgt gcggcgcagg gcagcagcgc taacggcccg 840

gccaaccacg gggccgctgg gacagggctg cagtccgctg ggatggcagg ttccacggct 900

gcaggcgcgg ctgcgcccgc cggtgagtcg ctggccaagc cgcccttcgc ctccctagcc 960

gtccacttcg acctgcactc agtcctggcg ggcgcgggag cggctgcagc caatggtggc 1020

gccaatgccg cagctcacac tgctggcgcc accgggcgag agagcggcca ggcggcgggc 1080

gcggccacag gcggcattgc cgccgccggc accgtcatcg gctggtcgca tgcggacatg 1140

gacgtggacg gaggggaggc cggcgcgcag gatgaagatg acgaggacga ggacgacggc 1200

gtggaggcgc cggcgggcac acagaaccgg aagcgcgccg cggatgactc gggttgcgac 1260

ggcgccgccg ccaacaacaa cggcaacact gccgcaaagg ctggcgcagc ggcaatcgcc 1320

gcgggcgggc ctgggagctc gggcagggcg aaggccacgg acggcgcccg cgctgagatt 1380

cgccacaacg gtgggccgat ggcggcgcgg atggcggctg cagagggctc tcaaggctcg 1440

cgcgctgcat cgggctcggc ggcaacggga ccgggaggag cgcgggaggg cactgcgacg 1500

cctagcggcg acacctttgc ggagagccct tccaccttca cttccatcat caacaccacc 1560

ggctcgggca gcgaggccga cgagcagcca gtgccgctga agcaccagga acagcaacag 1620

cagcaacagc agcagcgggt cggcgagggt gacagggcga agcccgaacc gcacccacag 1680

aaccctgccc aggcagcaca cctgccgcac ccgtccgcgg ccccatgctc gggcggtggc 1740

ggtattgcgc aagcggccct acccctaggg ctacaggagc tggcagcgct gggggcggct 1800

cggcacaaag agctgtggac gcagcggcac cttatgcatc agcggcaggc ggcggcagcg 1860

gcgacagcag cggcggcctc ggcagctgct gcagcggcaa tgcccacggc cggcgcgagc 1920

gccgcggctc ctgcaggccc accttcggcg cggccctccg cttccttggc agacacgggc 1980

ggcgacggcc ccgcggctgc gacggcgcct gagacgcgcg cagatgggcc ctctggccct 2040

gccacgacgc agggccccaa acgagatgcc gtcgcaggtg ccgcggctgt cggcagctct 2100

gcacggagcg acagtccgct gccggcagcc gccgccgcga cggcaggcgc caacggcgcg 2160

agcggcgccg cttctgacgt gttggcgggc gcaggcagcc ttgcgcttct ccggcacagc 2220

gatcggtctg ccttcaccgc gttcacggtc ttcctgcccg ggcgtgttgc cgccgccgcg 2280

gccgctgcag cggccgccgc cgcagctgct accagcgcgg gcgccagcac cggcactgcc 2340

aacggggctc cgccggcacc gggcaccgct ctggctgccg ctgccgcagc agctgccgcc 2400

gctgcgtcag cagtgccgct gccgcatcca cacacagcgc ccccagcgct gttcggcgtc 2460

cctccgccgt cctccgtgcc tcccagctcg ctttctgtgc tacctcctgt gatgccgctc 2520

catccggccg ctgccgctgc agcggcgacg gcgggtgggg gcaagcccag cgacgcagcc 2580

acgtatgccg cggctgctgc agctggattg gtgccgtatc cagggtttgc gccggcgcgg 2640

ccggggccat ttccgccgcc gccaggttct ggtggccccg gcgcgccgcc tgtgtacata 2700

cccgagtcag tcctgcagct gattgcgcac ctgtccggcc gcgcggctgc ggaaattccg 2760

gcggtgccgg cggagtcagt gacggcagca ccggtggttg tgcagaagag cggcggccct 2820

gcctcggcgg cgcgactggc ggcagtggcc aagtacctgg agaagcggaa gcaccgcaat 2880

ttccagaaga aggtgcgcta cgagagccgc aagcggctcg ccgaggcccg gccacgcgtc 2940

agggggcagt tcgtcaaggc gggcaccgcg ggtgcagcgg cagcggcagc ggcagcggca 3000

gccgcagccg cagccggcac tgccgctact gctgccggca ccggcacggc cagaggtgct 3060

gccaccgctt ctggggctgc tgggaagccg gagctacagg gccccgacac ggcagaagag 3120

gctgcggctg cgacgctgct tagcgcagca gctgctatgg cagcagcggc tgcgggcacc 3180

agtggcccca gcggctctgg gtccggcgcg atggatgtgg acggtgccga cccggaagca 3240

gatgcagacg tcatggatga ggacgatggc gaagacgacg ggtcggacga gtccgctggg 3300

gagccctag 3309

<210> 67

<211> 1335

<212> DNA

<213> 胶球藻

<400> 67

atggctgcag gcctcaagcg gatacccagc ttctcggggc gaccaggatt ccccaacggt 60

ctgcaggtgt tggttgtgga cggggacacc agcagcagcc agtgcttgcg gcagaagctg 120

gaggagctgg catatgaagt cagctgctgc tcgtccggat ctgacgcttc ggcgctcctg 180

cgcaaggagg actccagcta cgacattctc ctagttgagg ccaaagctct ggcaaaggat 240

gctactgatg gaggcagtct cagagattct gcagcgcacc tgccgctggt cctcatgtca 300

gaaaagagca gcagcacaga cgctgtatgg cgaggcatag agctcggggc agcggacgtt 360

ctggagaagc cgctgtcctc cttgaagctg cgcaacatct ggcaacatgt cgttcgcaag 420

atgatgagct cgtcccagga cagcagcagg gaggcggtgc cctgcaagat ggagccgaag 480

agcaagggca agggcgtgtc agcgccctcc agccctcgca ctccctcccc tgcagcctcc 540

ctcctcacca tcagcagcgg cacgatgaca gagaagagct gcaagggcgg cggcgatgag 600

gcctccttct caggtgtggg agatgtgaag atgtcctgct cggcagaggc gccggagccc 660

tgcgattcgc gcgcgaccgc tgagtcaccc gccagcacgc agaccaaggt cacgttcccg 720

gggtgcttga atagcggcgg cacggcgctc gcggctagca agaattgcag ccgcaagaga 780

aaggcaaagg cgccggacac tcctgcatcg gtggcgagcc ggccgcctct ggccatcagg 840

ccccccgcat gggcctcccc atttggtccc ccccaccagg gcaacaccca cgtcgtcggc 900

atggccccgc cacagtgcta tatgcagggg gttgacccca cgaacgggtg cgtatggggc 960

acgccagcag ggggcgtcag ccaagcgcca gcctacatgc ccggctgggg cttctcgccg 1020

cagccaatgc tttccggcag cttcttgcag catccctcca ccagcgacct gcacaagtgc 1080

cccagcgtgg gtgccagcag cctggcaagc agcctggaca gcagcctgac gctgtgcggc 1140

tttggcgcgg acctgcctga cgacgatctc ctgttggagg acgtgcttct gccggacgag 1200

gatcttctgg acttggcccc agatgagccc gccaccatga aggcccccga gcagccgccc 1260

atcggcctca agctcaagaa gtccgcttca ctcatcgacc tcatcaatgc gcaactgtcc 1320

gccgccaccg cctga 1335

<210> 68

<211> 568

<212> PRT

<213> 拟小球藻属

<400> 68

Met Leu Arg Gln Gln Leu Leu His Ser Gly Arg Gln Pro Gly Ala Thr

1 5 10 15

Cys Ser Leu Leu Thr Cys Ser Thr Trp Arg Pro Ser Ala Leu Phe Gly

20 25 30

Arg Pro Lys Pro Gln Lys Leu His Ser Gln Arg Leu Gln His Gln Gly

35 40 45

Arg Pro Ser Arg Leu Val Val Arg Ser Ala Met Phe Asp Asn Leu Ser

50 55 60

Arg Ser Leu Glu Arg Ala Trp Asp Met Val Arg Lys Asp Gly Arg Leu

65 70 75 80

Thr Ala Asp Asn Ile Lys Glu Pro Met Arg Glu Ile Arg Arg Ala Leu

85 90 95

Leu Glu Ala Asp Val Arg Leu Gly Ala Pro Leu Ile Arg Phe Leu Val

100 105 110

Ser Thr Pro Pro Pro Ser Gln Val Ser Leu Pro Val Val Arg Lys Phe

115 120 125

Val Lys Ala Val Glu Glu Lys Ala Leu Gly Ser Ala Val Thr Lys Gly

130 135 140

Val Thr Pro Asp Gln Gln Leu Val Lys Val Val Tyr Asp Gln Leu Arg

145 150 155 160

Glu Leu Met Gly Gly Gln Gln Glu Gly Leu Val Pro Thr Ser Pro Glu

165 170 175

Glu Pro Gln Val Ile Leu Met Ala Gly Leu Gln Gly Thr Gly Lys Thr

180 185 190

Thr Ala Ala Gly Lys Leu Ala Leu Phe Leu Gln Lys Lys Gly Gln Lys

195 200 205

Val Leu Leu Val Ala Thr Asp Ile Tyr Arg Pro Ala Ala Ile Asp Gln

210 215 220

Leu Val Lys Leu Gly Asp Arg Ile Gly Val Pro Val Phe Gln Leu Gly

225 230 235 240

Thr Gln Val Gln Pro Pro Glu Ile Ala Arg Gln Gly Leu Glu Lys Ala

245 250 255

Arg Ala Glu Gly Phe Asp Ala Val Ile Val Asp Thr Ala Gly Arg Leu

260 265 270

Gln Ile Asp Gln Ser Met Met Glu Glu Leu Val Gln Ile Lys Ser Thr

275 280 285

Val Lys Pro Ser Asp Thr Leu Leu Val Val Asp Ala Met Thr Gly Gln

290 295 300

Glu Ala Ala Gly Leu Val Lys Ala Phe Asn Asp Ala Val Asp Ile Thr

305 310 315 320

Gly Ala Val Leu Thr Lys Leu Asp Gly Asp Ser Arg Gly Gly Ala Ala

325 330 335

Leu Ser Val Arg Gln Val Ser Gly Arg Pro Ile Lys Phe Val Gly Met

340 345 350

Gly Glu Gly Met Glu Ala Leu Glu Pro Phe Tyr Pro Glu Arg Met Ala

355 360 365

Ser Arg Ile Leu Gly Met Gly Asp Val Val Thr Leu Val Glu Lys Ala

370 375 380

Glu Glu Ser Ile Lys Glu Glu Glu Ala Gln Glu Ile Ser Arg Lys Met

385 390 395 400

Leu Ser Ala Lys Phe Asp Phe Asp Asp Phe Leu Lys Gln Tyr Lys Met

405 410 415

Val Ala Gly Met Gly Asn Met Ala Gln Ile Met Lys Met Leu Pro Gly

420 425 430

Met Asn Lys Phe Thr Glu Lys Gln Leu Ala Gly Val Glu Lys Gln Tyr

435 440 445

Lys Val Tyr Glu Ser Met Ile Gln Ser Met Thr Val Lys Glu Arg Lys

450 455 460

Gln Pro Glu Leu Leu Val Lys Ser Pro Ser Arg Arg Arg Arg Ile Ala

465 470 475 480

Arg Gly Ser Gly Arg Ser Glu Arg Glu Val Thr Glu Leu Leu Gly Val

485 490 495

Phe Thr Asn Leu Arg Thr Gln Met Gln Ser Phe Ser Lys Met Met Ala

500 505 510

Met Gly Gly Met Gly Met Gly Ser Met Met Ser Asp Glu Glu Met Met

515 520 525

Gln Ala Thr Leu Ala Gly Ala Gly Pro Arg Pro Val Pro Ala Gly Lys

530 535 540

Val Arg Arg Lys Lys Leu Ala Ala Ala Gly Gly Ser Arg Gly Met Ala

545 550 555 560

Glu Leu Ala Ser Leu Lys Ala Glu

565

<210> 69

<211> 23

<212> DNA

<213> 拟小球藻属

<400> 69

gggacatggt gcgcaaggac ggg 23

<210> 70

<211> 2667

<212> DNA

<213> 拟小球藻属

<400> 70

atggccaaac tgacatccgc tgttcctgtg ttgacagcaa gagatgttgc aggtgcagtg 60

gagttttgtg agttctgaga agctgattgt tgtttaactt ctttgaaagc tttatcgaag 120

attctgcaag cgatgaacat tgcttgtcaa gaccgagagc tgcatgccca cttgacatcc 180

agctttgaac ggctcttcat gtttgatttg tttctgattg tagggacaga tagactgggg 240

tttagcaggg actttgtgga ggacgatttt gcaggagtgg tgagggatga tgtgacactg 300

tttatctcag cagtgcagga tcaagtgagt gcagcgtcag ctgtggcagt tgttggcttt 360

cgtctcagtc agtagtttgc tgggattgat tatggagggc acagttgcaa ttttgagttg 420

cacgttgcga caagcgtgtt gacaaagcgt ggtcaagccg gccagtcttg ccggtggcgg 480

gtggcttggt ctaacttccg ctctacagca atcgttttgt tcatggttac ggggctggcg 540

tgccagaaag tcctggtcag ccaccctcgc ttcaaagccg tagcccaaca actttgcgaa 600

tatgttcgat ttgcaggtgg tgcccgataa tacactggca tgggtttggg tgagaggtac 660

agctctgcgt gcaacaggtt gcaagatgca gcgcaggtct tccctggtca aacgatgtat 720

gcagagttga gaggcacttg agctgggtga atggcgtggg ctcgtaggta gtgtgcaggg 780

caggaagggc agccaatttt ggagttgtgg tccggtgtcg ttgcttcgag ccttattagg 840

actcttgctc atcaaagcgt tagttgtgaa taagttgatc tgaaaggatg ttatgtacag 900

caagcagcag cagttaagag tctggggagt agctgcacag ggcgaggtgt caagatggga 960

agggtcctgc ctccttatgt gtttttccct gtaggggagg aagcctctta tgggcaatgg 1020

ttgggcatat tttccagcca gcccttcttt ctataggggc cagggtgggc ccagctcgtc 1080

ttggcttcca ccaccaggag agtgagggca ttgaagggcc ataaatagtc ctcccatcta 1140

cgtgcaccag agggtgtcgt ctaggctgtg catgccacga ggggaaggag ccaagaatga 1200

gtgtatgggt tgttttcatg tttaggctgg gataaaactg ttttcaattg cgcctgccgg 1260

gtgaaaacca cagcagcatc agcaagcttg gagaaggcca gcccgcccag cacaggctca 1320

cgttcccact caggcggtca gtcgggcggg ggtgtgagtc aggcaggcga gggtgtctgt 1380

gcctgacatc agcacctctg cttagccact gcagcccctg gagcagggta gggcgtcatt 1440

tgcagcaatc acctgctgcc tcacacgtcg cagcttggaa tttcaacgac catcagcgct 1500

ggggttgttg agggatcata gcagattttg gtgcagcctg gttgtcatgc tctttgtgga 1560

atggcctcta tgttcgagca attcgttgga tgttgaggtg cttggggaca gagagtcgaa 1620

tgatgggcca gggtcaaaca tgcgagcgtt tggctgagtc agcggttttt gctggtcact 1680

ttttcttttg tttcttattt aggtttgatg gatgtgtttt gtgctgctgc cctgaagctg 1740

cagcagcgtg tctgccctgc gctactgcgg gcaccaaggc tatgtgctgg tgcactcggc 1800

tgcgctgcac ctgtgcacct cgcactccgt ccagcctcca tgcagcacac gtactcacgg 1860

tgtcctcctg acctgtcgta cgctattcca aacttgctct tttgctgccg ctgctctcgt 1920

acacaattgc tgttgattat cgatatctaa tcgagcgcct gctgactgaa ctccgcaggt 1980

ttggatgaac tgtatgcaga gtggtctgaa gtggtgagca ccaactttag gtgggtgggc 2040

tctgaaggag gaggagggag cgggtgatta aacagggcct gcatgaagag gagcaggggc 2100

tgcatggaca gcagggggaa ggtgcagaag ggagggtcaa gcggggttca ggtggctgtg 2160

ggtttctgca cgagcagtga aagaagctgt atccttccac ctgctttcac tggcgaaagg 2220

ttgaaaacag gatgtcgcag ctggaaagat gttgcgctgt caagtgcaag ccatggttga 2280

gggtatgcct gtgtgcatgt gcttcttaaa gttactcctg ttctatggtt ctgggtgctt 2340

gttgtttgtg gtgcagggat gcaagcggac ctgcaatgac agagattgga gaacaacctt 2400

ggggaaggga gtttgcattg agagatcctg caggtgaggg ggcatgtaag caatggcagg 2460

caattcaaga acgaatcatt gctgcaaatg ctgggatggt atgcagctga ggtatctatt 2520

gccttgtatt ttgtctcgca ttgcatcggt ggtgcgttct gtggcctgag gcacagttct 2580

tgctgtttga taagggttcg actgagttgt cgtgtgtgct gtgctgcagg caattgcgtg 2640

cactttgttg cagaagaaca ggactga 2667

<210> 71

<211> 530

<212> DNA

<213> 拟小球藻属

<400> 71

ccaccatggg ggaggtttga agtgtgcgcc tgatataatc atacacctaa aagcaccact 60

tgctgattgt gaagggacta tgtcgtttat gacgggacgt tacgctggcc gatggtttga 120

atttggacgc tgtggtagaa tgttatatgg acgtaaaggt tggcatattg aaaatcgtct 180

tcgcaggcaa acttctagac gtgtgaccca ccggtaaaac gacaagcgtg gcgcgtcgat 240

tgcgctttga acgtcgtttg ttggactcca gatgaacctc aaaatcaaag cggtgattga 300

cgaaaatcaa atgacagccc gcaaaatttc atcagccttc ggatcggatt ctcagaatct 360

gattgtccct gctggctaca tttatgaaat ttcgtacatt ttggcagaaa tgtcccaata 420

ccatagcact gccgcctgag ctcacccgag caatgcatac tgggtacctc gcccatctcg 480

ccctctttcc aagcccagtg ctgttgtaat agccaaaggg ctcagtaaca 530

<210> 72

<211> 546

<212> DNA

<213> 拟小球藻属

<400> 72

gcatagcatc agcctgtggc agggttgtgg tagggctgag tggcagggtt aaaggggttg 60

cctaccccac ccctactctc atgacaccag caacagcagc agctcatgca gtactcaaat 120

cactgatgtc aatggtgtga cacatttggt taaggctgct ttttaaagtg ctgctttggg 180

ggcagtgact gtgcagagct tggagcgtat ccccatgtaa tcagaaccga cgagagttcg 240

gggcaacctt tcatcttcac attttttgtg atcagctaca gagtctgaaa tcaaatagag 300

gctgccatct aaacgcagga gtcacaacga aggcgaaaac tccaattgct gtactcaatg 360

cactaagtga ttgttcaatg gataaataca ctatgctcaa ttcatgccag cagagctgct 420

ccttccagcc agctacaatg gctttttcca cgccttttga agtatgaatg ttcagcttgc 480

tgtgcttgat gcatcaccat aaacacaatt ctacaacatt tcatgccaac aacagtacgg 540

gctttc 546

<210> 73

<211> 23

<212> DNA

<213> 拟小球藻属

<400> 73

tgcggtgaag cttggagctg tgg 23

<210> 74

<211> 23

<212> DNA

<213> 拟小球藻属

<400> 74

acaccacctt aaggcacatg agg 23

<210> 75

<211> 549

<212> PRT

<213> 莱茵衣藻

<400> 75

Met Gln Thr Ala Leu Arg Ala Arg Ser Ala Ala Pro Arg Gly Ala Cys

1 5 10 15

Asn Arg Thr Ala Val Ala Pro Val Ala Ser Ala His Leu Arg Gly Gln

20 25 30

Tyr Ala Pro Phe Ser Gly Ala Gln Ala Arg Pro Ala Leu Gly Arg Gln

35 40 45

Arg Gln Gln Gln Gln Gln Gln Arg Arg Gly Ala Leu Val Ile Arg Ser

50 55 60

Ala Met Phe Asp Ser Leu Ser Arg Ser Ile Glu Lys Ala Gln Arg Leu

65 70 75 80

Ile Gly Lys Ser Gly Thr Leu Thr Ala Glu Asn Met Lys Glu Pro Leu

85 90 95

Lys Glu Val Arg Arg Ala Leu Leu Glu Ala Asp Val Ser Leu Pro Val

100 105 110

Val Arg Arg Phe Ile Lys Lys Val Glu Glu Arg Ala Leu Gly Thr Lys

115 120 125

Val Arg Glu Gly Arg Ala Met Gly Thr Lys Trp Lys Ser Val Val Asn

130 135 140

Cys Pro Leu Gln Asp Gly Leu Gly Asn Arg Gly Val Gly Arg Ala Arg

145 150 155 160

Thr Glu Val Gly His Arg Ala Ala Cys Val His Gly Ala Arg Gly Val

165 170 175

Gly Lys Thr Thr Ala Ala Gly Lys Leu Ala Leu Tyr Leu Lys Lys Ala

180 185 190

Lys Lys Ser Cys Leu Leu Val Ala Thr Asp Val Tyr Arg Pro Ala Ala

195 200 205

Ile Asp Gln Leu Val Lys Leu Gly Ala Ala Ile Asp Val Pro Val Phe

210 215 220

Glu Met Gly Thr Asp Val Ser Pro Val Glu Ile Ala Lys Lys Gly Val

225 230 235 240

Glu Glu Ala Arg Arg Leu Gly Val Asp Ala Val Ile Ile Asp Thr Ala

245 250 255

Gly Arg Leu Gln Val Asp Glu Gly Met Met Ala Glu Leu Arg Asp Val

260 265 270

Lys Ser Ala Val Arg Pro Ser Asp Thr Leu Leu Val Val Asp Ala Met

275 280 285

Thr Gly Gln Glu Ala Ala Asn Leu Val Arg Ser Phe Asn Glu Ala Val

290 295 300

Asp Ile Ser Gly Ala Ile Leu Thr Lys Met Asp Gly Asp Ser Arg Gly

305 310 315 320

Gly Ala Ala Leu Ser Val Arg Glu Val Ser Gly Lys Pro Ile Lys Phe

325 330 335

Val Gly Val Gly Glu Lys Met Glu Ala Leu Glu Pro Phe Tyr Pro Glu

340 345 350

Arg Met Ala Ser Arg Ile Leu Gly Met Gly Asp Val Leu Thr Leu Tyr

355 360 365

Glu Lys Ala Glu Ala Ala Ile Lys Glu Glu Asp Ala Gln Lys Thr Met

370 375 380

Glu Arg Leu Met Glu Glu Lys Phe Asp Phe Asn Asp Phe Leu Asn Gln

385 390 395 400

Trp Lys Ala Met Asn Asn Met Gly Gly Leu Gln Met Leu Lys Met Met

405 410 415

Pro Gly Phe Asn Lys Ile Ser Glu Lys Gln Leu Tyr Glu Ala Glu Lys

420 425 430

Gln Phe Gly Val Tyr Glu Ala Ile Ile Gly Ala Met Asp Glu Glu Glu

435 440 445

Arg Ser Asn Pro Glu Val Leu Ile Lys Asn Leu Ala Arg Arg Arg Arg

450 455 460

Val Ala Gln Asp Ser Gly Lys Ser Glu Ala Glu Val Thr Lys Leu Met

465 470 475 480

Ala Ala Tyr Thr Ser Met Lys Ala Gln Val Gly Gly Met Ser Lys Leu

485 490 495

Leu Lys Leu Gln Lys Ala Gly Ala Asp Pro Gln Lys Ala Asn Ser Leu

500 505 510

Leu Gln Glu Leu Val Ala Ser Ala Gly Lys Lys Val Ala Pro Gly Lys

515 520 525

Val Arg Arg Lys Lys Glu Lys Glu Pro Leu Ser Lys Ala Arg Gly Phe

530 535 540

Gly Ser Ser Ser Lys

545

<210> 76

<211> 559

<212> PRT

<213> 细小微胞藻

<400> 76

Met Arg His Leu Leu Ser Ser Ala Ser Ile Arg Gln Tyr Asp Lys Trp

1 5 10 15

Ser Leu Val Ser Ser His Ala Lys Lys Pro Ala Leu Val Cys Ala Ser

20 25 30

Lys His Thr Lys Ser Ala Val Lys Leu Gln Cys Thr Ser Arg Gly Ser

35 40 45

Ser Asn Arg Thr Ile Gln Leu Leu Leu Phe Gln Gln Phe Arg Pro Ala

50 55 60

Lys Arg Gly Lys Leu Leu Ile Thr Arg Ala Asp Ser Phe Gly Thr Leu

65 70 75 80

Ser Glu Arg Leu Asn Ser Ala Trp Ser Ala Leu Lys Asp Glu Asp Asp

85 90 95

Leu Ser Val Glu Asn Ile Ser Leu Pro Leu Lys Asp Ile Arg Arg Ala

100 105 110

Leu Leu Glu Ala Asp Val Ser Leu Pro Val Val Arg Arg Phe Ile Lys

115 120 125

Ser Val Glu Glu Lys Ser Ile Gly Val Lys Val Thr Lys Gly Val Ser

130 135 140

Ala Ser Gln Gln Leu Thr Lys Val Val Ala Asp Glu Leu Cys Glu Leu

145 150 155 160

Met Gly Gly Phe Gly Gly Asp Lys Leu Ile Phe Arg Lys Glu Gly Glu

165 170 175

Gly Pro Thr Val Ile Leu Met Ala Gly Leu Gln Gly Val Gly Lys Thr

180 185 190

Thr Ala Cys Gly Lys Leu Ala Leu Phe Leu Lys Ala Gln Gly Lys Gln

195 200 205

Ser Leu Leu Val Ala Thr Asp Val Tyr Arg Pro Ala Ala Ile Asp Gln

210 215 220

Leu Lys Lys Leu Gly Glu Gln Ile Asp Val Pro Val Phe Glu Leu Gly

225 230 235 240

Thr Asp Phe Ser Pro Pro Asp Ile Ala Arg Ser Gly Val Glu Lys Ala

245 250 255

Lys Leu Glu Asn Phe Asp Val Val Ile Val Asp Thr Ala Gly Arg Leu

260 265 270

Gln Val Asp Glu Met Leu Met Ala Glu Leu Leu Ala Thr Lys Ala Ala

275 280 285

Thr Arg Ala Asp Glu Thr Leu Leu Val Val Asp Ala Met Thr Gly Gln

290 295 300

Glu Ala Ala Ser Leu Thr Ala Ala Phe Asn Asp Ala Val Gly Ile Thr

305 310 315 320

Gly Ala Val Leu Thr Lys Met Asp Gly Asp Thr Arg Gly Gly Ala Ala

325 330 335

Leu Ser Val Arg Glu Val Ser Gly Lys Pro Ile Lys Phe Ile Gly Ser

340 345 350

Gly Glu Lys Leu Asp Ala Leu Glu Pro Phe Phe Pro Glu Arg Met Thr

355 360 365

Thr Arg Ile Leu Gly Met Gly Asp Val Val Ser Leu Val Glu Arg Ala

370 375 380

Gln Val Ala Val Lys Glu Glu Gln Ala Asn Leu Met Arg Asp Lys Ile

385 390 395 400

Leu Ser Ala Thr Phe Asp Phe Asn Asp Phe Leu Ser Gln Leu Glu Met

405 410 415

Met Gly Lys Met Gly Gly Met Gly Gly Leu Thr Lys Met Met Pro Gly

420 425 430

Met Asn Thr Met Ser Asp Lys Glu Leu Gln Asp Ala Glu Lys Ser Leu

435 440 445

Ser Val Ala Lys Ser Leu Ile Met Ser Met Thr Pro Arg Glu Arg Gln

450 455 460

Phe Pro Asp Leu Leu Val Ala Gly Ser Ser Ala Ala Ser Arg Arg Gly

465 470 475 480

Arg Val Val Glu Gly Ser Gly Arg Ser Asp Lys Asp Leu Ala Asn Leu

485 490 495

Ile Val Met Phe Gly Ser Met Arg Val Lys Met Gln Ser Leu Ser Ala

500 505 510

Gln Met Asn Gly Thr Ala Lys Glu Val Gly Leu Val Pro Gln Leu Ser

515 520 525

Glu Val Asp Leu Asn Lys Leu Ala Phe Glu Gly Val Gly Lys Arg Val

530 535 540

Ser Pro Gly Met Val Arg Arg Arg Lys Leu Asn Ala Ser Phe Gly

545 550 555

<210> 77

<211> 568

<212> PRT

<213> 微胞藻属

<400> 77

Met Glu Ala Arg Thr Lys Gln Ala Arg Ala Pro Lys Gly Ser Ile Trp

1 5 10 15

Cys Ala Gln Arg Ala Arg Lys Asp Leu Arg Ala Arg Gly Cys Arg Gly

20 25 30

Leu Gly Ser Arg Ile Ser Lys Gly Gln Pro Phe Ser Pro Leu Thr Leu

35 40 45

Ser Thr Pro Ala Val Thr Glu Ile Gly Phe Gly Thr Leu Leu Tyr Gly

50 55 60

Ser Arg Leu Ser Ala Gly Gly Ser Arg Arg Gly Glu Thr Met Leu Arg

65 70 75 80

Arg Ala Ser Ala Phe Gly Ser Leu Thr Glu Arg Leu Asn Ser Val Trp

85 90 95

Ala Thr Leu Lys Asp Glu Asp Asp Leu Ser Leu Glu Asn Ile Lys Gly

100 105 110

Pro Leu Lys Asp Ile Arg Arg Ala Leu Leu Glu Ala Asp Val Ser Leu

115 120 125

Pro Val Val Arg Arg Phe Ile Lys Asn Ile Glu Gln Lys Ala Ile Gly

130 135 140

Thr Arg Val Thr Lys Gly Val Asn Ala Gly Gln Gln Leu Thr Lys Val

145 150 155 160

Val Ala Asp Glu Leu Cys Glu Leu Met Gly Gly Phe Gly Gly Asp Ser

165 170 175

Leu Ala Phe Lys Asp Pro Ser Met Gly Pro Thr Val Ile Leu Met Ala

180 185 190

Gly Leu Gln Gly Val Gly Lys Thr Thr Ala Cys Gly Lys Leu Ala Leu

195 200 205

Tyr Leu Lys Lys Gln Gly Lys Asp Ser Leu Leu Val Ala Thr Asp Val

210 215 220

Tyr Arg Pro Ala Ala Ile Glu Gln Leu Lys Arg Leu Gly Glu Gln Val

225 230 235 240

Lys Thr Pro Val Phe Asp Met Gly Val Arg Val Asp Pro Pro Glu Val

245 250 255

Ala Arg Leu Gly Leu Glu Lys Ala Arg Ala Glu Gly Ile Asp Val Val

260 265 270

Ile Ile Asp Thr Ala Gly Arg Leu Gln Val Asp Val His Leu Met Glu

275 280 285

Glu Leu Arg Ala Thr Lys Ile Ala Thr Ala Ala Asp Glu Ile Leu Leu

290 295 300

Val Val Asp Ala Met Thr Gly Gln Glu Ala Ala Ala Leu Thr Ala Ala

305 310 315 320

Phe Asp Glu Ala Val Gly Ile Thr Gly Ala Val Leu Thr Lys Met Asp

325 330 335

Gly Asp Thr Arg Gly Gly Ala Ala Leu Ser Val Arg Glu Val Ser Gly

340 345 350

Lys Pro Ile Lys Phe Thr Gly Val Gly Glu Lys Met Glu Ala Leu Glu

355 360 365

Pro Phe Tyr Pro Glu Arg Met Ala Ser Arg Ile Leu Gly Met Gly Asp

370 375 380

Val Val Thr Leu Val Glu Arg Ala Gln Gln Val Val Lys Asn Glu Glu

385 390 395 400

Ala Glu Gln Met Arg Asp Lys Ile Leu Ser Ala Thr Phe Asp Phe Asn

405 410 415

Asp Phe Ile Lys Gln Met Glu Met Met Gly Gln Met Gly Gly Met Asp

420 425 430

Gly Phe Met Lys Leu Leu Pro Gly Met Ser Gly Met Ser Glu Arg Glu

435 440 445

Met Gln Glu Ala Asp Lys Ser Leu Lys Val Ala Lys Ser Leu Ile Leu

450 455 460

Ser Met Thr Ser Lys Glu Arg Gln Phe Pro Asp Ile Leu Val Ala Gly

465 470 475 480

Ala Ser Ala Lys Ser Arg Arg Lys Arg Ile Ile Glu Gly Ala Gly Arg

485 490 495

Ser Glu Lys Asp Leu Ser Gln Leu Ile Val Leu Phe Gly Ser Met Arg

500 505 510

Val Lys Met Gln Lys Met Thr Ala Glu Ile Thr Gly Ala Ser Ala Glu

515 520 525

Val Gly Leu Thr Pro Gln Leu Ser Glu Glu Asp Met Asn Thr Leu Ala

530 535 540

Asn Glu Gly Leu Arg Lys Asn Val Ser Pro Gly Met Val Arg Arg Leu

545 550 555 560

Arg Ile Arg Arg Leu Thr Gly Ser

565

<210> 78

<211> 481

<212> PRT

<213> 载色宝琳虫

<400> 78

Met Phe Asp Glu Leu Ser Ala Arg Phe Glu Glu Ala Val Lys Ser Leu

1 5 10 15

Lys Gly Leu Ser Ala Ile Thr Glu Asn Asn Val Glu Asn Ala Leu Lys

20 25 30

Gln Val Arg Arg Ala Leu Ile Glu Ala Asp Val Ser Leu Val Val Val

35 40 45

Lys Glu Phe Met Glu Glu Val Arg Ser Lys Ser Ile Gly Ile Glu Val

50 55 60

Val Arg Gly Ile Lys Pro Asp Gln Lys Phe Ile Gln Val Val Tyr Glu

65 70 75 80

Gln Leu Ile Glu Ile Met Gly Ala Asn Asn Thr Pro Leu His Lys Gln

85 90 95

Ser His Thr Val Thr Val Val Leu Met Ala Gly Leu Gln Gly Ala Gly

100 105 110

Lys Thr Thr Ala Ala Ala Lys Leu Ala Leu Tyr Leu Lys Asn Gln Gly

115 120 125

Glu Lys Val Leu Met Val Ala Ala Asp Val Tyr Arg Pro Ala Ala Ile

130 135 140

Asp Gln Leu Phe Val Leu Gly Lys Gln Ile Asp Val Glu Val Phe Thr

145 150 155 160

Leu Asn Pro Glu Ser Ile Pro Glu Asp Ile Ala Ala Ala Gly Leu Gln

165 170 175

Lys Ala Ile Arg Glu Gly Phe Asp Tyr Leu Ile Val Asp Thr Ala Gly

180 185 190

Arg Leu Gln Ile Asp Thr Ala Met Met Gln Glu Met Val Arg Ile Arg

195 200 205

Ser Ala Val Asn Pro Asn Glu Ile Leu Leu Val Val Asp Ser Met Ile

210 215 220

Gly Gln Glu Ala Ala Glu Leu Thr Arg Ala Phe His Glu Gln Ile Gly

225 230 235 240

Ile Thr Gly Ala Val Leu Thr Lys Leu Asp Gly Asp Ala Arg Gly Gly

245 250 255

Ala Ala Leu Ser Ile Arg Lys Val Ser Gly Ala Pro Ile Lys Phe Ile

260 265 270

Gly Thr Gly Glu Lys Val Glu Ala Leu Gln Pro Phe His Pro Glu Arg

275 280 285

Met Ala Ser Arg Ile Leu Gly Met Gly Asp Ile Val Thr Leu Val Glu

290 295 300

Lys Ala Gln Glu Glu Val Glu Leu Ala Asp Val Glu Lys Met Gln Arg

305 310 315 320

Lys Leu Gln Glu Ala Ser Phe Asp Phe Ser Asp Phe Leu Gln Gln Met

325 330 335

Arg Leu Val Lys Arg Met Gly Ser Leu Gly Gly Leu Met Lys Met Ile

340 345 350

Pro Gly Met Asn Lys Ile Asp Ser Thr Met Leu Arg Glu Gly Glu Ala

355 360 365

Gln Leu Lys Arg Ile Glu Ser Met Ile Gly Ser Met Thr Pro Thr Glu

370 375 380

Arg Glu Lys Pro Glu Leu Leu Ala Ser Gln Pro Ser Arg Arg Gly Arg

385 390 395 400

Ile Ala Lys Gly Ser Gly His Lys Ile Ala Asp Val Asp Lys Met Leu

405 410 415

Val Asp Phe Gln Lys Met Arg Gly Phe Met Gln Gln Met Thr Lys Gly

420 425 430

Asn Asn Phe Ala Asn Pro Leu Ser Met Gly Ala Asn Met Phe Ser Gln

435 440 445

Pro Asn Met Thr Val Pro Gln Thr Lys Ile Ser Asn Thr Asn Glu Ser

450 455 460

Arg Met Arg Asn Ser Arg Ala Thr Lys Lys Lys Lys Gly Phe Gly Gln

465 470 475 480

Leu

<210> 79

<211> 498

<212> PRT

<213> 海洋发光蚝球藻

<400> 79

Met Thr Arg Ala Asp Ala Phe Ala Gly Met Ser Asp Lys Leu Asp Lys

1 5 10 15

Ala Trp Ala Arg Leu Gln Gly Glu Lys Asp Leu Asn Ala Asp Asn Val

20 25 30

Lys Ala Pro Leu Lys Asp Val Arg Arg Ala Leu Leu Glu Ala Asp Val

35 40 45

Ser Leu Pro Val Val Arg Arg Phe Ile Ala Arg Cys Glu Glu Lys Ala

50 55 60

Val Gly Met Lys Val Thr Lys Gly Val Glu Pro Gly Gln Met Leu Val

65 70 75 80

Lys Cys Val Ala Asp Glu Leu Cys Glu Leu Met Gly Gly Val Gly Ala

85 90 95

Glu Gly Ile Lys Phe Arg Asp Asp Gly Glu Pro Thr Val Val Leu Met

100 105 110

Ala Gly Leu Gln Gly Val Gly Lys Thr Thr Ala Cys Gly Lys Leu Ser

115 120 125

Leu Ala Leu Arg Lys Gln Gly Lys Ser Val Leu Leu Val Ala Thr Asp

130 135 140

Val Tyr Arg Pro Ala Ala Ile Asp Gln Leu Lys Thr Leu Gly Lys Gln

145 150 155 160

Ile Gly Val Pro Val Phe Asp Met Gly Val Asp Gly Asn Pro Pro Glu

165 170 175

Ile Ala Ala Arg Gly Val Arg Lys Ala Lys Asp Glu Asp Ile Asp Val

180 185 190

Val Ile Val Asp Thr Ala Gly Arg Leu Asn Ile Asp Glu Lys Leu Met

195 200 205

Gly Glu Leu Lys Ala Thr Lys Glu Ala Thr Ser Ala Asp Glu Thr Leu

210 215 220

Leu Val Val Asp Ala Met Thr Gly Gln Glu Ala Ala Thr Leu Thr Ala

225 230 235 240

Ser Phe Asn Glu Ala Val Glu Ile Thr Gly Ala Ile Leu Thr Lys Met

245 250 255

Asp Gly Asp Thr Arg Gly Gly Ala Ala Leu Ser Val Arg Glu Val Ser

260 265 270

Gly Lys Pro Ile Lys Phe Thr Gly Val Gly Glu Lys Met Asp Ala Leu

275 280 285

Glu Pro Phe Tyr Pro Glu Arg Met Thr Ser Arg Ile Leu Gly Met Gly

290 295 300

Asp Ile Val Ser Leu Val Glu Lys Val Gln Ala Gly Val Lys Glu Glu

305 310 315 320

Glu Ala Glu Lys Ile Lys Gln Lys Ile Met Ser Ala Thr Phe Asp Phe

325 330 335

Asn Asp Phe Val Gly Gln Leu Glu Met Met Asn Asn Met Gly Gly Met

340 345 350

Lys Gln Ile Met Gln Met Met Pro Gly Thr Ala Lys Leu Ser Glu Ala

355 360 365

Asp Met Glu Ala Ala Gly Lys Ser Met Thr Ile Ala Lys Ser Leu Ile

370 375 380

Asn Ser Met Thr Lys Glu Glu Arg Gln Tyr Pro Asp Met Leu Val Ala

385 390 395 400

Ser Thr Thr Ala Asp Ser Arg Arg Gln Arg Ile Val Lys Gly Ser Gly

405 410 415

Arg Thr Glu Ala Asp Leu Ala Gln Leu Ile Met Met Phe Gly Gly Met

420 425 430

Arg Thr Gln Met Gln Lys Met Ser Gly Gln Leu Gly Gly Gln Ala Gly

435 440 445

Asp Val Gly Leu Gln Pro Gln Leu Ser Glu Ala Glu Leu Ser Lys Leu

450 455 460

Ala Met Asn Lys Ile Arg Lys Thr Val Lys Pro Gly Met Val Arg Arg

465 470 475 480

Gln Lys Ala Lys Lys Val Pro Lys Phe Leu Ala Glu Arg Glu Ser Phe

485 490 495

Ser Gln

<210> 80

<211> 426

<212> PRT

<213> 金牛蚝球藻

<400> 80

Met Lys Val Thr Lys Gly Val Glu Pro Gly Gln Met Leu Val Lys Ala

1 5 10 15

Val Ala Asp Glu Leu Cys Glu Leu Met Gly Gly Val Gly Ala Glu Gly

20 25 30

Ile Lys Phe Arg Asp Asp Gly Glu Pro Thr Val Ile Leu Met Ala Gly

35 40 45

Leu Gln Gly Val Gly Lys Thr Thr Ala Cys Gly Lys Leu Ser Leu Ala

50 55 60

Met Arg Lys Gln Gly Lys Thr Val Leu Leu Val Ala Thr Asp Val Tyr

65 70 75 80

Arg Pro Ala Ala Ile Asp Gln Leu Lys Thr Leu Gly Thr Gln Ile Gly

85 90 95

Val Pro Val Phe Asp Met Gly Val Asp Ala Ser Pro Pro Glu Val Ala

100 105 110

Ala Arg Gly Val Arg Lys Ala Lys Glu Glu Asp Ile Asp Val Val Ile

115 120 125

Val Asp Thr Ala Gly Arg Leu Asn Ile Asp Glu Lys Leu Met Ser Glu

130 135 140

Leu Lys Asp Thr Lys Leu Ala Thr Lys Ala Asp Glu Thr Leu Leu Val

145 150 155 160

Val Asp Ala Met Thr Gly Gln Glu Ala Ala Asn Leu Thr Ala Ser Phe

165 170 175

Gln Arg Gly Asp Gly Arg Arg Thr Arg Arg Gly Gly Ala Ala Leu Ser

180 185 190

Val Ala Arg Ser Phe Arg Lys Ala His Gln Phe Thr Ala Ser Val Lys

195 200 205

Met Asp Ala Leu Glu Pro Phe Tyr Pro Glu Arg Met Thr Ser Arg Ile

210 215 220

Leu Gly Met Gly Asp Ile Val Ser Leu Val Glu Lys Val Gln Ser Glu

225 230 235 240

Val Lys Glu Ala Glu Ala Glu Lys Leu Lys Glu Lys Ile Leu Lys Ala

245 250 255

Thr Phe Asp Phe Asn Asp Phe Val Thr Gln Leu Glu Met Met Asn Asn

260 265 270

Met Gly Ser Met Lys Gln Ile Met Gln Met Leu Pro Gly Thr Thr Lys

275 280 285

Leu Ser Glu Ser Glu Met Glu Ala Ala Glu Lys Ser Phe Lys Ile Ala

290 295 300

Arg Ser Leu Ile Asn Ser Met Thr Lys Glu Glu Arg Gln Phe Pro Asp

305 310 315 320

Met Leu Val Ala Ser Thr Thr Ala Glu Ser Arg Arg Ala Arg Ile Val

325 330 335

Lys Gly Ser Gly Arg Thr Glu Ala Asp Leu Ala Gln Leu Ile Ile Met

340 345 350

Phe Gly Ser Met Arg Gly Lys Met Gln Gln Leu Ser Gly Glu Leu Gly

355 360 365

Gly Glu Ala Gly Asn Val Gly Leu Gln Pro Gln Leu Ser Ala Ala Glu

370 375 380

Leu Glu Lys Leu Thr Thr Asn Lys Leu Arg Lys Asn Ile Lys Pro Gly

385 390 395 400

Met Val Arg Arg Leu Lys Ser Lys Lys Ile Pro Ile Ala Lys Asn Gly

405 410 415

Asp Arg Met Gly Ile Ser Ala Ser Ala Asp

420 425

<210> 81

<211> 510

<212> PRT

<213> 卡特氏团藻

<400> 81

Met Ser Arg Pro Ala Ala Leu Arg Gly Ala Gly Asn Arg Lys Leu Thr

1 5 10 15

Ala Thr Val Thr Ala Ala His Leu Arg Gly Ile Ala Phe Thr Ser Ile

20 25 30

Arg Thr Cys Gln Gly Ala Lys Gly Gly Ser Leu Gly Leu Pro His Pro

35 40 45

Ser Pro Pro Leu Ala Leu Pro Arg Arg Gly Arg Gly Arg Gly Ala Ala

50 55 60

Val Val Val Arg Ala Ala Met Phe Asp Asn Leu Ser Lys Ser Leu Glu

65 70 75 80

Lys Ala Gln Arg Leu Ile Gly Gly Cys Glu Val Pro Gly Val Gly Val

85 90 95

Val Gly Lys Ser Gly Thr Leu Thr Ala Glu Asn Met Lys Glu Pro Leu

100 105 110

Lys Glu Val Arg Arg Ala Leu Leu Glu Ala Asp Val Ser Leu Pro Val

115 120 125

Val Arg Arg Phe Val Lys Lys Val Glu Glu Arg Ala Leu Gly Thr Lys

130 135 140

Val Ile Glu Gly Val Thr Pro Asp Val Gln Phe Ile Lys Val Val Ser

145 150 155 160

Asn Glu Leu Ile Glu Leu Met Gly Gly Gly Val Gly Ala Lys Asp Leu

165 170 175

Glu Pro Gly Phe Pro Gln Ile Ile Leu Met Ala Gly Leu Gln Gly Val

180 185 190

Gly Lys Thr Thr Ala Ala Gly Lys Leu Ala Leu Tyr Leu Lys Lys Ala

195 200 205

Lys Lys Ser Cys Leu Leu Val Ala Thr Asp Val Tyr Arg Pro Ala Ala

210 215 220

Ile Asp Gln Leu Val Lys Leu Gly Ala Ala Ile Asp Val Pro Val Phe

225 230 235 240

Glu Leu Gly Thr Gln Val Ser Gly Lys Pro Ile Lys Phe Val Gly Val

245 250 255

Gly Glu Lys Met Glu Ala Leu Glu Pro Phe Tyr Pro Glu Arg Met Ala

260 265 270

Ser Arg Ile Leu Gly Met Gly Asp Val Leu Thr Leu Tyr Glu Lys Ala

275 280 285

Glu Ala Ala Ile Lys Glu Glu Asp Ala Lys Ala Val Met Asp Arg Leu

290 295 300

Met Glu Glu Lys Phe Asp Phe Asn Asp Phe Leu Asn Gln Trp Lys Ser

305 310 315 320

Met Asn Asn Met Gly Gly Met Gln Ile Leu Lys Met Met Pro Gly Phe

325 330 335

Asn Lys Glu Arg Ser Asn Pro Glu Val Ile Ile Lys Ser Leu Ala Arg

340 345 350

Arg Arg Arg Val Ala Gln Asp Ser Gly His Ser Glu Ala Glu Val Ala

355 360 365

Lys Leu Met Thr Ala Tyr Thr Ala Met Arg Thr Gln Val Gly Gly Met

370 375 380

Ser Lys Leu Leu Lys Leu Gln Lys Ser Gly Gly Asp Pro Ser Gln Ala

385 390 395 400

Glu Lys Leu Leu Lys Glu Leu Val Ala Ser Ala Gly Lys Lys Val Ala

405 410 415

Pro Gly Lys Pro Pro Gly Asp Pro Ala Gly Ser Phe Ile Ser Thr Pro

420 425 430

Arg Thr Pro His Pro Pro Pro Gly Pro Leu Gly Pro Arg Ser Gln Val

435 440 445

Arg Arg Lys Lys Glu Lys Glu Pro Ile Ser Lys Ala Arg Gly Phe Gly

450 455 460

Ser Pro Ser Asn Phe Asn His Asp Leu Ser Pro Pro Gly Ser Ser Pro

465 470 475 480

Ala Ala Tyr Thr Tyr Thr Leu Ser Arg Leu Ser Cys Gln Arg Leu Cys

485 490 495

Asp Gly Gly Gly Leu Leu Asp Asp Trp Asn Leu Trp Arg Arg

500 505 510

<210> 82

<211> 448

<212> PRT

<213> 三角褐指藻

<400> 82

Met Ser Glu Ala Ser Ile Gln Pro Ala Leu Arg Glu Val Arg Arg Ala

1 5 10 15

Leu Leu Asp Ala Asp Val Asn Val Asp Val Ala Asp Thr Leu Ile Glu

20 25 30

Gly Val Arg Ala Arg Ser Leu Gly Gln Glu Val Leu Glu Gly Val Thr

35 40 45

Ala Glu Gln Gln Phe Val Lys Ala Met Tyr Asp Glu Leu Leu Asp Met

50 55 60

Met Gly Gly Asp Ser Ser Val Pro Met Ser Asp Gly Pro Ser Asn Val

65 70 75 80

Pro Val Ala Thr Leu Ala Ser Gly Thr Ala Ala Asp Pro Ala Val Ile

85 90 95

Leu Leu Ala Gly Leu Gln Gly Ala Gly Lys Thr Thr Ala Ala Gly Lys

100 105 110

Leu Ala Leu Phe Leu Lys Glu Gln Arg Lys Val Leu Leu Val Ala Ala

115 120 125

Asp Ile Tyr Arg Pro Ala Ala Ile Lys Gln Leu Gln Val Leu Gly Glu

130 135 140

Ser Ile Gly Val Glu Val Phe Thr Lys Gly Thr Asp Val Asp Pro Val

145 150 155 160

Glu Ile Val Asn Ala Gly Ile Gln Lys Ala Arg Asp Glu Gly Tyr Asp

165 170 175

Thr Val Ile Val Asp Thr Ala Gly Arg Gln Val Ile Asp Thr Asp Leu

180 185 190

Met Asp Glu Leu Gln Arg Met Lys Arg Ala Ala Ser Pro Gln Glu Thr

195 200 205

Leu Leu Ile Val Asp Ala Met Thr Gly Gln Glu Ala Ala Ser Leu Thr

210 215 220

Ala Ala Phe Asp Ser Ala Ile Gly Leu Thr Gly Ala Ile Leu Thr Lys

225 230 235 240

Met Asp Gly Asp Ser Arg Gly Gly Ala Ala Val Ser Val Arg Gly Val

245 250 255

Ser Gly Lys Pro Ile Lys Phe Val Gly Thr Gly Glu Lys Thr Ala Asp

260 265 270

Leu Glu Pro Phe Tyr Pro Asp Arg Met Ala Ser Arg Ile Leu Gly Met

275 280 285

Gly Asp Val Val Ser Leu Val Glu Lys Ala Ala Ser Glu Val Ser Asp

290 295 300

Ala Asp Ala Leu Lys Met Gln Gln Lys Met Leu Asp Ala Ser Phe Asp

305 310 315 320

Phe Asp Asp Phe Val Lys Gln Ser Glu Leu Val Thr Lys Met Gly Ser

325 330 335

Val Ala Gly Ile Ala Lys Leu Met Pro Gly Met Ala Asn Gln Leu Asn

340 345 350

Met Asn Gln Ile Arg Glu Val Glu Ala Arg Leu Lys Lys Ser Lys Ser

355 360 365

Met Ile Ser Ser Met Thr Lys Lys Glu Arg Ala Asn Pro Glu Leu Leu

370 375 380

Ile Lys Asp Ser Ser Ala Arg Ser Arg Leu Ile Arg Ile Thr Lys Gly

385 390 395 400

Ser Gly Cys Gly Leu Asp Glu Gly Gln Gln Phe Met Ser Glu Phe Gln

405 410 415

Arg Met Lys Thr Met Met Ser Thr Arg Arg Phe Trp Arg Phe Trp Leu

420 425 430

Met Ile Gln Ser Leu Ala Leu Ala Val Thr Arg Pro Glu Asn Thr Val

435 440 445

<210> 83

<211> 486

<212> PRT

<213> 假微型海链藻

<400> 83

Met Phe Asp Gln Leu Ser Asn Ala Leu Thr Glu Val Ala Lys Asn Phe

1 5 10 15

Gly Gly Lys Gln Arg Met Thr Glu Asn Ser Ile Gln Pro Ala Leu Lys

20 25 30

Ser Val Arg Arg Ala Leu Leu Asp Ala Asp Val Asn Leu Asp Val Ala

35 40 45

Thr Ala Leu Ile Asp Gly Val Lys Arg Arg Ser Leu Gly Lys Glu Val

50 55 60

Thr Lys Gly Val Thr Ala Glu Gln Gln Phe Ile Lys Ala Met Tyr Asp

65 70 75 80

Glu Leu Leu Asp Met Met Gly Gly Glu Ala Asn Glu Ser Asn Thr Met

85 90 95

Ala Thr Leu Ala His Ser Ser Val Ala Asn Glu Pro Ala Val Ile Leu

100 105 110

Leu Ala Gly Leu Gln Gly Ala Gly Lys Thr Thr Ala Ala Gly Lys Leu

115 120 125

Ala Phe Arg Leu Pro Lys Arg Asn Arg Lys Val Leu Leu Val Ala Ala

130 135 140

Asp Val Tyr Arg Pro Ala Ala Ile Glu Gln Leu Gln Ile Leu Gly Lys

145 150 155 160

Gln Ile Gly Val Glu Val Phe Ser Met Gly Val Asp Ala Asp Pro Ala

165 170 175

Asp Ile Ala Lys Glu Ala Val Glu Lys Ala Lys Arg Glu Gly Phe Asp

180 185 190

Thr Val Val Val Asp Thr Ala Gly Arg Gln Val Val Asp Glu Glu Leu

195 200 205

Met Glu Glu Leu Arg Arg Val Lys Lys Thr Val Glu Pro Asp Glu Thr

210 215 220

Leu Leu Val Val Asp Ala Met Thr Gly Gln Ala Ala Ala Ser Leu Thr

225 230 235 240

Ala Ser Phe Asp Ala Ala Val Gly Ile Ser Gly Ala Ile Leu Thr Lys

245 250 255

Leu Asp Gly Asp Ser Arg Gly Gly Ala Ala Val Ser Ile Arg Gly Val

260 265 270

Ser Gly Lys Pro Ile Lys Phe Val Gly Val Gly Glu Lys Thr Asn Asp

275 280 285

Leu Glu Pro Phe Tyr Pro Asp Arg Met Ala Ser Arg Ile Leu Gly Met

290 295 300

Gly Asp Val Ile Ser Leu Val Glu Lys Ala Ser Met Glu Val Ser Asp

305 310 315 320

Ala Asp Ala Ala Lys Met Gln Glu Lys Met Ala Lys Ala Glu Phe Asp

325 330 335

Phe Asp Asp Phe Met Thr Gln Ser Arg Met Val Ser Lys Met Gly Ser

340 345 350

Met Ala Gly Val Ala Lys Met Leu Pro Gly Met Gly Asn Met Ile Asp

355 360 365

Ser Ser Gln Met Arg Gln Val Glu Glu Arg Ile Lys Arg Ser Glu Ala

370 375 380

Met Ile Cys Ser Met Asn Lys Lys Glu Arg Ala Asn Pro Gly Leu Leu

385 390 395 400

Leu Thr Asp Lys Ser Ala Arg Ser Arg Leu Met Arg Ile Thr Lys Gly

405 410 415

Ser Gly Leu Ala Phe Glu Asp Gly Leu Ala Phe Met Ser Glu Phe Gln

420 425 430

Lys Met Arg Thr Met Ile Ser Arg Met Ala Lys Gln Thr Gly Met Gly

435 440 445

Gln Pro Asp Gly Glu Gly Glu Met Glu Pro Ala Met Ala Gly Asn Arg

450 455 460

Asn Ala Arg Arg Ala Ala Lys Lys Lys Gly Lys Lys Gly Gly Arg Gly

465 470 475 480

Gly Gly Met Gly Phe Ala

485

<210> 84

<211> 530

<212> PRT

<213> 抑食金球藻

<400> 84

Met Thr Met Ala Arg Arg Ala Ala Thr Ala Ala Leu Val Leu Ala Ala

1 5 10 15

Ala Trp Ala Phe Ala Pro Pro Gln Thr Lys Arg Ala Thr Thr Gln Leu

20 25 30

Tyr Phe Phe Asp Lys Leu Ala Glu Ser Ile Thr Ala Ala Thr Asp Val

35 40 45

Leu Ser Gly Lys Ser Arg Met Thr Glu Ala Asn Thr Lys Ser Ala Leu

50 55 60

Arg Asp Val Arg Arg Ser Leu Leu Asp Ala Asp Val Ala Lys Val Val

65 70 75 80

Val Asp Gly Phe Val Glu Asn Val Gln Ala Ser Ala Leu Asp Gly Glu

85 90 95

Val Ala Glu Gly Val Asp Pro Gly Gln Gln Phe Val Lys Ile Val Tyr

100 105 110

Asp Glu Leu Lys Arg Val Met Gly Gly Asp Asp Asp Glu Leu Leu Phe

115 120 125

Ser Asp Asp Pro Glu Ala Ala Ala Lys Ala Arg Ala Gly Leu Ala Tyr

130 135 140

Arg Asp Asp Gly Ala Pro Thr Val Val Leu Leu Cys Gly Leu Gln Gly

145 150 155 160

Ala Gly Lys Thr Thr Ala Ala Ala Lys Leu Ala Leu Arg Leu Lys Glu

165 170 175

Glu Glu Gly Lys Thr Pro Met Leu Val Ala Ala Asp Val Tyr Arg Pro

180 185 190

Ala Ala Val Glu Gln Leu Gln Ile Leu Gly Glu Gln Val Gly Val Pro

195 200 205

Val Tyr Ala Glu Ala Phe Glu Ala Gly Ala Gly Asp Ala Val Ala Ile

210 215 220

Ala Thr Ala Gly Val Arg Ala Ala Lys Glu Arg Gly Ala Asp Val Val

225 230 235 240

Ile Val Asp Thr Ala Gly Arg Gln Val Ile Glu Glu Ser Leu Met Ala

245 250 255

Glu Leu Arg Ser Val Arg Ala Ala Thr Lys Pro Asp Glu Thr Leu Leu

260 265 270

Val Leu Asp Ala Met Thr Gly Gln Asp Ala Ala Ser Leu Ala Lys Arg

275 280 285

Phe Asp Asp Ala Cys Pro Leu Thr Gly Ser Val Leu Thr Lys Leu Asp

290 295 300

Gly Asp Ala Arg Gly Gly Ala Ala Leu Ser Val Arg Ala Val Ser Gly

305 310 315 320

Lys Pro Ile Lys Phe Val Gly Val Gly Glu Lys Val Gly Asp Leu Glu

325 330 335

Pro Phe Phe Pro Ala Arg Met Ala Ser Arg Ile Leu Gly Met Gly Asp

340 345 350

Val Val Ser Leu Val Glu Lys Ala Ser Lys Gln Gln Ser Ala Ala Glu

355 360 365

Ala Lys Ala Val Met Glu Arg Thr Lys Gln Ala Lys Phe Asn Phe Asp

370 375 380

Asp Tyr Leu Asp Gln Ala Arg Met Val Ser Asn Met Gly Ser Phe Gly

385 390 395 400

Ala Val Ala Lys Met Met Pro Gly Met Gly Gly Ile Asp Asn Asp Gln

405 410 415

Ile Ala Ala Ala Glu Ala Lys Ile Lys Ile Gln Ala Ser Leu Ile Asn

420 425 430

Ser Met Thr Pro Lys Glu Arg Gly Glu Pro Asp Leu Ile Ile Arg Asp

435 440 445

Lys Ser Ala Leu Ala Arg Gln Lys Arg Ile Ala Ala Gly Ser Gly Arg

450 455 460

Ser Val Asp Gln Ala Lys Gln Phe Leu Ser Glu Phe Gln Gln Met Arg

465 470 475 480

Thr Met Met Ala Lys Met Ala Gly Gln Ala Pro Pro Asp Gly Ala Asp

485 490 495

Ala Ala Ala Ala Pro Asp Pro Asp Ala Leu Leu Asn Arg Ala Ala Arg

500 505 510

Arg Ala Lys Lys Lys Lys Gly Gly Lys Arg Lys Leu Lys Thr Ala Gly

515 520 525

Phe Gly

530

<210> 85

<211> 556

<212> PRT

<213> 长囊水云

<400> 85

Met Ile Met Ala Ser Leu Lys His Arg Ser Pro Pro Arg Gly Gly Ala

1 5 10 15

Ala Ala Thr Leu Ser Phe Phe Cys Cys Val Cys Ala Leu Phe Ala Gln

20 25 30

Ser Ser Val Ala Phe Val Pro Ala Gly Gly Leu Ser Arg Cys Gly Val

35 40 45

Asn Asp Arg Ser Ser Ser Ser Cys Arg Ala Ala Ala Ile Gly Ala Ala

50 55 60

Gly Arg Ser Ser Leu Pro Val Ser Arg Ser Ser Ser Arg Arg Gly Arg

65 70 75 80

Arg Gly Gly Cys Ala Gly Gly Ala Ser Ser Pro Leu Gly Met Met Phe

85 90 95

Asp Thr Leu Ala Glu Asn Met Ala Gly Val Ala Asn Leu Phe Thr Gly

100 105 110

Gln Lys Thr Ile Thr Glu Ser Ser Val Glu Gly Ala Leu Asn Glu Val

115 120 125

Lys Arg Ala Leu Leu Asp Ala Asp Leu Asn Leu Met Val Thr Asn Thr

130 135 140

Leu Val Asp Ala Val Lys Ser Lys Ala Val Gly Met Lys Leu Val Asp

145 150 155 160

Gly Val Thr Ala Lys Gln Gln Phe Val Asn Val Met Asn Asp Glu Leu

165 170 175

Val Glu Ile Met Gly Ala Glu Gln Ala Pro Leu Ala Arg Arg Thr Asp

180 185 190

Gly Lys Pro Thr Val Ile Leu Leu Ala Gly Leu Gln Gly Thr Gly Lys

195 200 205

Thr Thr Ala Ala Ala Lys Leu Ala Lys Tyr Leu Gln Gln Glu Glu Glu

210 215 220

Pro Lys Lys Val Leu Leu Val Ala Gly Asp Val Tyr Arg Pro Ala Ile

225 230 235 240

Asp Gln Leu Ile Ser Leu Gly Lys Arg Ile Asp Val Glu Val Phe Ser

245 250 255

Met Gly Gln Gly Val Asp Pro Val Glu Ile Thr Lys Ala Gly Leu Glu

260 265 270

Arg Ala Val Glu Gly Glu Phe Asp Thr Val Ile Val Asp Thr Ala Gly

275 280 285

Arg Gln Val Val Asp Asp Thr Leu Met Thr Glu Leu Lys Asp Ile Gln

290 295 300

Val Ala Ser Glu Ala Asp Glu Val Leu Leu Val Val Asp Ala Met Thr

305 310 315 320

Gly Gln Glu Ala Ala Thr Leu Ala Ser Val Phe Asn Glu Lys Ile Gly

325 330 335

Ile Thr Gly Ala Val Leu Thr Lys Met Asp Gly Asp Thr Arg Gly Gly

340 345 350

Ala Ala Leu Ser Val Gln Gly Val Ser Gln Lys Pro Ile Lys Phe Val

355 360 365

Gly Ile Gly Glu Lys Met Ser Glu Glu Glu Ala Ala Lys Leu Ala Lys

370 375 380

Lys Met Ile Asn Ala Glu Phe Asp Phe Asn Asp Phe Leu Lys Gln Ala

385 390 395 400

Lys Met Met Lys Gly Met Gly Ser Leu Gly Gly Val Ala Asn Met Ile

405 410 415

Pro Gly Met Ala Gly Lys Ile Thr Pro Gln Gln Leu Asn Gln Ala Glu

420 425 430

Glu Gly Val Gln Arg Ala Glu Gly Leu Ile Lys Phe Met Thr Pro Glu

435 440 445

Glu Arg Arg Thr Pro Lys Leu Leu Ile Leu Asp Pro Thr Ser Gln Ala

450 455 460

Arg Cys Arg Arg Ile Ala Arg Asp Ala Gly Val Lys Leu Ser Ala Val

465 470 475 480

Ser Ala Phe Leu Lys Glu Phe Gln Ala Met Gln Ser Asn Met Ser Arg

485 490 495

Met Gly Lys Gln Met Ala Asp Gly Asp Pro Asn Ala Gly Pro Gly Gly

500 505 510

Gln Pro Ser Pro Phe Gln Gly Leu Gly Gly Asp Thr Ala Pro Gly Ala

515 520 525

Ala Pro Ser Met Asn Arg Gln Gln Arg Arg Gln Ser Lys Lys Asn Lys

530 535 540

Ala Gly Arg Ser Ala Ala Pro Ser Lys Gly Phe Gly

545 550 555

<210> 86

<211> 28452

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的多核苷酸

<400> 86

cgtctgatta aaccacgctg ggagattaga taatgaagcg tgcgcctgtt attccaaaac 60

atacgctcaa tactcaaccg gttgaagata cttcgttatc gacaccagct gccccgatgg 120

tggattcgtt aattgcgcgc gtaggagtaa tggctcgcgg taatgccatt actttgcctg 180

tatgtggtcg ggatgtgaag tttactcttg aagtgctccg gggtgatagt gttgagaaga 240

cctctcgggt atggtcaggt aatgaacgtg accaggagct gcttactgag gacgcactgg 300

atgatctcat cccttctttt ctactgactg gtcaacagac accggcgttc ggtcgaagag 360

tatctggtgt catagaaatt gccgatggga gtcgccgtcg taaagctgct gcacttaccg 420

aaagtgatta tcgtgttctg gttggcgagc tggatgatga gcagatggct gcattatcca 480

gattgggtaa cgattatcgc ccaacaagtg cttatgaacg tggtcagcgt tatgcaagcc 540

gattgcagaa tgaatttgct ggaaatattt ctgcgctggc tgatgcggaa aatatttcac 600

gtaagattat tacccgctgt atcaacaccg ccaaattgcc taaatcagtt gttgctcttt 660

tttctcaccc cggtgaacta tctgcccggt caggtgatgc acttcaaaaa gcctttacag 720

ataaagagga attacttaag cagcaggcat ctaaccttca tgagcagaaa aaagctgggg 780

tgatatttga agctgaagaa gttatcactc ttttaacttc tgtgcttaaa acgtcatctg 840

catcaagaac tagtttaagc tcacgacatc agtttgctcc tggagcgaca gtattgtata 900

agggcgataa aatggtgctt aacctggaca ggtctcgtgt tccaactgag tgtatagaga 960

aaattgaggc cattcttaag gaacttgaaa agccagcacc ctgatgcgac ctcgttttag 1020

tctacgttta tctgtcttta cttaatgtcc tttgttacag gccagaaagc ataactggcc 1080

tgaatattct ctctgggccc actgttccac ttgtatcgtc ggtctgataa tcagactggg 1140

accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1200

atcgtcggtc tgattattag tctgggacca cggtcccact cgtatcgtcg gtctgataat 1260

cagactggga ccacggtccc actcgtatcg tcggtctgat tattagtctg ggaccatggt 1320

cccactcgta tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg 1380

tctgattatt agtctggaac cacggtccca ctcgtatcgt cggtctgatt attagtctgg 1440

gaccacggtc ccactcgtat cgtcggtctg attattagtc tgggaccacg atcccactcg 1500

tgttgtcggt ctgattatcg gtctgggacc acggtcccac ttgtattgtc gatcagacta 1560

tcagcgtgag actacgattc catcaatgcc tgtcaagggc aagtattgac atgtcgtcgt 1620

aacctgtaga acggagtaac ctcggtgtgc ggttgtatgc ctgctgtgga ttgctgctgt 1680

gtcctgctta tccacaacat tttgcgcacg gttatgtgga caaaatacct ggttacccag 1740

gccgtgccgg cacgtgatcg cgcaggctca gctgcacagc agacgcaagg gacagctcag 1800

catctggaac cgccgacacc aggtgctgag gatgctgcac ctttggcaac cccaataggt 1860

gcttttgggc gtactgctgt gcctgcgcct agggacattg actggtggcg ggtcccgaag 1920

gagctgatgg gaagctacgc acaagctgaa gctggagaca gcagctccac caatgttgac 1980

ttctctgggg agcctccggc cagcagcgtg tacaacgaga ggggggacgc gttagtggag 2040

caggaggtga aggcagcaac ggcgggtgtg gattttgctg gcaggaggag ggccaggggg 2100

ttgttggaca atgctgagcc tcccgatctt gataatggcc ctggagagca gccagcaggg 2160

gcaacagtga gttggagctg gaggcgattc agggataggc agggataggc agtgtagtgg 2220

ccagaactgg ctgctggaac ctggggttac tcagggtgaa cgcaggcaaa ggggtgcagg 2280

tgtattgaag ctcttaatat aagagagatg cgtcgaacat atatggtgat agtcttgagt 2340

ggtgtgttgg gtggaaggct gctgtttacg gtgcaggaag ttttcctggt acggtcgtta 2400

tgtaatgcag cagcacgtat gtaagaacca gtcgacattt aacctatgca gtagcatata 2460

gttatgtgtc aaaatcataa attggcccta tttgtggcga gcctatcttt caaatactac 2520

tgttcctcgc actgtcctct aaaatttctc caaacaacgt tgtaaaggtg ctgatgttag 2580

catatcatct ctggcataac tggatgcacc cagccggcta acagtgggaa gatgaagagg 2640

ggcttgtaca ctacactttt cttgccaaga ctgttagctt gcccaagcca gcacagcgat 2700

tttcttgcaa caaacgtgag ccttgcatct gcttttgatt gcaacggccg actggtgagt 2760

tattgtgcaa gcagtgttct gctaaactgt tcagaccagg ttcgcagctg gctaagatcg 2820

gtatctggaa agctccaacg aacaggtttt caatacgtgc tgcgtcaata tgcccttctt 2880

gttcactaca gcgacctttc caatgttgga tgtgaacaaa tgtcgaagcg cacaataacc 2940

tgaaagacat tgttgctcat tcccttttct ttggtagcgt aggtttgtat atttagagtt 3000

ccagttctgt actagttgct ctgcggcaac gattgaagtg tgtaccttat actgcacgtt 3060

aaatatgata ggttcagcgc ggttctttaa atgacaaaat aaatagtatt caacaaaaaa 3120

aaatagttgt ttgacatgtc actttttctt ttacataggt agcatgtcgt caaatcgtca 3180

atgcaaacca gcttgcgact aacgtaagca gtacagggga tagtacaatg agtttttcac 3240

cagcaatttg gtccagtgtt ttcgcaccgc cgtgaagcgc attcacatta aagtagcatc 3300

gctacacctg ttctcatctt gttaggttca aattttgcaa cgtgtagcta caaagtggca 3360

acagcgcagg ctgttggtca ctcgctaagg cttgcattgg caccctcgtt gctctgtgta 3420

ggagcgtgca tttgtgctca agactgttat ttttgacttc aaaaacttta tcgatagcgc 3480

actgcctcgt ttttacaaga tagccttctg tgagcagccc tgccccatgc gcctttaggc 3540

tttctgtggc aatgtctggt tcagctggat cgggccaggc tactctcaga catgacggtg 3600

gctctgctgg cggcagtggg cctgtctcag acggtttttc accggccggc ctgaaggtaa 3660

agtagaaaga cactcataca catcttggtt cggcgttgaa agtaggtcat taacatactc 3720

tataaccaat atttgtaggt tctggtcgtg gacgacgagt taaccttgga aatccctacc 3780

aggctattct ccacagcccg aaccccttaa gctagacgaa cacagttagc ataacttcgt 3840

ataggatact ttatacgaag ttatgcggcc gcccaccatg ggggaggttt gaagtgtgcg 3900

cctgatataa tcatacacct aaaagcacca cttgctgatt gtgaagggac tatgtcgttt 3960

atgacgggac gttacgctgg ccgatggttt gaatttggac gctgtggtag aatgttatat 4020

ggacgtaaag gttggcatat tgaaaatcgt cttcacaggc aaacttctag acgtgtgacc 4080

caccggtaaa acgacaagcg tggcgcgtcg attgcgcttt gaacgtcgtt tgttggactc 4140

cagatgaacc tcaaaatcaa agcggtgatt gacgaaaatc aaatgacagc ccgcaaaatt 4200

tcatcagcct tcggatcgga ttctcagaat ctgattgtcc ctgctggcta catttatgaa 4260

atttcgtaca ttttggcaga aatgtcccaa taccatagca ctgccgcctg agctcacccg 4320

agcaatgcat actgggtacc tcgcccatct cgccctcttt ccaagcccag tgctgttgta 4380

aatagccaaa gggctcagta acaatggcca aactgacatc cgctgttcct gtgttgacag 4440

caagagatgt tgcaggtgca gtggagtttt gtgagttctg agaagctgat tgttgtttaa 4500

cttctttgaa agctttatcg aagattctgc aagcgatgaa cattgcttgt caagaccgag 4560

agctgcatgc ccacttgaca tccagctttg aacggctctt catgtttgat ttgtttctga 4620

ttgtagggac agatagactg gggtttagca gggactttgt ggaggacgat tttgcaggag 4680

tggtgaggga tgatgtgaca ctgtttatct cagcagtgca ggatcaagtg agtgcagcgt 4740

cagctgtggc agttgttggc tttcgtctca gtcagtagtt tgctgggatt gattatggag 4800

ggcacagttg caattttgag ttgcacgttg cgacaagcgt gttgacaaag cgtggtcaag 4860

ccggccagtc ttgccggtgg cgggtggctt ggtctaactt ccgctctaca gcaatcgttt 4920

tgttcatggt tacggggctg gcgtgccaga aagtcctggt cagccaccct cgcttcaaag 4980

ccgtagccca acaactttgc gaatatgttc gatttgcagg tggtgcccga taatacactg 5040

gcatgggttt gggtgagagg tacagctctg cgtgcaacag gttgcaagat gcagcgcagg 5100

tcttccctgg tcaaacgatg tatgcagagt tgagaggcac ttgagctggg tgaatggcgt 5160

gggctcgtag gtagtgtgca gggcaggaag ggcagccaat tttggagttg tggtccggtg 5220

tcgttgcttc gagccttatt aggactcttg ctcatcaaag cgttagttgt gaataagttg 5280

atctgaaagg atgttatgta cagcaagcag cagcagttaa gagtctgggg agtagctgca 5340

cagggcgagg tgtcaagatg ggaagggtcc tgcctcctta tgtgtttttc cctgtagggg 5400

aggaagcctc ttatgggcaa tggttgggca tattttccag ccagcccttc tttctatagg 5460

ggccagggtg ggcccagctc gtcttggctt ccaccaccag gagagtgagg gcattgaagg 5520

gccataaata gtcctcccat ctacgtgcac cagagggtgt cgtctaggct gtgcatgcca 5580

cgaggggaag gagccaagaa tgagtgtatg ggttgttttc atgtttaggc tgggataaaa 5640

ctgttttcaa ttgcgcctgc cgggtgaaaa ccacagcagc atcagcaagc ttggagaagg 5700

ccagcccgcc cagcacaggc tcacgttccc actcaggcgg tcagtcgggc gggggtgtga 5760

gtcaggcagg cgagggtgtc tgtgcctgac atcagcacct ctgcttagcc actgcagccc 5820

ctggagcagg gtagggcgtc atttgcagca atcacctgct gcctcacacg tcgcagcttg 5880

gaatttcaac gaccatcagc gctggggttg ttgagggatc atagcagatt ttggtgcagc 5940

ctggttgtca tgctctttgt ggaatggcct ctatgttcga gcaattcgtt ggatgttgag 6000

gtgcttgggg acagagagtc gaatgatggg ccagggtcaa acatgcgagc gtttggctga 6060

gtcagcggtt tttgctggtc actttttctt ttgtttctta tttaggtttg atggatgtgt 6120

tttgtgctgc tgccctgaag ctgcagcagc gtgtctgccc tgcgctactg cgggcaccaa 6180

ggctatgtgc tggtgcactc ggctgcgctg cacctgtgca cctcgcactc cgtccagcct 6240

ccatgcagca cacgtactca cggtgtcctc ctgacctgtc gtacgctatt ccaaacttgc 6300

tcttttgctg ccgctgctct cgtacacaat tgctgttgat tatcgatatc taatcgagcg 6360

cctgctgact gaactccgca ggtttggatg aactgtatgc agagtggtct gaagtggtga 6420

gcaccaactt taggtgggtg ggctctgaag gaggaggagg gagcgggtga ttaaacaggg 6480

cctgcatgaa gaggagcagg ggctgcatgg acagcagggg gaaggtgcag aagggagggt 6540

caagcggggt tcaggtggct gtgggtttct gcacgagcag tgaaagaagc tgtatccttc 6600

cacctgcttt cactggcgaa aggttgaaaa caggatgtcg cagctggaaa gatgttgcgc 6660

tgtcaagtgc aagccatggt tgagggtatg cctgtgtgca tgtgcttctt aaagttactc 6720

ctgttctatg gttctgggtg cttgttgttt gtggtgcagg gatgcaagcg gacctgcaat 6780

gacagagatt ggagaacaac cttggggaag ggagtttgca ttgagagatc ctgcaggtga 6840

gggggcatgt aagcaatggc aggcaattca agaacgaatc attgctgcaa atgctgggat 6900

ggtatgcagc tgaggtatct attgccttgt attttgtctc gcattgcatc ggtggtgcgt 6960

tctgtggcct gaggcacagt tcttgctgtt tgataagggt tcgactgagt tgtcgtgtgt 7020

gctgtgctgc aggcaattgc gtgcactttg ttgcagaaga acaggactga gcatagcatc 7080

agcctgtggc agggttgtgg tagggctgag tggcagggtt aaaggggttg cctaccccac 7140

ccctactctc atgacaccag caacagcagc agctcatgca gtactcaaat cactgatgtc 7200

aatggtgtga cacatttggt taaggctgct ttttaaagtg ctgctttggg ggcagtgact 7260

gtgcagagct tggagcgtat ccccatgtaa tcagaaccga cgagagttcg gggcaacctt 7320

tcatcttcac attttttgtg atcagctaca gagtctgaaa tcaaatagag gctgccatct 7380

aaacgcagga gtcacaacga aggcgaaaac tccaattgct gtactcaatg cactaagtga 7440

ttgttcaatg gataaataca ctatgctcaa ttcatgccag cagagctgct ccttccagcc 7500

agctacaatg gctttttcca cgccttttga agtatgaatg ttcagcttgc tgtgcttgat 7560

gcatcaccat aaacacaatt ctacaacatt tcatgccaac aacagtacgg gctttccctg 7620

caggcagttg gtacggcata ttatggttta aacatctatc ctccagatca ccagggccag 7680

tgaggccagt ttgcatagtt aagtatgctg gctattgcag taccttatat gcaaacaagt 7740

gctcaatctg tttcatcatt gtctgtgggc aaattgcctg ccaatattct ccagttattg 7800

cctgttgttt caaatgattg aaattggaag ttgtattgct ctacattttt gacttgtgat 7860

tttttcattt gttgatatct gacaactgtg aactgcactg aacttgctgt gcttataaat 7920

gcattttttt gttttgggcc acgttgattc cttgtgatac tttcctgcta tcaaaccaaa 7980

aatatactct catgactgac gtgcaacaaa tgcatggaag ctttcaacgt tacgacagct 8040

gcttgccccc catcagctat tctacatgtg taacctacct tgcatggcca ccacaacgct 8100

actgcatgca agatctggcg caactggatg tcccaatagt agaagtatcc ggattatctc 8160

cgagagtttt acatatgtaa tcgacgccat ttctgtcatc aactataaat ccattgctcc 8220

tgcatttctg gcactgacat tctaccacaa gcaataccaa tgttggagag cgacgagagc 8280

ggcctgcccg ccatggagat cgagtgccgc atcaccggca ccctgaacgg cgtggagttc 8340

gagctggtgg gcggcggaga gggcaccccc gagcagggcc gcatgaccaa caagatgaag 8400

agcaccaaag gcgccctgac cttcagcccc tacctgctga gccacgtgat gggctacggc 8460

ttctaccact tcggcaccta ccccagcggc tacgagaacc ccttcctgca cgccatcaac 8520

aacggcggct acaccaacac ccgcatcgag aagtacgagg acggcggcgt gctgcacgtg 8580

agcttcagct accgctacga ggccggccgc gtgatcggcg acttcaaggt gatgggcacc 8640

ggcttccccg aggacagcgt gatcttcacc gacaagatca tccgcagcaa cgccaccgtg 8700

gagcacctgc accccatggg cgataacgat ctggatggca gcttcacccg caccttcagc 8760

ctgcgcgacg gcggctacta cagctccgtg gtggacagcc acatgcactt caagagcgcc 8820

atccacccca gcatcctgca gaacgggggc cccatgttcg ccttccgccg cgtggaggag 8880

gatcacagca acaccgagct gggcatcgtg gagtaccagc acgccttcaa gaccccggat 8940

gcagatgccg gtgaagaata agcagcagct tgttatgcct tccccatggg catcagcatg 9000

ctgcaagctg tctagatatc cagctttcag tggaggttga gcgagggtca gcagcggttc 9060

cctggcgatg gcggtcagct tttctggaag ccttcactag gactgcgccc agcgcatgtg 9120

acgccaatcg aacttgtgtg caaggccaaa ttttgtgacc ctgtgctgca cttcatgtat 9180

tcaagaattg agaagaaatt tcattgctgc ccttctttca ctttaatttc catccctgga 9240

tccacctccc accattgtgg ttgatgggta ggggttttgg gtaggtgcag ttcgttgtgc 9300

acgttgacat gtgtaacggt gagcaaagga attgctgggc aagtagctat tgcagcttaa 9360

gggcatggtg aaacacttgt gctgtattta cagaggaagc cagacaggta aggagtgtgt 9420

ggcagcttgg aacaggaggg ctggtcgcaa caagtatgca tatcccatga ttgttgacat 9480

aagagcagca ggtgcatatt gccagccttt gtgaaagtgg attgaaaatc aattagttgg 9540

tgtgatagct gaggctaggc actgccaacc tgcagtgaaa tgaggctcca agaccgggta 9600

ataatacagg caatcgaatc cagttgaaat tacggcgatt aaatccaagc gagcgttgta 9660

agaacatctg cacctgtctg aagtagtgag cggataatga gcattgcttg ccttctatca 9720

ctatacctga cagttacgtg tcacacactc tcaagcacaa cacacagcgg caaagttact 9780

tgctaaacct cacagtcaag ctgaaaataa aggctaaatt acgtgagacc ggcgcgccat 9840

aacttcgtat aggatacttt atacgaagtt atcaccagat ataggtgacc cgataactta 9900

attaatcttg cgaagattga attgctaata gaaggttctc atctatacat gagttaccag 9960

tgaaccccat atctgctcta taatatagtc cccgctgagg cgcagtgctg aggttccagc 10020

tcgaacgagc cagtagggct tcgactcacg gctcatttat tttagagcta ggttgacttc 10080

ccagtctcat gcaatacatg agagcaggtg ttggtcgcac gcctctctca cggtgcctct 10140

tgattttcgg ccccttgcac ccgctctcat atgacatatt cgcgctgcac ccttgctcag 10200

agcaggcgca gcatgtggag tagcgggcgc aagccgtaat gaggagtctc agctcaacat 10260

gattgaggtc agcatcactg taacaataca aatcattgtg gtgccttata tatttggtaa 10320

atgctcgctg cagtattcaa atcgaccttc actgcaagca actcgattga actacgcgcg 10380

ttattgaagg cacatacaac cgggagttca gaggagtatg cccaagaaga agcggaaagt 10440

cgggagcaat ctgttgaccg tgcatcaggt atcgagaaga actaaagagc gttcaaacgc 10500

atcaatattt tgctaaagag ctttacatct ttttggggct attttctggc tactcggtag 10560

tgacttgacc actttcttcc caagtggggg caagccgata agccgctgtg accgttgatt 10620

tttttataaa agacgtagac atgttcaatc agccacaatt gatatgcttg aatacagaac 10680

ctgcccgcat tgcctgttga cgcaacatct ggtgagctgc ggttgctatc ctcccaatat 10740

aacctgaagt catgcatata ttcgcactaa tctacatccc atgttgtgtt gagctattcg 10800

gtattgatgc cagctcagtg aactaattat caaatgtata tcggtgctgc cagaatcgat 10860

ccatgtatca atgccacaag taactggaga tacatttgct acatgtagat gaggtgcgca 10920

agaacctgat ggacatgttt agggaccgcc aagccttcag cgagcataca tggaagatgc 10980

tgctgagcgt gtgcagatct tgggcagcat ggtgtaagct gaacaaccgc aagtggttcc 11040

cagcagaacc cgaaggtatg cctgggtaac tgtcaaaatc atgtatattc ccgcaatgca 11100

agtggttcat tgttgtgctt tacgttaaag acgtgtcagc tgcaggagaa ttattttgag 11160

gatgattgtc cgttgttggc gatgtcttgc attgtgaagt atgttttgaa gtcatacagg 11220

aagtgtgaaa tcccaaagca gctggctgcc gctgcatgcg accagtcatt cacctgcatt 11280

gtgtgtgctg tagatgtgag ggactatctg ctgtacctgc aagcaagggg actggcagtg 11340

aagaccattc agcagcatct gggacagctg aacatgctgc ataggaggtc tggactgcct 11400

aggccaagcg atagcaatgc agtgtctctg gtgatgcgcc gcattagaaa ggagaacgtg 11460

gatgctgggg agagggcaaa acaagcactg gcatttgagc gcaccgactt tgaccaagtg 11520

aggtgggctt cgcaactgct gcctgaactt cctgttcctg tgcatgtaca tgagagtcgg 11580

ttggaacagg ctcatactgc gcctgattga taggctgtcc cacattgttt tatttgctgt 11640

atcgatgtat tcattttgca ttgggtcctt tctgctcatg aagcaccaag aaggctggct 11700

gtcaatggca tgccagctca tgccatctgg atgacattat gcaagaccag tgttgactcg 11760

aacatgaatc ttactggaaa ctttaatgaa tgctttcgag ctttttgtgc aggtctctga 11820

tggagaactc agaccgctgc caagacatcc gcaatctggc atttctgggg atcgcctaca 11880

acacactgct gaggattgcc gagatcgcac gcattagggt gaaggacatt agccgcacag 11940

atggagggag gatgctgatc catatcggga ggacaaagac cctggtgagc acagctggag 12000

tggagaaagc actgtctctg ggagtgacca aggtaagctt accatgtgtt tatatgaagc 12060

tgatatttgg aagaaaggag gaagcaacga caacaagggc ggtgcacaat ctattgccgc 12120

ttttgaatct tgcccgcaaa ggcagtcgat gattgctcac tgtatcaggt tgatttagtt 12180

gatgaggtgt agctggggaa gctccaatcc ccagtccaga tagccttggt tatgaattgc 12240

ataatgtagg caccacttgc actggtccta aaccccagtt cattcctgtc cttctcgtgc 12300

attttgtcaa atgaacatgc aaccgagtgt gttttcctac tcgacatgtg tgcgattgcc 12360

cacgtgtgct gcagctggtg gaacggtgga ttagcgtgtc tggagtggca gatgacccca 12420

acaactacct gttttgccgc gtgcgcaaga atggagttgc tgcacctagc gcaaccagtc 12480

aactgtctac aagggcactg gaggggatct ttgaggcaac acatcgcctg atctacgggg 12540

caaaggatga ttctgggcag aggtatctgg cctggtctgg acattctgca agggttggag 12600

cagcaaggga catggcaaga gctggagtga gcattcccga gatcatgtga gaggccccag 12660

caaaaacaac agcactagct gttgctgctc agtttgtgct cgtgatgttt gaaaggaatg 12720

gacaaggttc atccatgatg ttcattatct gggctggtct tgtacatggg gttattctat 12780

actaaacagg agcgatacaa ataacaaaca atcaatgtct atatacacat atacttggct 12840

aaatttttct cccggcctta catacataac aaaggctaaa ctaattgacc caaaataatt 12900

gtatgaataa tcaaattgat gcatacaaat aatcctaaaa atgaaaaaaa tttcattgaa 12960

ataagtatag aaataacaaa tgtttgaccc acagccctca ctctccaacc caatcctgcc 13020

tctcacaaga cttgccatgt accaacttac aatgacagcg agctacaaca agttccatca 13080

aggtgtgggt tgctattagt tggtggaacg tttgtacatt tcacagttgg acatgcactt 13140

gcgaaaaagg cgttggcttc agtgaggcag tgcttgctcg tatcccctcc aagcatgcct 13200

tgtgcaccca ttttgcaacg caggcaagct ggagggtgga caaacgtgaa catcgtgatg 13260

aactacatcc gcaacctgga cagcgagact ggagcaatgg tgagactgct ggaggatggg 13320

gattaatcag gatgttttga gcggttgtag gttctgtagt tgtatggtag gttgcatgga 13380

ggaaataggc caacaacaat tccaaatcaa aggagattgt agcgttgctc ttggtccccc 13440

tgaaaatttt tgttgttatg tgtctataaa tctagttctg caccttgcaa actgtgggat 13500

gccctgtcca gagcagaagg taatcccaaa acagtcgaga aagtctcgtt gggtggttgt 13560

gtaaagtaca aatgtatgtt ttccaccttg tctttgtatt gtgcacgagc tacagcattg 13620

gtggaagggc ttatagctgc tgggtcatca tgctgtcctg ttcttgatgg tttaggtgtc 13680

atccctttca ctgactcagc gaaatcggat gcgtaccatt catgaacggt gttgcacttg 13740

ctgtttgtga aaggtactgc atgtgcattg tacaatagac tactataatg tctcatgcac 13800

gtggtcaatg atgtagattt ctggaatatg catcgtgtaa ttgattcgat gaacccctcg 13860

tttggaactc tatttgaaaa gcaatcgagt gtcattatcc ataatggatg atgatcatga 13920

gcattgcaaa tagcaccatt agaacaaact gaatattgta caccttgacc tggatatgca 13980

tccgtccttc atcccacttt attaaggcag gttataattg gcaaggagtc ggcagaatag 14040

tcgtttggtt ataccccagt tttagtgggg cctttggcag ctatattatg gtcgcgactg 14100

taaccgggtc cgtttaaagt tcgattacat ctcagaaata taattgggct gcatgttaga 14160

aacttttcgc cgggtataac cggggtataa tcggcatact gcccaatgac ggccagccgc 14220

tggtcagtga ccgtcaaacg gtcggacggt ctgcatcgca tgtgcgctga catgtcaagt 14280

gcatgcttct cttacattca ggcaaaagac tacaagtcat tgaagaattg tcaactcagt 14340

aagctgacaa ttacgttcat gaaggtcagt cgtatgaaac tcgtatttct ccctaagtcg 14400

ttactatgga aagtacatcg tgccacgtca tcgtcatcgt ggcaatgaca gatgatggat 14460

agggtggggt tggcattaat tgctatcatt ttctttgcag aaaacaaata cctggcacat 14520

aatttgttga taatcatatg tatgtatgtc cacatgtcaa cgttatatgt ataaaaatca 14580

agacttgttt gcttaactct aaatttaatg taagaatttc ggtaataatc tgatctacat 14640

tatcacttgt gattaatgtt gaaatttgtt atccttaatt atcgtgcttg gcacaacttt 14700

cagattttgt ctgctgtcac attcatgcag tttcatttgc agtaaattct caatcattta 14760

tgtagttgat aagaatattt gatctgcttt tcattaagca aattttgtta gctttctccc 14820

cttgattgtt cattcaatga gattacattg aatgatgtct acacatataa taagaacgca 14880

tgtctacaca aatctaaaaa tcagctgcac gctcccaatt actatcgcac actctgacac 14940

cagaccgtgc tgtgacaata taagctgcac tgacaaattt ggaaaacaca agattcagaa 15000

gaaaacaaat actggaaccc ctcacacacc acctttctac agcacaaaca cgaagcagta 15060

gccaaggtaa gaaaatccga tcaaaataca ttaaatcatg tctaatatac agcataagta 15120

tagctaatga aatcgttggt cgggccttaa taacacacag tctaccaaca cctagttggt 15180

aaataccgtt gctgatattg ctctgtacca gtaaaagagg gctgcgatga gcgtttttag 15240

tgcacttctt caacacggaa tatttttcac aaattggtat gagaaccaat tttgcaaaat 15300

gttcgccctg taaagtatcg ctctgggacg atcagcttga cgtaattgta ggcgaaaagg 15360

gcgttcaaag tgcagcttta tgtatgaacg tcataaaata taaagcatag cacaatcact 15420

gatagaaaat atttgtgcgc attaaaactc tcacttctgt tgcggataca acgacggaaa 15480

tgagaagctt gtgtaagaag caattcaagt tttcattttg tcatctaagg tgtgatcctc 15540

cgatattcat taccgaatgc tgatctgagt tggaaagatg gcaatattta gctgtgcaca 15600

ctttgacctc caggccttgg cgggaattta gtattctagc tttcctattg gaacgatagg 15660

ccagccaagt ctccagcttg tatacgctac accagcagac atgctctcaa tttagctgac 15720

agtgtcttca tatttgtatt atctgttgtg tctatgccga agaagaagcg caaggtgggc 15780

gactacaagg acgacgacga caagctggag ccaggtatgc ataacctttc aatagatgct 15840

gccgcgcctt gggttcgctg cctgtgtcct gaagtacttt tcaccaggtc tacatgcatg 15900

cagcaactaa tcgttagttg ttcctttgta aacagcgttt tctgtcttta ccatgattca 15960

ggcgagaagc cgtacaagtg tccagagtgc ggcaagagct tcagccagtc aggagcactg 16020

acccgccacc agagaacaca tacacgcgac aagaagtaca gcatcggcct ggacatcggc 16080

accaactctg ttggttgggc ggtgatcacc gacgagtaca aggtgccgag gtatgttatc 16140

tttgattgca ctacttgcag tcctggtggg cactattgtt gtgcataggc gctcttttgc 16200

attcatgtat tgaatgtaga gaagttgtac actcctccta ggagactagc tgatggagtc 16260

ctgtattaaa tttgttcaca tcatatgcct tacagcatga tccattagaa gtaactaaat 16320

ttctaagcac ccagtctgag aaaccagatc gatggcaagt tgctcttggc ttgctgtgct 16380

tgcagcaaga agttcaaggt gctgggcaac accgaccgcc acagcatcaa gaagaacctg 16440

atcggcgcgc tgctgttcga ttctggcgag acagcagagg cgacacgcct gaagagaaca 16500

gcacgcagac gctacacacg ccgcaagaac cgcatctgct acctccagga gatcttcagc 16560

aacgagatgg cgaaggtgga cgacagcttc ttccacaggc tggaggagtc gttcctggtg 16620

gaggaggaca agaagcacga gcgccacccg gtaagtcgcg tgccaagcac tagtttacca 16680

tcccacaaat gacaggtctg ggtgggacat ctgcacctga aaatggctta cgacagctgc 16740

ttctcaattc gagtgtgcat attgcaagca ttagattttt tcctgcagat cttcggcaac 16800

atcgtggatg aggtggcgta ccacgagaag tacccgacca tctaccacct gcgcaagaag 16860

ctggtggaca gcaccgacaa ggcggacctg agactgatct acctggcact ggcgcacatg 16920

atcaagttcc gcggccactt cctgatcgag ggtgagtgtg gaatgcatca cagtggaaac 16980

tgctttgtag tacaatttgt ttgtgaagtt tgtgtctaga tgtccatttg atctgtggaa 17040

tgaatgtgct agctctcatg cacagcagta tttggaatgc tgaattacag tgtttccttt 17100

gttggtgtca ggcgatctga acccggacaa cagcgacgtg gacaagctgt tcatccagct 17160

ggtgcagacc tacaaccagc tgttcgagga gaacccgatc aacgcaagcg gcgtggacgc 17220

aaaggtgtct tgatgtaaag tcgaacattg catttgaacg aaggagctcc cttgttggct 17280

aagcatgggt attgactcta ccccagcagg gaatcatctt gctgcaacag ctcacgtcgt 17340

atttgtatgt ggtgcaggcg attctgagcg caaggctgag caagagccgc agactggaga 17400

acctgatcgc gcaactgcca ggcgagaaga agaacggcct gttcggcaac ctgatcgcgc 17460

tgtcactggg cctgacgccg aacttcaaga gcaacttcga cctggcggag gacgcgaagc 17520

tgcaactgag caaggtgaac gtccccctcg gccctgtgct ggtgtgcctg ctgtccaatg 17580

gcacgtttgt gcttcacaat tctacaggtt gatgcaatgt aggttggttg tgctgatgcc 17640

agagatgcac tcaaccaaca ccgtgttgct ttgttggttc ccaaccagcc tgcaatgcaa 17700

cctgtgaatc gtgcaccata cgatctgcat gcaggacacc tacgacgacg acctggacaa 17760

cctgctggcg caaatcggcg accagtacgc agacctgttc ctggcagcga agaacctgag 17820

cgacgcgatt ctgctgagcg acattctgta agtctcagag cacatcacct gcatcacaca 17880

ggatttcttt tgtcagcata tcctgccttt tcgggtcatg tttggatgcc gtgcggctgt 17940

gtgccactgg tccaggcgta ctgggctttc tgacaagctg gatgttatgc ttatattgca 18000

ggcgcgtgaa caccgagatc accaaggtga gccgcacact tgctattgct cgctttcaca 18060

aaatacccgt cgtgaaaacg tcatgtgaag gttgctatca tcgggtcaga gagtatatta 18120

catcatgaac aggctgcaag ggtttgattc ctgcaggcac cactgagcgc gagcatgatc 18180

aagcggtacg acgagcacca ccaggacctg acactgctga aggcactggt gaggcagcag 18240

cttccggaga aatacaagga gatcttcttc gaccagagca agaacggcta cgcgggctac 18300

atcgatggcg gtgcatctca agaggagttc tacaaattca tcaaggtatg tttggcacac 18360

cattgacaga aggggcatgt cttgcccagt gtgcactgct gtcaggtcga tgagagaagt 18420

ggcaatgaaa aattttggtt tgacaacaaa tatgaggggg tactcgggac tgattggcaa 18480

tgcgttagaa actccgtaag atcaaatttc tgaagtggta gcagtggaag ttcctagctg 18540

agggtgtcac tcactcttat ttctgcagcc gatcctggag aagatggacg gcaccgagga 18600

gctgctggtg aagctgaacc gcgaggatct gctgcgcaag cagcgcacat tcgacaatgg 18660

cagcatcccg caccagatcc atctgggtga gctgcacgcg attctgagaa ggcaggagga 18720

cttctacccg ttcctgaagg acaaccgcga gaagatcgag aagatcctgg tacgtggccc 18780

gggttcacct gttgcgtgca tgttgacttc aggacaaagt tagcattatt acacagcggc 18840

agcacagtga gggtcatcat gtggctggct ttccaattgc tccgagggaa taatcggttg 18900

aatgtgtgtt tctcttgcca gtgtgtcctt ggaggtgcgt gcgtgcttcg caaaaaagga 18960

gtacccaata acccttgaaa caaccagttt tgggctgcaa caacacaaga ccgcggttta 19020

ctgcctgact atgcagacgt tccgcatccc gtactacgtc ggtccactgg cacgcggcaa 19080

cagcagattt gcgtggatga cccgcaagag cgaggagaca atcaccccgt ggaacttcga 19140

ggaggtggtg gataagggtg cgtggccagt accagctgca ccccacaggc ggttgttttg 19200

acatttaaac cgctttcagg aagcgtttgt acactcatgc gcttcatggt ctaccagcag 19260

gaggtctgga acacattcag atctaacatg aaatcaagct tgcatttcaa aagcggggca 19320

tccaagtgca gcggggatga actgctgtct catttctatg caggcgcgtc tgcacagagc 19380

ttcatcgagc gcatgaccaa cttcgacaag aacctgccga acgagaaggt gctgccgaag 19440

catagcctgc tgtacgagta tttcaccgtg tacaacgagc tgaccaaggt gaagtacgtg 19500

accgagggca tgcgcaagcc ggcatttctg agcggtgagc aaaagaaggt gggtggtgca 19560

caatgttgat gcagatttga cgctgtatca ctgctgtctc gctgtacagc atctgataca 19620

ctgctgttcc cgctccccgc aggccatcgt ggacctgctg ttcaagacca accgcaaggt 19680

gaccgtgaag cagctgaagg aggactactt caagaagatc gagtgcttcg acagcgtgga 19740

gatcagcggc gtggaggatc gctttaacgc gagtctgggc acctaccacg acctgctgaa 19800

gatcatcaag gacaaggact tcctggacaa cgaggaggtg attgtgggtg gagtgcaccg 19860

cgaatgaatg gggcactgca gcacaatgga gcacacatcc aatccgcaat gagctctcct 19920

gagacttttt ttggctcctg aagcaaacca gacaatgtgc gcctatttca cggacctggc 19980

gcatggaagt agtctggcaa ctatggctgg agcacaacaa tttctggtta ttttgattgg 20040

aatgattggg ggaaaaaaca atgtgttgcc cgcagcacag gccctggtgc agttgagtta 20100

gctgtagcag tagcagaagg catgtcatcg aaaaagtacc gaattgtgcc atcatcccca 20160

ccctgctgca gaacgaggac atcctggagg acatcgtgct gaccctgacc ctgttcgagg 20220

atcgcgagat gatcgaggag cgcctgaaga catacgcgca cctgtgagtg gttgccctgg 20280

acactggaga tttcttgcat gttgggtgtg gctgattgtg cctgcatcac tggatgattg 20340

tggcacattt tcggtttaat attcagggta ctgctgcaaa cgagcttggt tcaactgacg 20400

tacctgaacc agtcgttttg ctgcttgcag gttcgacgac aaggtaagct gtgacaggac 20460

aagctggcag attcttcact tgcacctgtc cagctgaatc tacaaccatg ggtgaaggat 20520

gctgccgttg ctggcagcca cacctgtttg aaactaaaat gggagcaacc tgtgcagcaa 20580

ggtcctacga tatcatacct gcttcttcaa ccatctgatg ccccttatca acaagcgcac 20640

cctgcaggaa ttacccttgc accaaaacct gggcacgttg cctgccgctt gccagaacta 20700

gctgtctgtg ccactcccaa catgtgccta gcatctgtga tatctgctac aggtgatgaa 20760

gcagctgaag cgccgccgct atacaggttg gggtagactg agccgcaagc tgatcaacgg 20820

catccgcgac aagcagagcg gcaagacaat cctggacttc ctgaagagcg acggcttcgc 20880

aaaccgcaac ttcatgcagc tgatccacga cgacaggtga gccaggggag gtgcattcct 20940

agcctgtgct tgcttgtgtg gaccctattt gggaggagga agattgacct ggtatgaaat 21000

gtgaggctag acaacacatg cgactatttc tctccagcag cactggcagg acgatgggac 21060

tgcatgtgag ggcatgtctt gacatgaaat gtcttgccac cagtttgatg tgttgacatc 21120

gaacatcagc cccccttccc cagctattat ctagttctgg tcctatcaga ccatgcgcaa 21180

tctgctggcg gtctcatctt taaaagcatt cttgtcatca ggctgtgcag tggagccagc 21240

aataaaacca acctattgtt ttgcagcctg accttcaagg aggacatcca gaaggcgcag 21300

gtgtctggtc agggcgatag ccttcacgag cacatcgcga acctggcagg ctcaccagcg 21360

atcaagaagg gcatcctgca gaccgtgaag gtggtggatg agctggtgaa ggtgatgggc 21420

cgccacaaac cggagaacat cgtgatcgag atggcgcgcg agaaccagac aacccaaaag 21480

ggccagaaga acagccgcga gcggtacgca gaactctggc gtagccacgc aaatcatgtt 21540

tgcagatgaa agttttgtca tatgcgcaag accagggacc ttctatgtat caaaaggctt 21600

aacagtgtgt tgttggttat gttgtgcagc atgaagcgca tcgaagaggg catcaaggag 21660

ctgggtgagt catgtggaaa ggtatcatac attagatggt gttcccctgt tgtacaagat 21720

ctggcagcat ttggatgctg ccattggaga tttcatgaga tattcagtta aactaaaagc 21780

gtgagttttc gcagcagagg atagagccaa actcacaaat cattttggct tggtgcaggc 21840

agccagatcc tgaaggagca tccagtggag aacacccagc tgcagaacga gaagctgtac 21900

ctgtactacc tgcagaacgg ccgcgacatg tacgtggatc aggagctgga catcaaccgc 21960

ctgagcgact acgacgtgga ccacattgtg ccgcagtcgt tcctgaagga cgacagcatc 22020

gacaacaagg tgctgacccg cagcgacaag aatcgcggca agagcgacaa cgtgccgtct 22080

gaagaggtga ggcatcgcac aggatataca gtgggttcca tgagtgctgt tgtgttgtgc 22140

attgcttcga cccgctttcc aacctgtgcg tggtgtatgg gtttgcacca tggcgtgcac 22200

gggcacaggc atgtcatgct gcaagcaaca gggccgccaa gcttccttca cctgctcggt 22260

gatctttgtc ccttcctcca ccctcccttt ttccccgccc caggtggtga agaagatgaa 22320

gaactactgg cgccagctgc tgaacgcgaa gctgatcaca cagcgcaagt tcgacaacct 22380

gaccaaggca gagaggggtg gcctgtctga gctggataag gcgggcttca tcaagcgcca 22440

gctggtggag acacgccaga tcacaaagca cgtggcgcag atcctggaca gccgcatgaa 22500

caccaagtac gacgagaacg acaagctgat ccgcgaggtg tgacccgggt gtattagaga 22560

gatgcgcaac gcgtgctggt tgttgttgcc gttgcaccta gggagtaggt cgaatgccgc 22620

gttggtgccc gctggggtgg ctgtatcatg ctggatgggg ttgcaatcag acccgggtaa 22680

gaatgaagtg tggagctcac tgttccgtcg agcgcttcag cctgcttgat ggtgatgccg 22740

gtttggcgca ggtgaaggtg atcaccctga agagcaagct gtgagtggcg tgctgcacaa 22800

ttgtttgtca agtgcacttg ttcttgatac aaagttgggc tcgccattga tagcaagaaa 22860

aagaacttgc cacctggata gctgcgtctg gaacatgttg catggaggga attttatggt 22920

gacacccatg gtgacactct tcatggaacc tgctggccac ctgctggtat gcctcttgag 22980

gctggatgat caacaaatga tgtgccgcag tctacagtca atttcagttc acccagtagc 23040

tgtttttcat tcgtgctgca gggtgagcga cttccgcaag gacttccagt tctacaaggc 23100

aagtgccttc tagggttcag atctaagcca gagcagtgaa caactggtgc tattatatcg 23160

tacatatggt gctaattcgc ctgcttgcag ctcagcaggc accattggtg cacaggaaaa 23220

tcggcgcatg atccaagtgc agctgcgcct cgcagcttgt acccctgctg agttttcttt 23280

cggctgttgc ccatgcaggt gcgcgagatc aacaactacc accacgcgca cgacgcctac 23340

ctgaatgcag tggttggcac cgcgctgatc aagaagtacc cgaagctgga gagcgagttc 23400

gtgtacggcg actacaaggt gtacgacgtg cgcaagatga tcgcgaagag cgaggtgagc 23460

actcacaggc agttctgtta ccaacatctg cgattttctt gggcagagag tgtatcttag 23520

acctcattca cctcagattc ctgagcgagc tgcaatgccc gttgtcagcc tgtgcaatga 23580

aggaaaaacc tgtcgtaatg cttgcagcag gagatcggca aggcaaccgc gaagtatttc 23640

ttctactcga acatcatgaa cttcttcaag accgagatca ccctggcgaa cggcgagatt 23700

gtgagtgtca cagtagtgtg catcttcgtt tgatccagtt tgatccacgt gcagctgccc 23760

atcaagtcca ggttgtggac cttcatcttt ggactggcag tgtatgaaaa gtccactggg 23820

aacctgctct ttttcatacc gcatcatgca tatcgtgtcc catcgtgcgt acttcatgag 23880

ttgtccctat ttttattact gtcgtcatca cttccaacgt ccacagagcc aacacgactt 23940

gtgctgaata aaggaatgaa atcgcctatt taatataaac tggtattgtg ggacaaagtc 24000

caattcgcaa gtctgatgcg cacctgtgca gaggaagagg ccgctgatcg agaccaacgg 24060

cgagacaggc gagatcgtgt gggataaggg ccgcgacttt gcgacagtgc gcaaggttct 24120

gagcatgcca caggtgaaca tcgtgaagaa gaccgaggtg cagaccggcg gcttcagcaa 24180

agagagcatc ctgccaaagc gcaacagcga caagctgatc gcgcgcaaga aggactggga 24240

cccgaagaag tatggcggct tcgacagccc aaccgtggca tatagcgtgc tggtggtggc 24300

gaaggtggag aagggcaaga gcaagaagct gaagagcgtg aaggagctgc tgggtgagcg 24360

gccagcacat gcacctaggt tgcctatcac atggcaccaa attgcatagc catttcaggg 24420

tgattcactt cccggtaaca ggcattgtct ggcagcctca tcgtatgcat gaatggagat 24480

gggtcaattc aagcttgcat ttcaaaagca gggcatccaa gtgcagctgg gatcaactgc 24540

tgtctcattt ctatgcaggc atcaccatca tggagaggag cagcttcgag aagaacccca 24600

tcgacttcct ggaggcgaag ggctacaagg aggtgaagaa ggacctgatc atcaagctgc 24660

cgaagtacag cctgttcgag ctggagaatg gccgcaagcg catgctggca tctgcaggtg 24720

ggtggtgcac aatgttgatg atagtgccct gatgtagtgc gcagatttga cgctgtatca 24780

ctgctgtctc gctgtacagc atctgataca ctgctgttcc cgctccccgc aggtgagctg 24840

caaaagggca acgagctggc actgccgagc aagtacgtga acttcctgta cctggcgagc 24900

cactacgaga agctgaaggg ctcaccggag gacaacgagc agaagcagct gttcgtggag 24960

cagcacaagc actacctgga cgagatcatc gagcagatca gcgagttcag caagcgcgtg 25020

atcctggcag acgcgaacct ggataaggta ggaattttcc cctccctgca ggtggccagg 25080

gaaatgaacc ggtcaccatg taccgggtag cacgggtgga cacacggcag tggccaggga 25140

atcgtactgc tgagggtccc cctgcatgca gactgtgggg gttccctcag gctccgtctt 25200

tgttgcacat gcaatggttt gatcggtctc agttggcatc tctattgaaa ctgctatatt 25260

cctatgccag tgacgcagag gtgaggatgg ttgacaaggt tttgacgtag tgggtgttga 25320

gggtgctgtg caggtgctga gcgcgtacaa caagcaccgc gacaagccaa tccgcgagca 25380

agcagagaac atcatccacc tgttcacgct gaccaacctg ggcgcaccag cagcgttcaa 25440

atatttcgac accaccatcg accgcaagcg gtacacaagc accaaggtac tacctgcctg 25500

cccaaatgct gttgggcttt gcagcacaaa ggaaaattct ccagccaggg tttttcctgc 25560

tgcaacactg ttgtatgatc gctcacaata agggggaaat aggtttccaa gtcatggttg 25620

tgacagtgga aaccaagtct tttttgcctc caccaagttt ttgtcctcaa atttaattca 25680

atggtggttt gtaggaggtg ctggacgcga ccctgatcca tcagagcatc acaggtacag 25740

tgcagcagca caatccctcg tcaagcttac ttgtgttgca ttgccaaatt gcccaatttc 25800

ctatgaagtt tgctgtacat ttgatcatgc gctaaattgc ttttacgttc tatcgctttg 25860

tatgcatgca ggcctgtacg agacccgcat cgacctgtct caactgggtg gcgactgagg 25920

tgcgaatagt gcttcagtaa aaaagtagca acttggtgca atatcgtcag ggtcgtgtgg 25980

tctgctcgcc agcaagtttt ttggcacagg agagcgcttt ttccgagtac cgccaaagtt 26040

caagcatgtg ctgtgattcg ctgttgcctc ttatgataat tgctcaaagt ttccaagcat 26100

tctatgtcca ccctgcacca ctaagttgta tggtgcttat tctgcagggg atgattcatg 26160

gtgcctaaaa attttgtgct gctgtcgcgt ctgttttctg tcgcagttta gtgaatgtaa 26220

ctccaaatac caaacttttc atcacaatca tattgatgcc tttgtaagtg aattacagcg 26280

ttttttgcca taaaaagaag taccgtgaca ttggggtcgt cataacaaga agctttatga 26340

acaagcagct tgatctacga gacttataca taaatggttt cgggtaactc ctaatacggg 26400

gctacgttag ttcagcagct gagaacgacc acgaacggga agaattccag ccatgttgaa 26460

gaggtgcagc tatcaaggtg aggtctttac tggtgtctgt tattgctgta acatcatttc 26520

gctgttgcac aatttaaaca tttgtaattt actgttgtta ttgcagtggc cacttgtagc 26580

agtggcagcg aggcactgac acttctacgt gaacgcaacg aggacggatc ctccgaccag 26640

ttcgacctcg tactgtcaga tgtttacatg ccgggtatgt cgtattcctt tgtaaacttt 26700

acaatatgcg tctagtttga cgcgtacact ttgtacactt tgcaaaaacg caccctgcga 26760

ggtctgccat ttggtcacta caacttggcc accttggttg caagtttgca agttcgctct 26820

acgtcaacgc tgcaaaatga accaattgtt ttgcactgac cctgccaacc ttcatttgtg 26880

gctgcagaca tggacggttt caagctgctt gaacacatcg gtctagagtt ggagcttccc 26940

gttatcagta agttgatcga gccgagtcca gagcgaagcc tgcttctata ctattagcag 27000

ctgtcttttg atatttgaca gcttgacttg atatggtcac agagcatact tgcaaccagg 27060

ttacctgttg aactagcaac tgtgcccaag catctcttca agcacctccg tcagtccata 27120

gggtactgtt gatttgtact ctgcaatact gcactgtaat gcgctgtgaa tcactgccct 27180

tcacctctag atggtgcttc cctggagccc tcccccacct ccgcctcaag cccctcacat 27240

gcctctcccc cccctgcagt gatgtcatcc aacggggaca cgaatgtcgt gctgcggggg 27300

gtcacccacg gggctgtgga ctttctgatc aagcccgttc gaattgagga gctgcggaac 27360

gtgtggcagc acgtggtgcg tcgtcgttcc atggcgctgg ccaggacgcc agacgagggg 27420

ggacactcgg acgaggactc tcaggtgccc ttggcagctt ctgggcggct tgctgtgtcg 27480

gatgccactt ggactgggga tgcacgaggg gtggggggac aatgggagat gggccatagt 27540

aggccagagt tgatggcagt ggtggtgggg gggagtaggc gggagagaag cagccatcct 27600

ggtgttggtt ttgatgattg agtgcatggg gatgatgcac aggtgagctg actggatgcc 27660

ttgtcttgct gtgctgcgct gcagcggcac agtgtgaaac gcaaggagtc ggagcagagc 27720

ccgctgcagc tcagcacaga gcagggcggg aacaagaagc caagagtggt gtggtcggtg 27780

gagatgcacc aacaggtgtg cttgcgggcg ggtgtatacg ggggaggggg gccagctgct 27840

ggctgacctg gcgtgcgcgg tgcattgcac ttggcgatga ggggcgtgct tcagtatgta 27900

gctgggacgc aattggttgt gctgtgtgac cagtgcacaa aatacatccc tgaattccag 27960

tgggttgaac agagttgtcc tggaggtggg aagcaaacgc gcacgtggta gaggggagca 28020

gggtgcagaa cagccgcagc aggggtgttg cgcagtgtgc aggtatcctg cctccatgcc 28080

ccgggccatg ggcatactac gctggtaccg tcaggatggg cgttgagcct ggcttggggg 28140

gcagggggcg agcgaatgcg gaatgggagc ggcaggtgct gggagggtgg ctgactggct 28200

tgcaggagcg caagtcctgt cgggggcgtc gtcctgttcc ctcctgcccg cttcacccac 28260

gttcactctc atgcctccac actcctgctg ctgacacacc tgtcgccacc tccgctgcag 28320

tttgtgaacg cggtcaactc cctgggcatt gacaaggcgg tgcccaagcg gattctggac 28380

ctgatgaacg tggaggggct gacgcgcgag aacgtggcca gccatctgca ggtgcctgcc 28440

atgacccgcg at 28452

<210> 87

<211> 30

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 87

aggctactct cagacatgac ggtggctctg 30

<210> 88

<211> 30

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 88

gccacaaatg aaggttggca gggtcagtgc 30

<210> 89

<211> 23

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 89

acaccacctt aaggcacatg agg 23

<210> 90

<211> 23

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 90

ggcgtgggac atggtgcgca agg 23

<210> 91

<211> 55

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 91

tgaagcaccc cccggcctct ccccccgcag ggccgcccct cccgcctcgt cgtgc 55

<210> 92

<211> 56

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 92

cgcaacgctc tccctcccca ccccccagcc tcacatccgc ctcaagcagc gccctg 56

<210> 93

<211> 25

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 93

caagctatgc gaggaaggga gggtc 25

<210> 94

<211> 23

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 94

ctgccgcaag tgagtgtgct gtc 23

<210> 95

<211> 25

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 95

caccagatat aggtgacccg ataac 25

<210> 96

<211> 24

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 96

aaaactccac tgcacctgca acat 24

<210> 97

<211> 20

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 97

tgcggtgaag cttggagctg 20

<210> 98

<211> 59

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 98

ttgccgtcga cgagacttcg gggcgcgcat ttatcgactc tcttgaagat acaccggtt 59

<210> 99

<211> 65

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 99

tccaattgta gatatcatat tgtttccgga cctaccttac gcactgagtg ctgccagatg 60

ttctt 65

<210> 100

<211> 28

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 100

gaggtgggtg gtagtgcttc gcgaggtg 28

<210> 101

<211> 29

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的引物

<400> 101

atcacagctc acagggcaga cactgcgtc 29

<210> 102

<211> 7

<212> PRT

<213> 未知

<220>

<223> 对未知的描述: 'SHAQKYF'类家族蛋白序列

<400> 102

Ser His Ala Gln Lys Tyr Phe

1 5

<210> 103

<211> 23

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 103

gggacatggt gcgcaaggac ggg 23

<210> 104

<211> 23

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 104

tgcggtgaag cttggagctg tgg 23

<210> 105

<211> 23

<212> DNA

<213> 人工序列

<220>

<223> 对人工序列的描述: 合成的寡核苷酸

<400> 105

acaccacctt aaggcacatg agg 23

245页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:灵长类动物视网膜色素上皮细胞特异性启动子SynP61

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!