Construction of chimeric SacAS9 based on evolutionary information for enhanced and extended PAM site recognition
阅读说明:本技术 基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别 (Construction of chimeric SacAS9 based on evolutionary information for enhanced and extended PAM site recognition ) 是由 谢震 马大程 张昭煜 许志锰 于 2018-07-05 设计创作,主要内容包括:本发明提出了一种Cas9蛋白突变体。该Cas9蛋白突变体具有:框架区;和PAM识别区,所述PAM识别区识别下列核酸序列的至少之一:5’-NNNRRT-3’;5’-NNNRRN-3’;5’-NNNRCN-3’;5’-NNNRTN-3’;5’-NNNCAA-3’;5’-NNNCAT-3’;5’-NNNCGT-3’;5’-NNNCGC-3’;5’-NNNGTN-3’;5’-NNNTCN-3’;5’-NNNTTC-3’;5’-NNNTTG-3’;5’-NNNTTT-3’;N=A、T、G或C,R=A或G。(The invention provides a mutant of Cas9 protein. The Cas9 protein mutant has: a framework region; and a PAM recognition region that recognizes at least one of the following nucleic acid sequences: 5 '-NNNRRT-3'; 5 '-NNNRRN-3'; 5 '-NNNRCN-3'; 5 '-NNNRTN-3'; 5 '-NNNCAA-3'; 5 '-NNNCAT-3'; 5 '-NNNCGT-3'; 5 '-NNNCGC-3'; 5 '-NNNGTN-3'; 5 '-NNNTCN-3'; 5 '-NNNTTC-3'; 5 '-NNNTTG-3'; 5 '-NNNTTT-3'; N-A, T, G or C, R-a or G.)
1. A mutant Cas9 protein, characterized in that it has:
a framework region; and
a PAM recognition region that recognizes at least one of the following nucleic acid sequences:
5 '-NNNRRT-3', N-A, T, G or C, R-a or G;
5 '-NNNRRN-3', N-A, T, G or C, R-a or G;
5 '-NNNRCN-3', N-A, T, G or C, R-a or G;
5 '-NNNRTN-3', N-A, T, G or C, R-a or G;
5 '-NNNCAA-3', N ═ A, T, G or C;
5 '-NNNCAT-3', N ═ A, T, G or C;
5 '-NNNCGT-3', N ═ A, T, G or C;
5 '-NNNCGC-3', N ═ A, T, G or C;
5 '-NNNGTN-3', N ═ A, T, G or C;
5 '-NNNTCN-3', N ═ A, T, G or C;
5 '-NNNTTC-3', N ═ A, T, G or C;
5 '-NNNTTG-3', N ═ A, T, G or C;
5 '-NNNTTT-3', N-A, T, G or C.
2. A Cas9 protein mutant according to claim 1, wherein the framework regions have at least 70% homology with the framework regions of the following wild-type proteins; preferably, at least 80% homology; more preferably, at least 90% homology; more preferably, at least 95% homology; more preferably, at least 99% homology;
O13、O40、O23、O39、O26、O18、O38、O12、O36、O27、O10、O33、O34、O14、O44、O15、O28、O42、O20、O37、O24、O43、O30、O31、O32、O29、O16、O19、O25、O21、O17、O35、O22、saCas9、SaCas9-KKH。
3. a Cas9 protein mutant according to claim 1, wherein the framework region has at least 90% homology with the framework region of saCas 9; more preferably, at least 95% homology; more preferably, at least 99% homology;
preferably, the framework regions have at least one of the following mutations compared to the framework regions of saCas9, relative to saCas 9:
the 499 th position of the mutant is A,
the 500 th mutation is K,
the 654 th mutation is A,
the mutation at the 655 th position is R,
mutation at the 782 th position to K,
the 968 th position is mutated into K,
the 1015 th mutation is H.
4. A Cas9 protein mutant according to claim 1, wherein the framework regions have the amino acid sequence of SEQ ID NO: 1-2, and 130.
5. A Cas9 protein mutant according to claim 1, characterized in that the PAM recognition region has at least one mutation compared to 982IGVNNDLLNRIEV994 relative to saCas 9.
6. A Cas9 protein mutant according to claim 1, characterized in that the PAM recognition region has at most 13 mutations compared to 982IGVNNDLLNRIEV994 relative to saCas 9;
preferably, there are at most 8 mutations.
7. A Cas9 protein mutant according to claim 1, characterized in that the PAM recognition region has at least one of the following mutations compared to 982IGVNNDLLNRIEV994, relative to saCas 9:
the 982 th mutation is T, K, R or L,
the 983 th mutation is A, C or S,
the mutation at the 984 th position is T, D,
the mutation at the 985 th position is F, S, A, N,
the mutation at the 986 th site is E, D, H, A, M,
the mutation at the 987 th site is S, G, N, S, D, E, P,
the mutation at the 988 th site is D, K, T, S, T, D, K, R, E, A,
the mutation at the 989 th site is R, A, N, Q, G, E, T, K, S, G, H, V,
the 990 th mutation is S,
the 991 th mutation is I, V, L, K, T, M,
the 992 th mutation is V, L,
the 993 th mutation is Q,
the 994 th mutation was L, M, C, I, A.
8. The mutant Cas9 protein according to claim 1, wherein relative to sacAS9, the PAM recognition region has a mutation at 986 position to S and a mutation at 991 position to R compared with 982IGVNNDLLNRIEV994 under the premise that the mutation at 985 position to S;
preferably, on the premise that the 985 th mutation is A, the 986 th mutation is M and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is A, the 986 th mutation is N and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is H and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is L;
preferably, under the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is H and the 991 th mutation is R;
preferably, under the premise that the 985 th mutation is N, the 986 th mutation is E and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is S, the 986 th mutation is S and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is K;
preferably, the 986 th mutation is A and the 991 th mutation is T on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is T;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is R;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is S, the 986 th mutation is M and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is A, the 986 th mutation is M and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is L;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is M;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is T;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is M;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is F, the 986 th mutation is S and the 991 th mutation is L;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is L;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is K;
preferably, the 986 th mutation is S and the 991 th mutation is R on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, the mutation at 986 is N and the mutation at 991 is I under the precondition that the mutation at 985 is N.
9. A Cas9 protein mutant according to claim 1, characterized in that the PAM recognition region has an amino acid sequence shown in SEQ ID NO. 3-SEQ ID NO. 43;
optionally, the Cas9 protein mutant has an amino acid sequence shown as SEQ ID NO: 44-84 and 131.
10. A nucleic acid encoding a Cas9 protein mutant according to any one of claims 1 to 9.
11. The nucleic acid of claim 10, wherein the nucleic acid has the nucleotide sequence of any one of SEQ ID NOS 85-125 and 132.
12. A kit, comprising:
a first nucleic acid molecule encoding a mutant Cas9 protein of any one of claims 1-9; and
a second nucleic acid molecule encoding a gRNA.
13. The kit of claim 12, wherein the first nucleic acid molecule has the nucleotide sequence of any one of SEQ ID NOs 85-125 and 132.
14. The kit of claim 12, wherein the nucleotide sequence of the second nucleic acid molecule encoding the gRNA framework sequence has at least one of the following mutations compared to the nucleotide sequence of a wild-type gRNA framework sequence: U3C, U4A, U4C, U5C, A6G, a32G, a31T, a31G, a30G, T29C.
15. The kit of claim 14, wherein the second nucleic acid molecule has the nucleotide sequence of any one of SEQ ID NOS 127-129.
16. The kit of claim 12, wherein the first nucleic acid molecule and the second nucleic acid molecule are carried on the same expression vector.
17. The kit of claim 16, wherein said same vector is an adenoviral vector.
18. A method for genetically modifying a cell, characterized in that a first nucleic acid molecule and a second nucleic acid molecule are introduced into the cell to be modified, the first nucleic acid molecule and the second nucleic acid molecule being as defined in any one of claims 12 to 17.
19. The method of claim 18, wherein the PAM recognition region sequence and the gRNA sequence of the Cas9 protein mutant are determined based on a gene sequence to be engineered.
20. The method of claim 18, wherein the PAM recognition region sequence and the gRNA sequence are determined based on the following relationship:
21. the method of claim 18, wherein the genetic modification comprises knockout or regulation of expression of a predetermined site.
22. A cell obtained by the method of any one of claims 18 to 21.
23. The cell of claim 22, wherein the cell is an animal cell, a plant cell, or a microbial cell.
Technical Field
The invention relates to the technical field of biology, in particular to a Cas9 protein mutant, nucleic acid, a kit, a method for genetically modifying cells and the cells.
Background
The CRISPR/Cas9 ribozyme enables efficient gene editing in a variety of species and in different cells. Cas9 is directed to recognize different positions by artificially coupling crRNA and tracrrna to form guide RNA. Cas9 still needs to bind before a specific PAM sequence.
The widely used SpCas9 recognizes the NGG PAM sequence, while another SaCas9 protein recognizes the PAM sequence of "NNGRRT," which limits the range of recognition by SaCas 9. To extend the recognition range of SpCas9, multiple different PAMs were mined by a method of screening in bacteria using direct evolution. Meanwhile, PAM of KKH-SaCas9 was expanded to NNNRRT by introducing three-point mutation. Although the PAM recognition range of KKH-SaCas9 is more extended compared to SaCas9, in theory KKH-SaCas9 can only bind to 1/16 region.
Although Cas9 homologous proteins are widely distributed in bacteria, many different Cas9 homologous proteins have been identified. However, few have been identified for efficient gene editing in mammalian cells.
Therefore, Cas9 with PAM broad recognition capability requires continued development and improvement by researchers, making the gene editing capability of CRISPR/Cas9 system more powerful.
Disclosure of Invention
The present application is based on the discovery and recognition by the inventors of the following facts and problems:
the inventor of the application discovers a series of different SaCas9 homologous proteins by a method of evolution information and gene mining, and further designs a series of different Cas9 chimeras (cCas9) by using KKH SaCas9 as a framework and replacing 13 amino acid residue peptide segments of a PAM action region with other homologous protein sequences. These different cscas 9 have different PAM specificities, and in addition to NNNRRT, different mutants can also recognize PAM regions including NNNRRN, NNNRCN, NNNRTN, NNNCAA, NNNCAT, NNNCGT, NNNCGC, NNNGTN, NNNTCN, NNNTTC, NNNTTG, NNNTTT (N A, T, G or C, R a or G). The inventors of the present application succeeded in extending the PAM recognition range of Cas9 to greater than 1/2, (49 PAM species in total listed above, 64 PAM species in total, the probability of this recognition range is 49/64). Not only expands the tendency of PAM, but also discovers a plurality of new chimeras.
To this end, in a first aspect of the invention, the invention proposes a mutant Cas9 protein. According to an embodiment of the invention, it has: a framework region; and a PAM recognition region that recognizes at least one of the following nucleic acid sequences:
5 '-NNNRRT-3', N-A, T, G or C, R-a or G;
5 '-NNNRRN-3', N-A, T, G or C, R-a or G;
5 '-NNNRCN-3', N-A, T, G or C, R-a or G;
5 '-NNNRTN-3', N-A, T, G or C, R-a or G;
5 '-NNNCAA-3', N ═ A, T, G or C;
5 '-NNNCAT-3', N ═ A, T, G or C;
5 '-NNNCGT-3', N ═ A, T, G or C;
5 '-NNNCGC-3', N ═ A, T, G or C;
5 '-NNNGTN-3', N ═ A, T, G or C;
5 '-NNNTCN-3', N ═ A, T, G or C;
5 '-NNNTTC-3', N ═ A, T, G or C;
5 '-NNNTTG-3', N ═ A, T, G or C;
5 '-NNNTTT-3', N-A, T, G or C.
Compared with Cas9, the recognition range of the protein mutant of Cas9 provided by the embodiment of the invention is expanded to be close to 1/2, and the tendency of PAM is greatly expanded. According to the Cas9 protein mutant provided by the embodiment of the invention, under the guide of guide RNA, a dsDNA region capable of being combined is greatly expanded, and the gene editing capacity of a CRISPR/Cas9 system is more powerful.
According to an embodiment of the present invention, the Cas9 protein mutant may further include at least one of the following additional technical features:
according to embodiments of the invention, the framework regions of the Cas9 protein mutant are at least 70% homologous to the framework regions of the following wild-type proteins; preferably, at least 80% homology; more preferably, at least 90% homology; more preferably, at least 95% homology; more preferably, at least 99% homology;
O13、O40、O23、O39、O26、O18、O38、O12、O36、O27、O10、O33、O34、O14、O44、O15、O28、O42、O20、O37、O24、O43、O30、O31、O32、O29、O16、O19、O25、O21、O17、O35、O22、saCas9、SaCas9-KKH。
compared with Cas9, the above Cas9 protein mutant provided by the embodiment of the invention has a wide PAM recognition range, and under guide RNA guidance, the Cas9 protein mutant provided by the embodiment of the invention has a greatly expanded dsDNA region capable of being combined, and the gene editing capability of a CRISPR/Cas9 system becomes stronger.
According to an embodiment of the invention, the framework region has at least 90% homology to saCas 9; more preferably, at least 95% homology; more preferably, it has at least 99% homology. Compared with saCas9, the recognition range of the PAM is wider, and the PAM recognition range of the mutant of the Cas9 protein can be expanded to be close to 1/2.
According to a specific embodiment of the invention, the framework region has an amino acid sequence shown as SEQ ID NO 1-2, 130.
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRV(SEQ ID NO:1)。
NMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQ IDNO:2)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNAKTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATARLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRV(SEQ ID NO:130)。
Wherein the amino acid sequence shown by
According to embodiments of the invention, the PAM recognition region has at least one mutation compared to 982IGVNNDLLNRIEV994 relative to saCas 9.
According to embodiments of the invention, the PAM recognition region has at most 13 mutations compared to 982IGVNNDLLNRIEV994, preferably at most 8 mutations, or at most 7 mutations, or at most 6 mutations, or at most 5 mutations, or at most 4 mutations, or at most 3 mutations relative to saCas 9.
According to embodiments of the present invention, the Cas9 protein mutant has a mutation at any one or more of
According to embodiments of the invention, the PAM recognition region has at least one of the following mutations compared to 982IGVNNDLLNRIEV994 relative to saCas 9: the 982 th mutation is T, K, R or L, the 983 rd mutation is A, C or S, the 984 th mutation is T, D, the 985 th mutation is F, S, A, N, the 986 th mutation is E, D, H, A, M, the 987 th mutation is S, G, N, S, D, E, P, the 988 th mutation is D, K, T, S, T, D, K, R, E, A, the 989 th mutation is R, A, N, Q, G, E, T, K, S, G, H, V, the 990 th mutation is S, the 991 th mutation is I, V, L, K, T, M, the 992 nd mutation is V, L, the 993 th mutation is Q, and the 994 th mutation is L, M, C, I, A. The inventors found that the PAM recognition range of Cas9 protein mutants according to the examples of the present application, having at least one of the mutations described above, is broad, extending to nearly 1/2.
According to the embodiment of the invention, compared with sacAS9, compared with 982IGVNNDLLNRIEV994, compared with the PAM recognition region, on the premise that the 985 th mutation is S, the 986 th mutation is S and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is A, the 986 th mutation is M and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is A, the 986 th mutation is N and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is H and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is L;
preferably, under the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is H and the 991 th mutation is R;
preferably, under the premise that the 985 th mutation is N, the 986 th mutation is E and the 991 th mutation is I;
preferably, on the premise that the 985 th mutation is S, the 986 th mutation is S and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is K;
preferably, the 986 th mutation is A and the 991 th mutation is T on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is T;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is R;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is S, the 986 th mutation is M and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is A, the 986 th mutation is M and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is L;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is M;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is R;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is D and the 991 th mutation is T;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is M;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is F, the 986 th mutation is S and the 991 th mutation is L;
preferably, the 986 th mutation is N and the 991 th mutation is V on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is L;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is S and the 991 th mutation is K;
preferably, the 986 th mutation is S and the 991 th mutation is R on the premise that the 985 th mutation is N;
preferably, the 986 th mutation is S and the 991 th mutation is R on the premise that the 985 th mutation is N;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, on the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is K;
preferably, under the premise that the 985 th mutation is N, the 986 th mutation is N and the 991 th mutation is I;
preferably, the mutation at 986 is N and the mutation at 991 is I under the precondition that the mutation at 985 is N.
Furthermore, the Cas9 protein mutant PAM according to the examples of the present application has a wider recognition range.
According to the embodiment of the invention, the PAM recognition region of the Cas9 protein mutant has an amino acid sequence shown as SEQ ID NO. 3-43.
RSDSSPRENRLEV(SEQ ID NO:3)。
KGDAMPRGNKIEI(SEQ ID NO:4)。
TATNNDKSNKIEV(SEQ ID NO:5)。
LGDANSRQNILEA(SEQ ID NO:6)。
IGVNHDEGNRIEM(SEQ ID NO:7)。
IGVNSDKNNLIEV(SEQ ID NO:8)。
IGVNNSTRNIVEL(SEQ ID NO:9)。
RGDNNPRQNKLEV(SEQ ID NO:10)。
IGVNNDKNNVIEL(SEQ ID NO:11)。
IGINDNKHNKIEL(SEQ ID NO:12)。
IGVNSDDRNIIEL(SEQ ID NO:13)。
IGVNDSEKNKIQL(SEQ ID NO:14)。
KCINNEKTHRIEI(SEQ ID NO:15)。
IGVNHDKTNRIEC(SEQ ID NO:16)。
IGVNEDKRNIIEL(SEQ ID NO:17)。
RGDSSPRENRFEV(SEQ ID NO:18)。
RGDNDPKANKIEV(SEQ ID NO:19)。
IGVNAEKRNTIEV(SEQ ID NO:20)。
IGVNDDAKNTLEL(SEQ ID NO:21)。
VGVNNDSVNRVEL(SEQ ID NO:22)。
VGVNNDTRNVVEL(SEQ ID NO:23)。
VGVNNDSRNVVEL(SEQ ID NO:24)。
RGDSMPRQNKIEM(SEQ ID NO:25)。
RGDAMPRDNKIEV(SEQ ID NO:26)。
IGINNGDKNLVEL(SEQ ID NO:27)。
RGDNNPRQNMIEV(SEQ ID NO:28)。
IGVNNDSTNRVEL(SEQ ID NO:29)。
RGDNDPRRSTIEL(SEQ ID NO:30)。
RGDNNPRQNKLEV(SEQ ID NO:31)。
TATNNDKKNMIEV(SEQ ID NO:32)。
IGVNNNRLNKIEL(SEQ ID NO:33)。
IGVFSDAGNLLEV(SEQ ID NO:34)。
IGDNNPRNNVIEV(SEQ ID NO:35)。
IGVNSDDRNLIEL(SEQ ID NO:36)。
IGVNSDDRNKIEL(SEQ ID NO:37)。
IGVNSDDRNRIEL(SEQ ID NO:38)。
IGVNSDLLNRIEV(SEQ ID NO:39)。
IGVNNNLLNKIEV(SEQ ID NO:40)。
IGVNNDLLNKIEV(SEQ ID NO:41)。
IGVNNSTRNIKEL(SEQ ID NO:42)。
IGVNNSTRNILEL(SEQ ID NO:43)。
The inventor finds that the Cas9 protein mutant with the PAM recognition sequence can be combined with a very wide dsDNA region under the guide of guide RNA, the PAM recognition range is expanded to be close to 1/2, and the gene editing capacity of the CRISPR/Cas9 system is more powerful.
According to the embodiment of the invention, the Cas9 protein mutant has amino acid sequences shown as SEQ ID NO: 44-85 and 131.
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRSDSSPRENRLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:44)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVKGDAMPRGNKIEINMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:45)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVTATNNDKSNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:46)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVLGDANSRQNILEANMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:47)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNHDEGNRIEMNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:48)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDKNNLIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:49)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNSTRNIVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:50)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNNPRQNKLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:51)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNDKNNVIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:52)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGINDNKHNKIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:53)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNIIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:54)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNDSEKNKIQLNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:55)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVKCINNEKTHRIEINMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:56)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNHDKTNRIECNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:57)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNEDKRNIIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:58)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDSSPRENRFEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:59)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNDPKANKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:60)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNAEKRNTIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:61)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNDDAKNTLELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:62)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVVGVNNDSVNRVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:63)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVVGVNNDTRNVVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:64)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVVGVNNDSRNVVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:65)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDSMPRQNKIEMNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:66)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDAMPRDNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:67)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGINNGDKNLVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:68)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNNPRQNMIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:69)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNDSTNRVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:70)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNDPRRSTIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:71)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNNPRQNKLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:72)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVTATNNDKKNMIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:73)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNNRLNKIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:74)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVFSDAGNLLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:75)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGDNNPRNNVIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:76)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNLIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:77)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNKIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:78)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNRIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:79)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDLLNRIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:80)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNNLLNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:81)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNDLLNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:82)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNSTRNIKELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:83)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNSTRNILELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:84)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNAKTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATARLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNRIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:131)。
Compared with Cas9, the PAM recognition range of the Cas9 protein mutant with the amino acid sequence is expanded to be close to 1/2, the PAM tendency is greatly expanded, and the off-target efficiency of the Cas9 protein mutant with the amino acid sequence shown as SEQID NO. 131 is further reduced.
In a second aspect of the invention, the invention features a nucleic acid. According to an embodiment of the invention, the nucleic acid encodes the Cas9 protein mutant described previously. Further, after introducing the nucleic acid according to the embodiment of the present invention into the recipient cell, the Cas9 protein mutant described above is obtained under appropriate conditions. The nucleic acid provided by the embodiment of the invention can be used as a leader nucleic acid of a CRISPR/Cas9 system, and can realize a stronger gene editing function after being introduced into a cell to successfully express the Cas9 protein mutant, such as a SaCas9 chimera.
According to an embodiment of the present invention, the above-mentioned nucleic acid may further comprise at least one of the following additional technical features:
according to an embodiment of the invention, the nucleic acid has the nucleotide sequence of any one of SEQ ID NOs 85-125 and 132.
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGAGCGACAGCAGCCCCAGGGAGAACAGGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:85)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAAGGGCGACGCCATGCCCAGGGGCAACAAGATCGAGATCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:86)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGACCGCCACCAACAACGACAAGAGCAACAAGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:87)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGCTGGGCGACGCCAACAGCAGGCAGAACATCCTGGAGGCCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:88)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACCACGACGAGGGCAACAGGATCGAGATGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:89)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACAAGAACAACCTGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:90)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAGCACCAGGAACATCGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:91)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACAACCCCAGGCAGAACAAGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:92)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACGACAAGAACAACGTGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:93)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCATCAACGACAACAAGCACAACAAGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:94)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACATCATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:95)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGACAGCGAGAAGAACAAGATCCAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:96)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAAGTGCATCAACAACGAGAAGACCCACAGGATCGAGATCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:97)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACCACGACAAGACCAACAGGATCGAGTGCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:98)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGAGGACAAGAGGAACATCATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:99)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAGCAGCCCCAGGGAGAACAGGTTCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:100)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACGACCCCAAGGCCAACAAGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:101)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGCCGAGAAGAGGAACACCATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:102)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGACGACGCCAAGAACACCCTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:103)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGGTGGGCGTGAACAACGACAGCGTGAACAGGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:104)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGGTGGGCGTGAACAACGACACCAGGAACGTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:105)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGGTGGGCGTGAACAACGACAGCAGGAACGTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:106)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAGCATGCCCAGGCAGAACAAGATCGAGATGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:107)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACGCCATGCCCAGGGACAACAAGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:108)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCATCAACAACGGCGACAAGAACCTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:109)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACAACCCCAGGCAGAACATGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:110)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACGACAGCACCAACAGGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:111)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACGACCCCAGGAGGAGCACCATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:112)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACAACCCCAGGCAGAACAAGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:113)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGACCGCCACCAACAACGACAAGAAGAACATGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:114)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAACAGGCTGAACAAGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:115)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGTTCAGCGACGCCGGCAACCTGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:116)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGACAACAACCCCAGGAACAACGTGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:117)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAACCTGCTGAACAAGATCGAAGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:118)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACTCGGACCTGCTGAACCGGATCGAAGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:119)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACGACCTGCTGAACAAGATCGAAGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:120)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAGCACCAGGAACAAGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:121)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAGCACCAGGAACCTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:122)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACAAGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:123)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACCTGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:124)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACAGGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:125)。
ATGAAGCGGAACTACATCCTGGGCCTGGACATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACGCCAAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAAACAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCGCCCGGCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACCGGAACCGGATCGAAGTGCTGATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:132)。
The nucleic acid with the nucleotide sequence can effectively encode the Cas9 protein mutant, and has higher expression efficiency.
In a third aspect of the invention, a kit is provided. According to an embodiment of the invention, the kit comprises: a first nucleic acid molecule encoding the Cas9 protein mutant previously described; and a second nucleic acid molecule encoding a gRNA. The mutant Cas9 protein encoded by the first nucleic acid molecule in the kit according to the embodiment of the invention, such as a SaCas9 chimera, can be combined with a very wide dsDNA region under the guide RNA, the PAM recognition range is expanded to be close to 1/2, and the gene editing capacity of the CRISPR/Cas9 system is strong.
According to an embodiment of the present invention, the kit may further comprise at least one of the following additional technical features:
according to an embodiment of the invention, the first nucleic acid molecule has the nucleotide sequence of any one of SEQ ID NOs 85-125, 132. The first nucleic acid with the nucleotide sequence can effectively edit the Cas9 protein mutant, and the expression efficiency is higher.
According to an embodiment of the invention, the nucleotide sequence of the second nucleic acid molecule encoding the gRNA framework sequence has at least one of the following mutations compared to the nucleotide sequence of the wild-type gRNA framework sequence: U3C, U4A, U4C, U5C, A6G, a32G, a31T, a31G, a30G, T29C. Wherein, the wild type gRNA framework sequence has a nucleotide sequence shown in SEQ ID NO. 126.
GTTTTAGTACTCTGGAAACAGAATCTACTAAAACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTT(SEQ ID NO:126)。
Through sequence alignment, the inventors found that positions 4-6 of the gRNA framework sequence were more conserved. And the previous work of the inventor shows that the activity of guide RNA can be improved by changing the base sequence of the 4-6 positions, so that the applicant believes that the interaction of Cas9 in the region and the guide RNA skeleton does not have protein-specific sequence RNA interaction, and the mutation of the region can cause the change of a CRRNA PAM recognition region, and is likely to generate self-non-self-recognition interference. Since consecutive 4T's in the gRNA backbone can cause premature termination of class III polymerase, the inventors surprisingly found that guide RNA activity can be effectively increased by AU inversion at
According to an embodiment of the invention, the second nucleic acid molecule has the nucleotide sequence of any one of SEQ ID NO 127-129.
GTCTTAGTACTCTGGAAACAGAATCTACTAAGACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTT(SEQ ID NO:127)。
GTTATAGTACTCTGGAAACAGAATCTACTATAACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTTT(SEQ ID NO:128)。
GTTCCGGTACTCTGGAAACAGAATCTACCGGAACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTTT(SEQ ID NO:129)。
The efficiency of the second nucleic acid molecule with the nucleotide sequence for coding the gRNA is high, the PAM binding activity of the gRNA obtained by coding and guiding sacAS9 is obviously improved, and the gene editing capacity of a CRISPR/Cas9 system is further enhanced.
According to an embodiment of the invention, the first nucleic acid molecule and the second nucleic acid molecule are carried on the same expression vector.
According to an embodiment of the invention, said one and the same vector is an adenoviral vector.
In a fourth aspect of the invention, a method of genetically modifying a cell is provided. According to an embodiment of the invention, a first nucleic acid molecule and a second nucleic acid molecule are introduced into the cell to be engineered, the first nucleic acid molecule and the second nucleic acid molecule being as defined above. According to the method provided by the embodiment of the invention, the gene transformation of the predetermined site of the cell genome can be realized, and the transformation has high power, high efficiency and strong activity.
According to an embodiment of the present invention, the method may further include at least one of the following additional technical features:
according to embodiments of the invention, the PAM recognition region sequence and the gRNA sequence of the Cas9 protein mutant are determined based on the gene sequence to be engineered.
According to an embodiment of the present invention, the PAM recognition region sequence and the sequence of the gRNA are determined based on the following relationship:
the inventor verifies through experiments that by using the gene modification method according to the embodiment of the invention, the sequences of the PAM identification region and the gRNA realize successful modification of the EMX1, IL1RN, RUNX1 and ZSCAN2 genes under the identification corresponding relationship, and the modification success rate and efficiency are high.
According to an embodiment of the invention, the genetic modification comprises gene knockout or expression regulation of a predetermined site.
In a fifth aspect of the invention, a cell is provided. According to an embodiment of the invention, the above-mentioned cells are cells obtained according to the method described previously. The genome of the cell according to embodiments of the invention is effectively targeted.
According to an embodiment of the invention, the cell is an animal cell, a plant cell or a microbial cell. Furthermore, various cells, such as animal cells, plant cells or microorganism cells, with the genome being effectively targeted for modification can be obtained by using the method for genetic modification according to the embodiment of the present invention, and further, based on the above method, corresponding specific genetically modified animal models, transgenic plants or microorganisms can also be obtained.
Drawings
FIG. 1 is a phylogenetic tree analysis of a SaCas9 homologous protein according to an embodiment of the invention;
FIG. 2 is the alignment of the homologous sequences of the PAM action region of the SaCas9 homologous protein according to the present invention;
FIG. 3 is a schematic diagram of an EFYP recombinant fluorescence reporter system used to explore the propensity of cCas9, according to an embodiment of the present invention;
fig. 4 is a graph of the results of an optimized gRNA backbone promoting Cas9 activity according to an embodiment of the present invention;
fig. 5 is a graph of the results of PAM recognition activities of 32 different cCas9 in accordance with an embodiment of the invention;
FIG. 6 is a graph showing the result of editing of V42 at the position of an endogenous gene and the activity of gene activation according to an embodiment of the present invention;
FIG. 7 is a graph showing the results of V42 and V17K having higher activity at the position of RRV PAM in accordance with an embodiment of the present invention;
fig. 8 is a graph showing the results of enhanced recognition of PAM in the cCas9 series mutants according to an embodiment of the present invention;
fig. 9 is a result graph of editing efficiency of SaCas9V21R and V21L at RRN positions according to an embodiment of the present invention;
FIG. 10 is a graph of the results of evaluating the off-target effect of high fidelity versions of SaCas9-KKH and V21R in accordance with an embodiment of the present invention;
FIG. 11 is a diagram of the sequence alignment of SaCas9 homologous proteins according to an embodiment of the invention; and
fig. 12 is a graph of the results of cleavage activity of orthogonal full-length Cas9 in mammalian cells according to an embodiment of the invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
It should be noted that, unless otherwise specified, the "wild-type protein" described herein refers to both naturally occurring in nature and existing in the prior art, such as O13, O40, O23, O39, O26, O18, O38, O12, O36, O27, O10, O33, O34, O14, O44, O15, O28, O42, O20, O37, O24, O43 are naturally occurring proteins having homology with sasa 43 (refer to the protein numbers marked on the right side of fig. 1, where the protein numbers indicate names of proteins having homology), the cas 43-cas 43 is a naturally occurring protein, and the existing cas 43 is modified from cas 36sa 43, wherein the existing cas-43, as follows:
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRELINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYNNDLIKINGELYRVIGVNNDLLNRIEVNMIDITYREYLENMNDKRPPRIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG
SaCas9-KKH has E782K, N968K, R1015H mutations compared to SaCas9 sequence.
Unless otherwise stated, the PAM recognition region of saCas9 described herein refers to an amino acid position segment at positions 982-994, and the remaining amino acid segments are framework regions.
The emergence of CRISPR/Cas9 technology has brought a revolutionary breakthrough from basic biological research to clinical applications. Although Cas9 protein is widely available in the microbial community, the range that Cas9 can target is limited by PAM. In the examples of the present application, the inventors obtained a series of highly homologous proteins of SaCas9 by gene mining, replacing them into the PI domain responsible for the direct interaction region with PAM in SaCas 9-KKH. The inventors constructed a series of chimeric Cas9 (chimeracas 9, Cas9), and through functional testing, the inventors selected 64 different PAM combinations at the PAM 4-6 position. The inventors have identified multiple cCas9 mutants with enhanced recognition at the PAM position of the RRN. Meanwhile, the recognition capability of ACT, ATG, ATT, GCT, GTG and GTT PAM positions is expanded.
Overall, in the following examples, the inventors provide that a series of cCas9 mutants can achieve that the position of 1/3 can be edited in mammalian cells.