Adc68XY adenovirus vector, virus packaged by same and application

文档序号:900154 发布日期:2021-02-26 浏览:2次 中文

阅读说明:本技术 腺病毒载体AdC68XY、由其包装的病毒及应用 (Adc68XY adenovirus vector, virus packaged by same and application ) 是由 周东明 邢嫚 祝丰 于 2019-08-22 设计创作,主要内容包括:本发明提供了一种新型腺病毒载体AdC68XY、由其包装的病毒及应用。本发明的腺病毒表达载体能够高效地包装,病毒产量高,且获得的病毒感染能力强。同时,本发明的腺病毒表达载体可容纳较大的外源基因,具有很大的应用价值。(The invention provides a novel adenovirus vector AdC68XY, a virus packaged by the same and application thereof. The adenovirus expression vector can be efficiently packaged, the virus yield is high, and the obtained virus has strong infection capacity. Meanwhile, the adenovirus expression vector can contain larger exogenous genes and has great application value.)

1. A method of making an adenoviral expression vector comprising: based on the genome sequence of the chimpanzee adenovirus AdC68, the E1 sequence and the E3 sequence are deleted, and the E4 sequence is modified; the modification comprises replacing the E4 sequence or fragment thereof in the AdC68 genome with the corresponding E4 sequence or fragment thereof in the human serum adenovirus type 5 AdHu5 genome.

2. A recombinant adenoviral vector comprising sequences based on the genome of chimpanzee adenovirus AdC68, wherein the E1 and E3 sequences are deleted and the E4 sequence is engineered; the modification comprises replacing the E4 sequence or fragment thereof in the AdC68 genome with the corresponding E4 sequence or fragment thereof in the human serum adenovirus type 5 AdHu5 genome.

3. The method of claim 1 or the recombinant adenoviral vector of claim 2, wherein the engineering is selected from the group consisting of:

(a) replacing the AdC68 genomic E4 sequence with an AdHu5 genomic E4 sequence; more preferably, the deletion region is the 33518-36105 bit sequence of the AdC68 genome, and the 32914-35641 bit sequence of the AdHu5 genome is inserted into the deletion region;

(b) replacing the sequences of ORF6 and ORF6/7 regions in the sequence of AdC68 genome E4 by the sequences of ORF6 and ORF6/7 regions in the sequence of AdHu5 genome E4; more preferably, the deletion region is the 33518-34671 bit sequence of the AdC68 genome, and the 32914-34077 bit sequence of the AdHu5 genome is inserted into the deletion region; or

(c) Replacing ORF 1-ORF 6/7 sequences in the AdC68 genome E4 sequence by ORF 3-ORF 6 sequences in the AdHu5 genome E4 sequence; more preferably, the deletion region is the 33518-36105 bit sequence of the AdC68 genome, into which the 33193-34703 bit sequence of the AdHu5 genome was inserted.

4. The method of claim 1 or the recombinant adenoviral vector of claim 2, further comprising an element or gene selected from the group consisting of:

an exogenous gene expression cassette;

an expression cassette for a reporter gene;

an inverted terminal repeat sequence; and/or

A restriction enzyme site; preferably comprising: HpaI, I-CeuI, PI-SceI, SwaI and PacI.

5. The method of claim 1 or the recombinant adenoviral vector of claim 2, which has the nucleotide sequence shown in SEQ ID NO 1, but,

wherein positions 29278-31865 are replaced by SEQ ID NO 3;

wherein the 29278-30431 position is replaced by SEQ ID NO. 4; or

Wherein positions 29278-31865 are replaced by SEQ ID NO 5.

6. Use of a recombinant adenoviral vector according to any one of claims 1 to 5 for preparing a recombinant adenovirus; or, for insertion of a foreign gene, thereby expressing the foreign gene.

7. A recombinant adenovirus obtained by packaging the recombinant adenovirus vector according to any one of claims 1 to 5.

8. A method of producing the recombinant adenovirus of claim 7, comprising:

(1) preparing a recombinant adenoviral vector according to any one of claims 1 to 5;

(2) transfecting the recombinant adenovirus vector in (1) into a virus production cell, and packaging to obtain the recombinant adenovirus.

9. Use of the recombinant adenovirus according to claim 7, for: expressing the exogenous gene to obtain the target protein; preferably, the target protein comprises: proteins with immunogenicity, proteins with pharmaceutical activity, proteins with diagnostic activity, structural proteins, enzymes, fusion peptides, reporter proteins.

10. A kit, comprising:

a recombinant adenoviral vector according to any one of claims 1 to 5;

the recombinant adenovirus of claim 7; and/or

A virus-producing cell.

Technical Field

The invention belongs to the field of biomedicine and virology, and particularly relates to a novel adenovirus vector AdC68XY, a virus packaged by the same, a preparation method and application thereof; preferably, the adenoviral vector AdC68XY is constructed based on synthetic biology techniques.

Background

Adenovirus vectors are one of the most promising viral vectors currently. Adenovirus is a non-enveloped, linear, double-stranded DNA virus with a genome size of approximately 36 kb. Since the first discovery of adenoviruses in 1953, to date, 57 human serotypes (classified into subgenera A-G7) and a number of adenoviruses from other species have been isolated, the most common of which is the human serum type 5 adenovirus, AdHu 5. Since adenovirus has high transduction efficiency, can carry foreign genes, is easy to amplify and purify, is regarded as an excellent gene delivery vehicle, and is widely applied to various basic researches or clinical applications, including the development of vaccines for diseases such as HIV, influenza, Ebola, malaria, rabies, prostate cancer, melanoma and the like, and the oncolytic therapy. However, since pre-existing neutralizing antibodies against AdHu5 are present in 75-80% of individuals, the vaccine with AdHu5 as a vector has immune effects affected by the pre-existing neutralizing antibodies, not only impairing the immune effects, but also possibly enhancing side effects. To avoid this problem, the inventors have previously selected as a vaccine vector an adenovirus of a rare serotype or other species origin, such as a chimpanzee type adenovirus or the like. The chimpanzee type adenovirus is only popular in chimpanzees in Africa, and people generally do not contain neutralizing antibodies for resisting the chimpanzee type adenovirus, so the chimpanzee type adenovirus is taken as a vaccine vector, and the comprehensive immune effect of the vaccine vector is superior to that of an AdHu5 vector.

Because the adenovirus genome is relatively large, the construction of the recombinant adenovirus vector is difficult to complete by common molecular biology methods, and therefore, the construction of the adenovirus vector is still a technical bottleneck. Traditional adenovirus vector construction strategies mainly include homologous recombination and direct genome cloning, and are time-consuming and labor-consuming, and take more than 2 months at the fastest speed. A further strategy to improve on this is isothermal assembly, which, although claimed to require only about 1 week, only refers to the time required to join multiple fragments together into one clone by isothermal assembly, not including the time and validation process required to obtain multiple small fragments with complementary sequences at the linker sites. Furthermore, these strategies must first obtain the wild-type virus and amplify and purify the virus to obtain its genomic DNA. If only the adenovirus gene sequence is known, the construction of the adenovirus vector cannot be completed.

In conclusion, there is a need in the art for further research and exploration to obtain improved adenoviral vectors and adenoviruses to enhance their clinical applicability.

Disclosure of Invention

The invention aims to provide a novel adenovirus vector AdC68XY constructed based on synthetic biology technology.

In a first aspect of the invention, there is provided a method of preparing an adenoviral expression vector comprising: based on the genome sequence of the chimpanzee adenovirus AdC68, the E1 sequence and the E3 sequence are deleted, and the E4 sequence is modified; the modification comprises replacing the E4 sequence or fragment thereof in the AdC68 genome with the corresponding E4 sequence or fragment thereof in the human serum adenovirus type 5 AdHu5 genome.

In another aspect of the invention, there is provided a recombinant adenoviral vector comprising sequences based on the genome of the adenovirus AdC68 of chimpanzee type, wherein the E1 sequence and the E3 sequence are deleted and the E4 sequence is engineered; the modification comprises replacing the E4 sequence or fragment thereof in the AdC68 genome with the corresponding E4 sequence or fragment thereof in the human serum adenovirus type 5 AdHu5 genome.

In a preferred embodiment, in the aforementioned method or vector, the modification is selected from:

(a) replacing the AdC68 genomic E4 sequence with an AdHu5 genomic E4 sequence; more preferably, the deletion region is the 33518-36105 bit sequence of the AdC68 genome, and the 32914-35641 bit sequence of the AdHu5 genome is inserted into the deletion region;

(b) replacing the sequences of ORF6 and ORF6/7 regions in the sequence of AdC68 genome E4 by the sequences of ORF6 and ORF6/7 regions in the sequence of AdHu5 genome E4; more preferably, the deletion region is the 33518-34671 bit sequence of the AdC68 genome, and the 32914-34077 bit sequence of the AdHu5 genome is inserted into the deletion region; or

(c) Replacing ORF 1-ORF 6/7 sequences in the AdC68 genome E4 sequence by ORF 3-ORF 6 sequences in the AdHu5 genome E4 sequence; more preferably, the deletion region is the 33518-36105 bit sequence of the AdC68 genome, into which the 33193-34703 bit sequence of the AdHu5 genome was inserted.

In another preferred embodiment, the recombinant adenovirus vector further comprises the following operably linked elements or genes: an exogenous gene expression cassette; an expression cassette for a reporter gene; an inverted terminal repeat sequence; and/or a cleavage site; preferably including (but not limited to): HpaI, I-CeuI, PI-SceI, SwaI and PacI.

In another preferred embodiment, the expression cassette of the reporter gene has the nucleotide sequence shown in SEQ ID NO. 2.

In another preferred example, the enzyme cleavage site includes an enzyme cleavage site for inserting a foreign gene.

In another preferred example, the exogenous gene expression cassette can be expressed to obtain the target protein; preferably, the protein of interest includes (but is not limited to): proteins with immunogenicity, proteins with pharmaceutical activity (e.g. antibodies), proteins with diagnostic activity (e.g. antibodies), structural proteins, enzymes, fusion peptides, reporter proteins.

In another preferred embodiment, the foreign gene is a foreign gene having a length of 30bp to 8.5kb, such as 50bp, 100bp, 200bp, 500bp, 1kb, 2kb, 3kb, 4kb, 5kb, 6kb, 7kb, 8 kb.

In another preferred embodiment, the adenoviral expression vector is constructed using synthetic techniques.

In another preferred embodiment, the recombinant adenovirus vector has the nucleotide sequence shown in SEQ ID NO. 1(AdC68XY1-EGFP), but wherein the 29278-31865 positions are replaced by SEQ ID NO. 3; wherein the 29278-30431 position is replaced by SEQ ID NO. 4; or wherein positions 29278-31865 are replaced by SEQ ID NO 5.

In another aspect of the invention there is provided the use of a recombinant adenoviral vector as described in any one of the preceding paragraphs for the preparation of a recombinant adenovirus; or, for insertion of a foreign gene, thereby expressing the foreign gene.

In another aspect of the present invention, there is provided a recombinant adenovirus obtained by packaging the recombinant adenovirus vector described in any one of the above.

In another aspect of the present invention, there is provided a method for preparing the recombinant adenovirus, the method comprising: (1) preparing a recombinant adenoviral vector as described in any one of the preceding paragraphs; and (2) transfecting the recombinant adenovirus vector of (1) into a virus production cell, thereby packaging to obtain the recombinant adenovirus.

In a preferred embodiment, the virus-producing cell is a cell that can effect viral packaging; preferably comprising: HEK293 cells.

In another aspect of the present invention, there is provided a use of the recombinant adenovirus for: expressing the exogenous gene to obtain the target protein; preferably, the protein of interest includes (but is not limited to): proteins with immunogenic properties, proteins with pharmaceutical activity (such as antibodies, proteins with killing activity (such as tumoricidal proteins)), proteins with diagnostic activity (such as antibodies), structural proteins, enzymes, fusion peptides, reporter proteins.

In a preferred embodiment, the immunogenic proteins include, but are not limited to: protein vaccine for HIV, influenza, Ebola, malaria, rabies, prostate cancer, melanoma, etc.

In another aspect of the present invention, there is provided a kit comprising: a recombinant adenoviral vector as described in any one of the preceding; the recombinant adenovirus as described above; and/or virus producing cells.

Other aspects of the invention will be apparent to those skilled in the art in view of the disclosure herein.

Drawings

FIG. 1 is a schematic diagram showing the construction of the recombinant adenovirus vector pAdC68XY1-EGFP in which the E1 region has been completely deleted; the E3 region was partially deleted, and a SwaI cleavage site was added for subsequent foreign gene insertion.

FIG. 2 shows the results of the restriction and identification of recombinant adenovirus vectors pAdC68XY1-EGFP, pAdC68XY2-EGFP, pAdC68XY3-EGFP and pAdC68XY 4-EGFP. Wherein, M is 1kb plus DNA ladder; 1-3: BglII, Xho I and MfeI.

FIG. 3A viral plaque formation of recombinant adenovirus AdC68XY 1-EGFP.

FIG. 3B viral plaque formation of recombinant adenovirus AdC68XY 2-EGFP.

FIG. 3C viral plaque formation of recombinant adenovirus AdC68XY 3-EGFP.

FIG. 3 viral plaque formation of recombinant adenovirus AdC68XY 4-EGFP.

FIG. 4 shows the results of genome restriction and identification of recombinant adenovirus AdC68XY1-EGFP, AdC68XY2-EGFP, AdC68XY3-EGFP and AdC68XY 4-EGFP. Wherein, M is 1kb plus DNA ladder; 1-3: BglII, Xho I and MfeI.

Detailed Description

Adenovirus has a large genome size, and it is difficult to express foreign genes. Therefore, the present inventors have conducted intensive studies to disclose a novel adenovirus vector constructed based on synthetic biology techniques, which is called AdC68XY series vector. On one hand, the adenovirus expression vector can be efficiently packaged, the virus yield is high, and the obtained virus has strong infection capacity; on the other hand, the adenovirus expression vector can contain larger exogenous genes and has larger application value.

As used herein, the term "element" refers to a series of functional nucleic acid sequences useful for the expression of a protein, and in the present invention, is systematically constructed to form an expression construct. The sequence of the "element" may be those provided in the present invention, and also includes variants thereof, as long as the variants substantially retain the function of the "element", which are obtained by inserting or deleting some bases (e.g., 1 to 50 bp; preferably 1 to 30bp, more preferably 1 to 20bp, still more preferably 1 to 10bp), or by random or site-directed mutagenesis, or the like.

As used herein, the term "operably linked" or "operably linked" refers to a functional spatial arrangement of two or more nucleic acid regions or nucleic acid sequences. For example: the promoter region is placed in a specific position relative to the nucleic acid sequence of the gene of interest such that transcription of the nucleic acid sequence is directed by the promoter region, whereby the promoter region is "operably linked" to the nucleic acid sequence.

As used herein, the term "expression cassette" refers to a gene expression system comprising all the necessary elements required for expression of a gene of interest, typically comprising the following elements: promoter, target gene sequence, terminator; in addition, a signal peptide coding sequence and the like can be optionally included. These elements are operatively connected.

As used herein, the term "construct" refers to a single-or double-stranded DNA molecule that has been artificially manipulated to contain DNA segments combined and arranged according to sequences not found in nature. The "construct" includes an expression vector; alternatively, the "construct" is included in an expression vector as part of the expression vector.

As used herein, the term "exogenous" or "heterologous" refers to the relationship between two or more nucleic acid or protein sequences from different sources, or the relationship between a nucleic acid from a different source and a host cell. For example, a nucleic acid is heterologous to a host cell if the combination of the nucleic acid and the host cell does not normally occur in nature. A particular sequence is "foreign" to the cell or organism into which it is inserted.

Adenoviral vectors

The genome of the adenovirus is relatively large, and in the prior art, the inventor tries more modification strategies, including carrying out total or partial modification on E1 and E3 in the genome and deleting partial genes, but in view of the important role of the E4 region, the prior art has not been precedent for carrying out successful modification on E4. The present inventors have found that a virus having enhanced replication and infection abilities can be obtained by modifying E4 wholly or partially, replacing the corresponding region in the human serum adenovirus type 5 AdHu5 genome, or deleting the sequence to a limited extent, and that even an increase in the insertion capacity of a foreign gene can be achieved.

In the invention, a novel replication-defective recombinant adenovirus vector deleted by E1/E3 is designed by using chimpanzee adenovirus AdC68 as a template and using the genome sequence of the chimpanzee adenovirus AdC 68. The adenovirus genome contains the E1-E4 early genes, and E1A can activate the promoters of E1B, E2A, E2B, E3, and E4. The adenovirus E4 region includes 7 open reading frames, the products of which are ORF1, ORF2, ORF3, ORF4, ORF3/4, ORF6 and ORF6/7, respectively. Among them, E4ORF6 is a multifunctional protein involved in many processes of viral replication and is extremely important for replication of adenovirus. ORF6 can bind to E1B 55-kDa to form an ORF6-E1B complex that promotes export of major late viral mRNA during the post-transcriptional phase, thereby enhancing expression of late viral genes while inhibiting synthesis of host proteins. In adenovirus-infected cells, the ORF6-E1B complex is also involved in inhibiting E1A-induced p53 activation, thereby degrading p53 and inhibiting apoptosis of host cells. Furthermore, ORF6 can indirectly increase the efficiency of viral DNA synthesis. ORF3 is functionally similar to ORF6 and can partially or completely compensate for each other deletion. For improved safety, adenoviruses used as vaccine vectors are usually replication-defective, i.e., the E1 gene is deleted. Thus, non-replicating adenoviruses are typically packaged and amplified in HEK293 cells incorporating the AdHu5E1 gene. Although the interaction between AdHu5E1B 55k and E4ORF6 proteins is relatively conserved, binding of chimpanzee adenovirus E4ORF6 protein to AdHu5E1B 55k may be unstable or less efficient, resulting in reduced non-replicating chimpanzee adenovirus production. In order to avoid the problems and improve the yield of the recombinant adenovirus AdC68, the inventor replaces the E4 sequence of AdC68 with the E4 sequence of AdHu5 in whole or in part, designs a plurality of recombinant AdC68 vectors with different E4 sequences respectively, and names AdC68XY1, AdC68XY2, AdC68XY3 and AdC68XY4 respectively; in the E1 deletion region of the recombinant vector, the green fluorescent protein gene was inserted. Then, the inventor adopts synthetic biology technology, the whole gene is combined into a novel recombinant adenosis AdC68XY1 with green fluorescence label, the linear recombinant adenosis AdC68XY1 is successfully packaged in HEK293 cells, and the genetically stable recombinant adenosis AdC68XY1 is amplified. On the basis of obtaining the recombinant adenovirus vector AdC68XY1, the inventors make other various substitutions on the E4 sequence of AdC68 according to the established design scheme to obtain vectors such as AdC68XY2, AdC68XY3, AdC68XY4, and the like. The method is time-saving and labor-saving, and does not depend on wild adenovirus genome DNA.

Based on the new findings of the present inventors, there is provided a recombinant adenovirus vector (plasmid) comprising: an engineered chimpanzee adenovirus AdC68 genomic sequence; wherein the whole sequence of E1 and the partial sequence of E3 are deleted, and the sequence of E4 is modified; the modification comprises replacing the E4 sequence or fragment thereof in the AdC68 genome with the corresponding E4 sequence or fragment thereof in the human serum adenovirus type 5 AdHu5 genome.

As a preferred mode of the present invention, the modification is selected from: (a) the AdHu5 genomic E4 sequence replaces the AdC68 genomic E4 sequence; (b) replacing the sequences of ORF6 and ORF6/7 regions in the sequence of AdC68 genome E4 by the sequences of ORF6 and ORF6/7 regions in the sequence of AdHu5 genome E4; or (c) replacing the sequences of ORF 1-ORF 6/7 in the sequence of AdC68 genome E4 by the sequences of ORF 3-ORF 6 in the sequence of AdHu5 genome E4.

Preferred embodiments of the present invention further comprise an element or gene selected from the group consisting of: an exogenous gene expression cassette; an expression cassette for a reporter gene; inverted terminal repeats and/or cleavage sites. It is understood that the design of the reporter gene, the restriction enzyme site, and the like is sometimes designed to provide a convenient way to observe or detect when performing research, but is not necessary when the recombinant adenovirus vector or the recombinant adenovirus of the present invention is applied to the field of clinical medicine; alternatively, the reporter gene may be replaced at a position where it is capable of expressing the protein of interest having pharmaceutical activity.

The expression vector usually further contains an origin of replication and/or a marker gene and the like. Methods well known to those skilled in the art can be used to construct the expression vectors required by the present invention, as suggested by the present invention. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombinant techniques, and the like. The DNA sequence may be operably linked to a suitable promoter (e.g., CMV) in an expression vector to direct mRNA synthesis. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator. Furthermore, the expression vector preferably comprises one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells.

The expression vector of the invention can be inserted with an expression cassette of a foreign gene, thereby expressing a target protein of interest. The adenovirus expression vector of the invention can be used as an expression vector platform and is suitable for expressing a plurality of target proteins, and the target proteins are not particularly limited. As some alternative embodiments, the protein of interest includes (but is not limited to): proteins with immunogenicity, proteins with pharmaceutical activity (e.g. antibodies), proteins with diagnostic activity (e.g. antibodies), structural proteins, enzymes, fusion peptides, reporter proteins. The target protein may be a single protein or two or more proteins, so long as the length of the gene encoding the target protein is within the packaging capacity that the adenovirus can accommodate.

In the expression vector of the present invention, the insertion position of the foreign gene expression cassette is not unique, and preferably includes (but is not limited to): the E1 region, the E3 region, the E4 region.

The elements of the expression vectors of the present invention are operably linked to facilitate viral packaging or efficient expression of the protein of interest.

As a preferred mode of the present invention, the novel adenoviral vector of the present invention is constructed using synthetic biology-based techniques. In contrast to the research direction of traditional biology, synthetic biology is a step-by-step building of parts starting from the most basic elements, in the process of which genetic material is designed and modified to finally obtain ideal new organisms. However, it is understood that, in addition to synthetic biology techniques, other methods of constructing the adenoviral vectors of the invention are also encompassed by the invention, once the technical solution of the invention has been obtained.

The invention provides a rapid and effective method for constructing a novel adenovirus vector, which is beneficial to expanding the application of the adenovirus vector and simultaneously obtaining a novel recombinant adenovirus AdC68XY series vector.

Adenoviral vectors

After obtaining the adenovirus expression vector, transfecting the adenovirus expression vector into a virus production cell to propagate viruses. After a period of time following transfection, the virus may be harvested. The virus-producing cell may be any cell known in the art that is capable of propagating adenovirus, such as HEK293 cells and the like.

As a preferred mode of the present invention, the harvested virus can be repeatedly infected with virus-producing cells and passaged continuously. Viral Titer (TCID)50) The determination of (b) can be performed according to a method conventional in the art. The obtained recombinant adenovirus is also included in the present invention.

The adenovirus obtained by the adenovirus vector expression of the invention can form clear and obvious plaques on the surface of a cell culture 4 days after transfecting virus production cells, and the quantity of the plaques is more, obviously because of the adenovirus vector before modification.

The recombinant adenovirus generated by the recombinant adenovirus vector can transfect mammalian cells efficiently, is easy to amplify and purify, has low toxicity, high expression efficiency and long term. The adenovirus vector expression recombination process of the invention is simple, low in price and lasting in curative effect.

Composition, kit or kit

The invention also provides a composition comprising an effective amount of the adenovirus, and a pharmaceutically acceptable carrier. Preferably, the adenovirus expresses a protein of interest. For example, the protein of interest includes, but is not limited to: proteins with immunogenicity, proteins with pharmaceutical activity (such as antibodies, proteins with activity of killing tumor cells), proteins with diagnostic activity (such as antibodies), structural proteins, enzymes, fusion peptides.

As used herein, "comprising" means that the various ingredients can be applied together in a mixture or composition of the invention. Thus, the terms "consisting essentially of and" consisting of are encompassed by the term "comprising.

As used herein, "effective amount" or "effective dose" refers to an amount that produces a function or activity in a human and/or animal and is acceptable to the human and/or animal as used herein.

As used herein, a "pharmaceutically acceptable" component is one that is suitable for use in humans and/or mammals without undue adverse side effects (such as toxicity, irritation, and allergic response), i.e., at a reasonable benefit/risk ratio. The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a therapeutic agent, including various excipients and diluents.

Typically, the adenovirus will be formulated in a non-toxic, inert and pharmaceutically acceptable aqueous carrier medium, typically having a pH of about 5 to about 8, preferably a pH of about 6 to about 8.

The pharmaceutical composition of the present invention can be prepared in the form of an injection, for example, by a conventional method using physiological saline or an aqueous solution containing glucose and other adjuvants. The amount of active ingredient (adenovirus) administered is a therapeutically effective amount, for example from about 0.1 micrograms per kilogram of body weight to about 10 milligrams per kilogram of body weight per day. Of course, the particular dosage will depend upon such factors as the route of administration, the health of the patient, and the like, and is within the skill of the skilled practitioner.

Based on the new discovery of the invention, the invention also provides a kit/kit, wherein the kit/kit comprises the adenovirus expression vector or the adenovirus. The kit/kit may also include virus producing cells (e.g., 293 cells), culture media, and the like. In addition, the kit/kit may further comprise instructions for using the adenovirus after the expression method.

The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. The experimental procedures, for which specific conditions are not noted in the following examples, are generally performed according to conventional conditions such as those described in J. SammBruk et al, molecular cloning protocols, third edition, scientific Press, 2002, or according to the manufacturer's recommendations.

Cells

The HEK293 cell line was purchased from Shanghai cell bank of Chinese academy of sciences, and the culture medium was DMEM containing 10% fetal bovine serum.

Restriction enzyme, competent cell, genomic sequence

All restriction enzymes were purchased from New England Biolabs; stbl2 strain was purchased from Suzhou New Saimei Biotechnology Inc. (Suzhou).

Chimpanzee adenovirus AdC68 sequences are referenced to GenBank published sequences, i.e. GenBank accession numbers: AC _ 000011.1.

The E4 sequence of adenovirus human serotype 5 (AdHu5) is referenced to GenBank accession No.: AC _ 000008.1.

Example 1 preparation of adenovirus vector AdC68XY and recombinant adenovirus

1. Design of AdC68XY AdC

First, the inventors deleted part of the E1 and E3 sequences in the AdC68 genomic sequence. The E1 deletion was 2552bp (deletion region was 459bp-3010bp), and thus a replication-defective adenovirus vector was obtained. The E3 deletion was 3721bp (deletion region AC _000011.1, 27615bp-31335 bp). The deletion of E1/E3 can make the insertion capacity of the foreign gene about 7.5 kb.

Secondly, the inventors transformed or replaced the E4 sequence of AdC68 with the E4 sequence of AdHu5 in a different way to obtain different recombinant vectors, and some optimized recombinant vectors are as follows:

AdC68XY1 vector: wherein E1/E3 is deleted, enzyme cutting sites HpaI are added at 33517bp and 36106bp (corresponding to AC _000011.1), and the sequence of E4 is not changed. Meanwhile, rare enzyme cutting sites I-CeuI and PI-SceI are inserted into the deletion region of E1 by the inventor, so that the insertion of a foreign gene into the region is facilitated. The restriction site SwaI was inserted in the deletion region of E3, so that a foreign gene could also be inserted in this region. Enzyme cutting sites HpaI are added at two ends of the E4 region so as to be convenient for carrying out subsequent modification on the E4 region. The ITRs of the modified AdC68 genome are added with PacI restriction sites on both sides (so that the recombinant adenovirus plasmid is linearized by PacI), and the recombinant adenovirus vector AdC68XY1 is obtained. In order to facilitate the observation of the packaging of the novel recombinant adenovirus, the inventors inserted a CMV-EGFP sequence (CMV promoter driving EGFP expression, SEQ ID NO:2) between the I-CeuI and PI-SceI cleavage sites of the deletion region of E1, so that the obtained recombinant adenovirus carries a green fluorescent marker (AdC68XY 1-EGFP). AdC68XY1-EGFP has the sequence shown in SEQ ID NO 1, and the components are shown in figure 1.

AdC68XY2 vector: wherein E1/E3 is deleted and the E4 sequence is completely replaced by the E4 sequence of AdHu 5: the deletion region of the AdC68E4 sequence is 33518bp-36105bp (corresponding to AC _ 000011.1); the insertion region (SEQ ID NO:3) of the AdHu5E4 sequence was 32914bp-35641bp (corresponding to AC _ 000008.1).

AdC68XY3 vector: wherein E1/E3 is deleted and the E4 regions ORF6/7 and ORF6 are replaced with AdHu 5: the deletion region of the AdC68E4 sequence is 33518bp-34671bp (corresponding to AC _ 000011.1); the insertion region (SEQ ID NO:4) of the AdHu5E4 sequence was 32914bp-34077bp (corresponding to AC-000008.1). In which the ORF1-4 sequence of AdC68 was retained.

AdC68XY4 vector: wherein E1/E3 was deleted and the E4 region was replaced with ORF3-6 of AdHu5 (i.e., ORFs 1-2 and 6/7 were deleted): the deletion region of the AdC68E4 sequence was 33518bp-36105bp (corresponding to AC _000011.1), and the insertion region of the AdHu5E4 sequence (SEQ ID NO:5) was 33193bp-34703bp (corresponding to AC _ 000008.1).

2. Synthesis of recombinant adenovirus pAdC68XY1-EGFP molecular clone

The above-designed recombinant adenovirus pAdC68XY1-EGFP molecular clone sequence was sent to general biological systems (Anhui) Inc., and the molecular clone was synthesized by the company.

On the basis of the sequence of the synthesized pAdC68XY1-EGFP vector, the vector is further modified to obtain an AdC68XY2 vector, an AdC68XY3 vector and an AdC68XY4 vector.

3. Recombinant adenovirus AdC68XY1-EGFP packaging and amplification

The recombinant adenovirus pAdC68XY1-EGFP molecular clone (plasmid DNA) obtained by the synthesis is linearized by PacI digestion, and the digested AdC68XY1-EGFP plasmid DNA is transfected into HEK293 cells (6-well plates) with the density of 60-70% according to the instructions of a Lipofectamine 2000 transfection kit. Before transfection, the medium was changed to serum-free medium, the DNA/liposome complex was added, and after 5 hours of transfection, the medium was changed to DMEM medium containing 10% FBS and 1% double antibody. EGFP fluorescence intensity and cytopathic effect are observed every other day under a fluorescence microscope, cells are collected until a great amount of plaques appear in HEK293 cells, the cells are repeatedly frozen and thawed 3 times between room temperature and an ultra-low temperature refrigerator at minus 80 ℃, centrifugation is carried out for 10min at 3500rpm, and supernatant is collected and placed in the ultra-low temperature refrigerator at minus 80 ℃ for seed preservation.

4. Molecular cloning and virus packaging of pAdC68XY2-EGFP, pAdC68XY3-EGFP and pAdC68XY4-EGFP

On the basis of obtaining the recombinant adenovirus vector pAdC68XY1-EGFP, the inventor carries out other various substitutions on an E4 sequence of AdC68 according to a set design scheme, firstly enzyme-cuts the pAdC68XY1-EGFP vector by HpaI, then respectively obtains pAdC68XY2-EGFP, pAdC68XY3-EGFP and pAdC68XY4-EGFP vectors by an isothermal recombinant molecular cloning method, and packages, amplifies and purifies the viruses according to the method.

5. Identification of recombinant adenovirus AdC68XY-EGFP

Amplifying each recombinant adenovirus, purifying virus liquid by cesium chloride density gradient centrifugation, measuring the titer of the virus, and subpackaging in a refrigerator at the ultralow temperature of-80 ℃ for storage. Each recombinant adenovirus AdC68XY-EGFP was partially purified and isolated from genomic DNA of each recombinant adenovirus according to the instructions of DNeasy Tissue Kit (QIAGEN, Valencia, CA) and then identified by multiple-enzyme cutting.

Example 2 identification of recombinant adenovirus pAdC68XY-EGFP molecular clones

The plasmid DNA of the recombinant adenovirus pAdC68XY1-EGFP is cut by a plurality of enzymes (MfeI, BglII and XhoI) and subjected to electrophoresis by 1% agarose gel, and the molecular clone of the pAdC68XY1-EGFP is proved to be complete and correct by DNA bands, which is shown in figure 2.

Plasmid DNA of other recombinant adenoviruses pAdC68XY2-EGFP, pAdC68XY3-EGFP and pAdC68XY4-EGFP obtained by further modification are subjected to the same enzyme digestion identification, and the plasmid DNA is shown in figure 2.

Example 3 viral packaging and rescue of recombinant adenovirus AdC68XY-EGFP

Lipofefection pAdC68XY1-EGFP, pAdC68XY2-EGFP, pAdC68XY3-EGFP, pAdC68XY4-EGFP plasmid DNA linearized by PacI was transfected into HEK293 cells (cultured in 6-well plates) at a density of about 70%, and green fluorescent protein expression was observed periodically and recorded by photography. Over time, green fluorescence expression changes, fluorescence gradually enriches and forms clusters, and plaques are finally formed, which indicates that the chimpanzee-derived replication-defective adenoviruses AdC68XY1-EGFP, AdC68XY2-EGFP and AdC68XY3-EGFP are successfully packaged and replicated to obtain infectious virus particles in HEK293 cells, and the figure is shown in figure 3.

According to FIG. 3, the adenovirus AdC68XY1-EGFP, which was not modified in the E4 region, showed plaques at day 10, with a plaque number of 3.

AdC68XY2-EGFP with deletion of E4 region showed plaques at day 4, with 11 plaques.

The E4 region-engineered AdC68XY3-EGFP showed plaques at day 4, with a 12 plaque number.

The E4 region-engineered AdC68XY4-EGFP showed plaques at day 4, with a plaque number of 15.

According to the result, the time required for adenovirus rescue after the E4 region is modified is obviously shorter than that of an unmodified group, and the plaque number is obviously more than that of the unmodified group, so that the replication capacity is enhanced after the E4 region is partially or completely replaced by the corresponding region of the AdHu5 adenovirus.

In addition, for the group of the foreign gene insertion capacity AdC68XY4-EGFP, after the deletion of ORF1, ORF2 and ORF6/7 in the E4 region, the insertion capacity of the foreign gene is reduced by 970bp compared with that of AdC68XY1, so that the insertion capacity of the foreign gene is increased and can reach about 8.5 kb. Meanwhile, the inserting capacity of AdC68XY4 is increased, the replication capacity of adenovirus, virus package and the like are not affected, and the yield is not affected.

Example 4 identification of recombinant adenovirus AdC68XY-EGFP

Purifying the amplified recombinant adenoviruses AdC68XY1-EGFP, AdC68XY2-EGFP, AdC68XY3-EGFP and AdC68XY4-EGFP by cesium chloride gradient centrifugation, wherein the titer of each purified recombinant virus is 6 multiplied by 1012vp/L、2×1013vp/L、1×1013vp/L、2×1013vp/L. Higher titer can be obtained.

In the experimental result, the E4 region is partially or completely replaced by the Hu5 source recombinant adenovirus, the virus yield is relatively highest, the expected effect is achieved, and the virus yield is also obviously improved by the other partially modified recombinant adenovirus with the E4 region. Therefore, the invention obtains various recombinant adenoviruses which are more beneficial to growth in HEK293 cells and improves the yield.

Separating virus genome DNA, carrying out enzyme digestion by MfeI, BglII and XhoI, carrying out 1% agarose gel electrophoresis, and proving that recombinant adenoviruses AdC68XY1-EGFP, AdC68XY2-EGFP, AdC68XY3-EGFP and AdC68XY4-EGFP are correct virus genome through the bands of the genome DNA subjected to enzyme digestion. See fig. 3.

Conclusion

In the invention, the adenovirus AdC68 genome sequence is analyzed and designed, and a biosynthesis technology is utilized to synthesize the recombinant adenovirus vector AdC68XY1 as a whole. In order to embody the effect, the inventor adds green fluorescent protein into the plasmid, and can directly obtain the recombinant adenovirus AdC68XY1-EGFP after transfection, virus packaging, amplification and purification. The correct recombinant adenovirus vector AdC68XY is obtained by enzyme digestion identification analysis. On the basis, the inventor further replaces E4 of AdC68XY1 by conventional molecular cloning to obtain recombinant adenoviruses AdC68XY2-EGFP, AdC68XY3-EGFP and AdC68XY4-EGFP, and obtains corresponding recombinant adenoviruses. The vectors can be used in a variety of basic research and clinical applications, such as expression vectors, vaccine development, gene therapy, and the like.

The invention not only obtains a series of novel vectors AdC68XY based on chimpanzee adenovirus, the modified vector is more suitable for growth and propagation in a packaging cell HEK293 compared with the traditional AdC68 vector, and the yield is high, thereby reducing the production cost of the product; in addition, the invention establishes a new strategy for preparing the recombinant adenovirus vector by synthetic biology, and the strategy does not need to obtain wild adenovirus, thereby avoiding complex cloning process, directly synthesizing molecular cloning of the recombinant adenovirus and saving time and labor. The strategy solves the technical bottleneck of constructing the recombinant adenovirus vector, and provides solid theoretical and technical support for the development, popularization and application of the recombinant adenovirus vector.

All documents referred to herein are incorporated by reference into this application as if each were individually incorporated by reference. Furthermore, it should be understood that various changes and modifications of the present invention can be made by those skilled in the art after reading the above teachings of the present invention, and these equivalents also fall within the scope of the present invention as defined by the appended claims.

Sequence listing

<110> Suzhou gambling biotechnology, Inc

<120> AdC68XY adenovirus vector, virus packaged by it and use

<130> 196276

<160> 5

<170> SIPOSequenceListing 1.0

<210> 1

<211> 35048

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<220>

<221> misc_feature

<222> (1)..(35048)

<223> AdC68XY1-EGFP nucleotide sequence of recombinant adenovirus

<400> 1

gtcgagggat gagcgaccgt taggggcggg gcgagtgacg ttttgatgac gtggttgcga 60

ggaggagcca gtttgcaagt tctcgtggga aaagtgacgt caaacgaggt gtggtttgaa 120

cacggaaata ctcaattttc ccgcgctctc tgacaggaaa tgaggtgttt ctgggcggat 180

gcaagtgaaa acgggccatt ttcgcgcgaa aactgaatga ggaagtgaaa atctgagtaa 240

tttcgcgttt atggcaggga ggagtatttg ccgagggccg agtagacttt gaccgattac 300

gtgggggttt cgattaccgt gtttttcacc taaatttccg cgtacggtgt caaagtccgg 360

tgtttttacg tacgatatca tttccccgaa aagtgccacc tgaccgtaac tataacggtc 420

ctaaggtagc gaaagctcag atctcccgat cccctatggt gcactctcag tacaatctgc 480

tctgatgccg catagttaag ccagtatctg ctccctgctt gtgtgttgga ggtcgctgag 540

tagtgcgcga gcaaaattta agctacaaca aggcaaggct tgaccgacaa ttgcatgaag 600

aatctgctta gggttaggcg ttttgcgctg cttcgcgatg tacgggccag atatacgcgt 660

tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt agttcatagc 720

ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc 780

aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg 840

actttccatt gacgtcaatg ggtggactat ttacggtaaa ctgcccactt ggcagtacat 900

caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc 960

tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta catctacgta 1020

ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag 1080

cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt 1140

tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa 1200

atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctctctg gctaactaga 1260

gaacccactg cttactggct tatcgaaatt aatacgactc actataggga gacccaagct 1320

ggctagcgtt taaacgggcc cgcccgggct tataagccac catggtgagc aagggcgagg 1380

agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta aacggccaca 1440

agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg accctgaagt 1500

tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc accctgacct 1560

acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac ttcttcaagt 1620

ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac gacggcaact 1680

acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc atcgagctga 1740

agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag tacaactaca 1800

acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag gtgaacttca 1860

agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac cagcagaaca 1920

cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc acccagtccg 1980

ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag ttcgtgaccg 2040

ccgccgggat cactctcggc atggacgagc tgtacaagta attataagcc cgggcaagct 2100

taagtttaaa ccgctgatca gcctcgactg tgccttctag ttgccagcca tctgttgttt 2160

gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat 2220

aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg 2280

tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg 2340

tgggctctat ggcttctgag gcggaaagaa ccagcagatc tgcagatctg aattcatcta 2400

tgtcgggtgc ggagaaagag gtaatgaaat ggcattatgg gtattatggg tctgcattaa 2460

tgaatcggcc agatatcgac atatgctggc caccgtacat gtggcttccc atgctcgcaa 2520

gccctggccc gagttcgagc acaatgtcat gaccaggtgc aatatgcatc tggggtcccg 2580

ccgaggcatg ttcatgccct accagtgcaa cctgaattat gtgaaggtgc tgctggagcc 2640

cgatgccatg tccagagtga gcctgacggg ggtgtttgac atgaatgtgg aggtgtggaa 2700

gattctgaga tatgatgaat ccaagaccag gtgccgagcc tgcgagtgcg gagggaagca 2760

tgccaggttc cagcccgtgt gtgtggatgt gacggaggac ctgcgacccg atcatttggt 2820

gttgccctgc accgggacgg agttcggttc cagcggggaa gaatctgact agagtgagta 2880

gtgttctggg gcgggggagg acctgcatga gggccagaat aactgaaatc tgtgcttttc 2940

tgtgtgttgc agcagcatga gcggaagcgg ctcctttgag ggaggggtat tcagccctta 3000

tctgacgggg cgtctcccct cctgggcggg agtgcgtcag aatgtgatgg gatccacggt 3060

ggacggccgg cccgtgcagc ccgcgaactc ttcaaccctg acctatgcaa ccctgagctc 3120

ttcgtcgttg gacgcagctg ccgccgcagc tgctgcatct gccgccagcg ccgtgcgcgg 3180

aatggccatg ggcgccggct actacggcac tctggtggcc aactcgagtt ccaccaataa 3240

tcccgccagc ctgaacgagg agaagctgtt gctgctgatg gcccagctcg aggccttgac 3300

ccagcgcctg ggcgagctga cccagcaggt ggctcagctg caggagcaga cgcgggccgc 3360

ggttgccacg gtgaaatcca aataaaaaat gaatcaataa ataaacggag acggttgttg 3420

attttaacac agagtctgaa tctttatttg atttttcgcg cgcggtaggc cctggaccac 3480

cggtctcgat cattgagcac ccggtggatc ttttccagga cccggtagag gtgggcttgg 3540

atgttgaggt acatgggcat gagcccgtcc cgggggtgga ggtagctcca ttgcagggcc 3600

tcgtgctcgg gggtggtgtt gtaaatcacc cagtcatagc aggggcgcag ggcatggtgt 3660

tgcacaatat ctttgaggag gagactgatg gccacgggca gccctttggt gtaggtgttt 3720

acaaatctgt tgagctggga gggatgcatg cggggggaga tgaggtgcat cttggcctgg 3780

atcttgagat tggcgatgtt accgcccaga tcccgcctgg ggttcatgtt gtgcaggacc 3840

accagcacgg tgtatccggt gcacttgggg aatttatcat gcaacttgga agggaaggcg 3900

tgaaagaatt tggcgacgcc tttgtgcccg cccaggtttt ccatgcactc atccatgatg 3960

atggcgatgg gcccgtgggc ggcggcctgg gcaaagacgt ttcgggggtc ggacacatca 4020

tagttgtggt cctgggtgag gtcatcatag gccattttaa tgaatttggg gcggagggtg 4080

ccggactggg ggacaaaggt accctcgatc ccgggggcgt agttcccctc acagatctgc 4140

atctcccagg ctttgagctc ggaggggggg atcatgtcca cctgcggggc gataaagaac 4200

acggtttccg gggcggggga gatgagctgg gccgaaagca agttccggag cagctgggac 4260

ttgccgcagc cggtggggcc gtagatgacc ccgatgaccg gctgcaggtg gtagttgagg 4320

gagagacagc tgccgtcctc ccggaggagg ggggccacct cgttcatcat ctcgcgcacg 4380

tgcatgttct cgcgcaccag ttccgccagg aggcgctctc cccccaggga taggagctcc 4440

tggagcgagg cgaagttttt cagcggcttg agtccgtcgg ccatgggcat tttggagagg 4500

gtttgttgca agagttccag gcggtcccag agctcggtga tgtgctctac ggcatctcga 4560

tccagcagac ctcctcgttt cgcgggttgg gacggctgcg ggagtagggc accagacgat 4620

gggcgtccag cgcagccagg gtccggtcct tccagggtcg cagcgtccgc gtcagggtgg 4680

tctccgtcac ggtgaagggg tgcgcgccgg gctgggcgct tgcgagggtg cgcttcaggc 4740

tcatccggct ggtcgaaaac cgctcccgat cggcgccctg cgcgtcggcc aggtagcaat 4800

tgaccatgag ttcgtagttg agcgcctcgg ccgcgtggcc tttggcgcgg agcttacctt 4860

tggaagtctg cccgcaggcg ggacagagga gggacttgag ggcgtagagc ttgggggcga 4920

ggaagacgga ctcgggggcg taggcgtccg cgccgcagtg ggcgcagacg gtctcgcact 4980

ccacgagcca ggtgaggtcg ggctggtcgg ggtcaaaaac cagtttcccg ccgttctttt 5040

tgatgcgttt cttacctttg gtctccatga gctcgtgtcc ccgctgggtg acaaagaggc 5100

tgtccgtgtc cccgtagacc gactttatgg gccggtcctc gagcggtgtg ccgcggtcct 5160

cctcgtagag gaaccccgcc cactccgaga cgaaagcccg ggtccaggcc agcacgaagg 5220

aggccacgtg ggacgggtag cggtcgttgt ccaccagcgg gtccaccttt tccagggtat 5280

gcaaacacat gtccccctcg tccacatcca ggaaggtgat tggcttgtaa gtgtaggcca 5340

cgtgaccggg ggtcccggcc gggggggtat aaaagggtgc gggtccctgc tcgtcctcac 5400

tgtcttccgg atcgctgtcc aggagcgcca gctgttgggg taggtattcc ctctcgaagg 5460

cgggcatgac ctcggcactc aggttgtcag tttctagaaa cgaggaggat ttgatattga 5520

cggtgccggc ggagatgcct ttcaagagcc cctcgtccat ctggtcagaa aagacgatct 5580

ttttgttgtc gagcttggtg gcgaaggagc cgtagagggc gttggagagg agcttggcga 5640

tggagcgcat ggtctggttt ttttccttgt cggcgcgctc cttggcggcg atgttgagct 5700

gcacgtactc gcgcgccacg cacttccatt cggggaagac ggtggtcagc tcgtcgggca 5760

cgattctgac ctgccagccc cgattatgca gggtgatgag gtccacactg gtggccacct 5820

cgccgcgcag gggctcatta gtccagcaga ggcgtccgcc cttgcgcgag cagaaggggg 5880

gcagggggtc cagcatgacc tcgtcggggg ggtcggcatc gatggtgaag atgccgggca 5940

ggaggtcggg gtcaaagtag ctgatggaag tggccagatc gtccagggca gcttgccatt 6000

cgcgcacggc cagcgcgcgc tcgtagggac tgaggggcgt gccccagggc atgggatggg 6060

taagcgcgga ggcgtacatg ccgcagatgt cgtagacgta gaggggctcc tcgaggatgc 6120

cgatgtaggt ggggtagcag cgccccccgc ggatgctggc gcgcacgtag tcatacagct 6180

cgtgcgaggg ggcgaggagc cccgggccca ggttggtgcg actgggcttt tcggcgcggt 6240

agacgatctg gcggaaaatg gcatgcgagt tggaggagat ggtgggcctt tggaagatgt 6300

tgaagtgggc gtggggcagt ccgaccgagt cgcggatgaa gtgggcgtag gagtcttgca 6360

gcttggcgac gagctcggcg gtgactagga cgtccagagc gcagtagtcg agggtctcct 6420

ggatgatgtc atacttgagc tgtccctttt gtttccacag ctcgcggttg agaaggaact 6480

cttcgcggtc cttccagtac tcttcgaggg ggaacccgtc ctgatctgca cggtaagagc 6540

ctagcatgta gaactggttg acggccttgt aggcgcagca gcccttctcc acggggaggg 6600

cgtaggcctg ggcggccttg cgcagggagg tgtgcgtgag ggcgaaagtg tccctgacca 6660

tgaccttgag gaactggtgc ttgaagtcga tatcgtcgca gcccccctgc tcccagagct 6720

ggaagtccgt gcgcttcttg taggcggggt tgggcaaagc gaaagtaaca tcgttgaaga 6780

ggatcttgcc cgcgcggggc ataaagttgc gagtgatgcg gaaaggttgg ggcacctcgg 6840

cccggttgtt gatgacctgg gcggcgagca cgatctcgtc gaagccgttg atgttgtggc 6900

ccacgatgta gagttccacg aatcgcggac ggcccttgac gtggggcagt ttcttgagct 6960

cctcgtaggt gagctcgtcg gggtcgctga gcccgtgctg ctcgagcgcc cagtcggcga 7020

gatgggggtt ggcgcggagg aaggaagtcc agagatccac ggccagggcg gtttgcagac 7080

ggtcccggta ctgacggaac tgctgcccga cggccatttt ttcgggggtg acgcagtaga 7140

aggtgcgggg gtccccgtgc cagcgatccc atttgagctg gagggcgaga tcgagggcga 7200

gctcgacgag ccggtcgtcc ccggagagtt tcatgaccag catgaagggg acgagctgct 7260

tgccgaagga ccccatccag gtgtaggttt ccacatcgta ggtgaggaag agcctttcgg 7320

tgcgaggatg cgagccgatg gggaagaact ggatctcctg ccaccaattg gaggaatggc 7380

tgttgatgtg atggaagtag aaatgccgac ggcgcgccga acactcgtgc ttgtgtttat 7440

acaagcggcc acagtgctcg caacgctgca cgggatgcac gtgctgcacg agctgtacct 7500

gagttccttt gacgaggaat ttcagtggga agtggagtcg tggcgcctgc atctcgtgct 7560

gtactacgtc gtggtggtcg gcctggccct cttctgcctc gatggtggtc atgctgacga 7620

gcccgcgcgg gaggcaggtc cagacctcgg cgcgagcggg tcggagagcg aggacgaggg 7680

cgcgcaggcc ggagctgtcc agggtcctga gacgctgcgg agtcaggtca gtgggcagcg 7740

gcggcgcgcg gttgacttgc aggagttttt ccagggcgcg cgggaggtcc agatggtact 7800

tgatctccac cgcgccattg gtggcgacgt cgatggcttg cagggtcccg tgcccctggg 7860

gtgtgaccac cgtcccccgt ttcttcttgg gcggctgggg cgacgggggc ggtgcctctt 7920

ccatggttag aagcggcggc gaggacgcgc gccgggcggc aggggcggct cggggcccgg 7980

aggcaggggc ggcaggggca cgtcggcgcc gcgcgcgggt aggttctggt actgcgcccg 8040

gagaagactg gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt 8100

gaaggccacg ggacccgtga gtttgaacct gaaagagagt tcgacagaat caatctcggt 8160

atcgttgacg gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt cctggtaggc 8220

gatctcggtc atgaactgct cgatctcctc ctcttgaagg tctccgcggc cggcgcgctc 8280

cacggtggcc gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc 8340

cgcctcgttc cagacgcggc tgtagaccac gacgccctcg ggatcgccgg cgcgcatgac 8400

cacctgggcg aggttgagct ccacgtggcg cgtgaagacc gcgtagttgc agaggcgctg 8460

gtagaggtag ttgagcgtgg tggcgatgtg ctcggtgacg aagaaataca tgatccagcg 8520

gcggagcggc atctcgctga cgtcgcccag cgcctccaaa cgttccatgg cctcgtaaaa 8580

gtccacggcg aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag 8640

aagacggatg agctcggcga tggtggcgcg cacctcgcgc tcgaaggccc ccgggagttc 8700

ctccacttcc tcttcttcct cctccactaa catctcttct acttcctcct caggcggcag 8760

tggtggcggg ggagggggcc tgcgtcgccg gcggcgcacg ggcagacggt cgatgaagcg 8820

ctcgatggtc tcgccgcgcc ggcgtcgcat ggtctcggtg acggcgcgcc cgtcctcgcg 8880

gggccgcagc gtgaagacgc cgccgcgcat ctccaggtgg ccgggggggt ccccgttggg 8940

cagggagagg gcgctgacga tgcatcttat caattgcccc gtagggactc cgcgcaagga 9000

cctgagcgtc tcgagatcca cgggatctga aaaccgctga acgaaggctt cgagccagtc 9060

gcagtcgcaa ggtaggctga gcacggtttc ttctggcggg tcatgttggt tgggagcggg 9120

gcgggcgatg ctgctggtga tgaagttgaa ataggcggtt ctgagacggc ggatggtggc 9180

gaggagcacc aggtctttgg gcccggcttg ctggatgcgc agacggtcgg ccatgcccca 9240

ggcgtggtcc tgacacctgg ccaggtcctt gtagtagtcc tgcatgagcc gctccacggg 9300

cacctcctcc tcgcccgcgc ggccgtgcat gcgcgtgagc ccgaagccgc gctggggctg 9360

gacgagcgcc aggtcggcga cgacgcgctc ggcgaggatg gcttgctgga tctgggtgag 9420

ggtggtctgg aagtcatcaa agtcgacgaa gcggtggtag gctccggtgt tgatggtgta 9480

ggagcagttg gccatgacgg accagttgac ggtctggtgg cccggacgca cgagctcgtg 9540

gtacttgagg cgcgagtagg cgcgcgtgtc gaagatgtag tcgttgcagg tgcgcaccag 9600

gtactggtag ccgatgagga agtgcggcgg cggctggcgg tagagcggcc atcgctcggt 9660

ggcgggggcg ccgggcgcga ggtcctcgag catggtgcgg tggtagccgt agatgtacct 9720

ggacatccag gtgatgccgg cggcggtggt ggaggcgcgc gggaactcgc ggacgcggtt 9780

ccagatgttg cgcagcggca ggaagtagtt catggtgggc acggtctggc ccgtgaggcg 9840

cgcgcagtcg tggatgctct atacgggcaa aaacgaaagc ggtcagcggc tcgactccgt 9900

ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga atctcgaatc 9960

aggctggagc cgcagctaac gtggtattgg cactcccgtc tcgacccaag cctgcaccaa 10020

ccctccagga tacggaggcg ggtcgttttg caactttttt ttggaggccg gatgagacta 10080

gtaagcgcgg aaagcggccg accgcgatgg ctcgctgccg tagtctggag aagaatcgcc 10140

agggttgcgt tgcggtgtgc cccggttcga ggccggccgg attccgcggc taacgagggc 10200

gtggctgccc cgtcgtttcc aagaccccat agccagccga cttctccagt tacggagcga 10260

gcccctcttt tgttttgttt gtttttgcca gatgcatccc gtactgcggc agatgcgccc 10320

ccaccaccct ccaccgcaac aacagccccc tccacagccg gcgcttctgc ccccgcccca 10380

gcagcaactt ccagccacga ccgccgcggc cgccgtgagc ggggctggac agagttatga 10440

tcaccagctg gccttggaag agggcgaggg gctggcgcgc ctgggggcgt cgtcgccgga 10500

gcggcacccg cgcgtgcaga tgaaaaggga cgctcgcgag gcctacgtgc ccaagcagaa 10560

cctgttcaga gacaggagcg gcgaggagcc cgaggagatg cgcgcggccc ggttccacgc 10620

ggggcgggag ctgcggcgcg gcctggaccg aaagagggtg ctgagggacg aggatttcga 10680

ggcggacgag ctgacgggga tcagccccgc gcgcgcgcac gtggccgcgg ccaacctggt 10740

cacggcgtac gagcagaccg tgaaggagga gagcaacttc caaaaatcct tcaacaacca 10800

cgtgcgcacc ctgatcgcgc gcgaggaggt gaccctgggc ctgatgcacc tgtgggacct 10860

gctggaggcc atcgtgcaga accccaccag caagccgctg acggcgcagc tgttcctggt 10920

ggtgcagcat agtcgggaca acgaagcgtt cagggaggcg ctgctgaata tcaccgagcc 10980

cgagggccgc tggctcctgg acctggtgaa cattctgcag agcatcgtgg tgcaggagcg 11040

cgggctgccg ctgtccgaga agctggcggc catcaacttc tcggtgctga gtttgggcaa 11100

gtactacgct aggaagatct acaagacccc gtacgtgccc atagacaagg aggtgaagat 11160

cgacgggttt tacatgcgca tgaccctgaa agtgctgacc ctgagcgacg atctgggggt 11220

gtaccgcaac gacaggatgc accgtgcggt gagcgccagc aggcggcgcg agctgagcga 11280

ccaggagctg atgcatagtc tgcagcgggc cctgaccggg gccgggaccg agggggagag 11340

ctactttgac atgggcgcgg acctgcactg gcagcccagc cgccgggcct tggaggcggc 11400

ggcaggaccc tacgtagaag aggtggacga tgaggtggac gaggagggcg agtacctgga 11460

agactgatgg cgcgaccgta tttttgctag atgcaacaac aacagccacc tcctgatccc 11520

gcgatgcggg cggcgctgca gagccagccg tccggcatta actcctcgga cgattggacc 11580

caggccatgc aacgcatcat ggcgctgacg acccgcaacc ccgaagcctt tagacagcag 11640

ccccaggcca accggctctc ggccatcctg gaggccgtgg tgccctcgcg ctccaacccc 11700

acgcacgaga aggtcctggc catcgtgaac gcgctggtgg agaacaaggc catccgcggc 11760

gacgaggccg gcctggtgta caacgcgctg ctggagcgcg tggcccgcta caacagcacc 11820

aacgtgcaga ccaacctgga ccgcatggtg accgacgtgc gcgaggccgt ggcccagcgc 11880

gagcggttcc accgcgagtc caacctggga tccatggtgg cgctgaacgc cttcctcagc 11940

acccagcccg ccaacgtgcc ccggggccag gaggactaca ccaacttcat cagcgccctg 12000

cgcctgatgg tgaccgaggt gccccagagc gaggtgtacc agtccgggcc ggactacttc 12060

ttccagacca gtcgccaggg cttgcagacc gtgaacctga gccaggcttt caagaacttg 12120

cagggcctgt ggggcgtgca ggccccggtc ggggaccgcg cgacggtgtc gagcctgctg 12180

acgccgaact cgcgcctgct gctgctgctg gtggccccct tcacggacag cggcagcatc 12240

aaccgcaact cgtacctggg ctacctgatt aacctgtacc gcgaggccat cggccaggcg 12300

cacgtggacg agcagaccta ccaggagatc acccacgtga gccgcgccct gggccaggac 12360

gacccgggca acctggaagc caccctgaac tttttgctga ccaaccggtc gcagaagatc 12420

ccgccccagt acgcgctcag caccgaggag gagcgcatcc tgcgttacgt gcagcagagc 12480

gtgggcctgt tcctgatgca ggagggggcc acccccagcg ccgcgctcga catgaccgcg 12540

cgcaacatgg agcccagcat gtacgccagc aaccgcccgt tcatcaataa actgatggac 12600

tacttgcatc gggcggccgc catgaactct gactatttca ccaacgccat cctgaatccc 12660

cactggctcc cgccgccggg gttctacacg ggcgagtacg acatgcccga ccccaatgac 12720

gggttcctgt gggacgatgt ggacagcagc gtgttctccc cccgaccggg tgctaacgag 12780

cgccccttgt ggaagaagga aggcagcgac cgacgcccgt cctcggcgct gtccggccgc 12840

gagggtgctg ccgcggcggt gcccgaggcc gccagtcctt tcccgagctt gcccttctcg 12900

ctgaacagta tccgcagcag cgagctgggc aggatcacgc gcccgcgctt gctgggcgaa 12960

gaggagtact tgaatgactc gctgttgaga cccgagcggg agaagaactt ccccaataac 13020

gggatagaaa gcctggtgga caagatgagc cgctggaaga cgtatgcgca ggagcacagg 13080

gacgatcccc gggcgtcgca gggggccacg agccggggca gcgccgcccg taaacgccgg 13140

tggcacgaca ggcagcgggg acagatgtgg gacgatgagg actccgccga cgacagcagc 13200

gtgttggact tgggtgggag tggtaacccg ttcgctcacc tgcgcccccg tatcgggcgc 13260

atgatgtaag agaaaccgaa aataaatgat actcaccaag gccatggcga ccagcgtgcg 13320

ttcgtttctt ctctgttgtt gttgtatcta gtatgatgag gcgtgcgtac ccggagggtc 13380

ctcctccctc gtacgagagc gtgatgcagc aggcgatggc ggcggcggcg atgcagcccc 13440

cgctggaggc tccttacgtg cccccgcggt acctggcgcc tacggagggg cggaacagca 13500

ttcgttactc ggagctggca cccttgtacg ataccacccg gttgtacctg gtggacaaca 13560

agtcggcgga catcgcctcg ctgaactacc agaacgacca cagcaacttc ctgaccaccg 13620

tggtgcagaa caatgacttc acccccacgg aggccagcac ccagaccatc aactttgacg 13680

agcgctcgcg gtggggcggc cagctgaaaa ccatcatgca caccaacatg cccaacgtga 13740

acgagttcat gtacagcaac aagttcaagg cgcgggtgat ggtctcccgc aagaccccca 13800

atggggtgac agtgacagag gattatgatg gtagtcagga tgagctgaag tatgaatggg 13860

tggaatttga gctgcccgaa ggcaacttct cggtgaccat gaccatcgac ctgatgaaca 13920

acgccatcat cgacaattac ttggcggtgg ggcggcagaa cggggtgctg gagagcgaca 13980

tcggcgtgaa gttcgacact aggaacttca ggctgggctg ggaccccgtg accgagctgg 14040

tcatgcccgg ggtgtacacc aacgaggctt tccatcccga tattgtcttg ctgcccggct 14100

gcggggtgga cttcaccgag agccgcctca gcaacctgct gggcattcgc aagaggcagc 14160

ccttccagga aggcttccag atcatgtacg aggatctgga ggggggcaac atccccgcgc 14220

tcctggatgt cgacgcctat gagaaaagca aggaggatgc agcagctgaa gcaactgcag 14280

ccgtagctac cgcctctacc gaggtcaggg gcgataattt tgcaagcgcc gcagcagtgg 14340

cagcggccga ggcggctgaa accgaaagta agatagtcat tcagccggtg gagaaggata 14400

gcaagaacag gagctacaac gtactaccgg acaagataaa caccgcctac cgcagctggt 14460

acctagccta caactatggc gaccccgaga agggcgtgcg ctcctggacg ctgctcacca 14520

cctcggacgt cacctgcggc gtggagcaag tctactggtc gctgcccgac atgatgcaag 14580

acccggtcac cttccgctcc acgcgtcaag ttagcaacta cccggtggtg ggcgccgagc 14640

tcctgcccgt ctactccaag agcttcttca acgagcaggc cgtctactcg cagcagctgc 14700

gcgccttcac ctcgcttacg cacgtcttca accgcttccc cgagaaccag atcctcgtcc 14760

gcccgcccgc gcccaccatt accaccgtca gtgaaaacgt tcctgctctc acagatcacg 14820

ggaccctgcc gctgcgcagc agtatccggg gagtccagcg cgtgaccgtt actgacgcca 14880

gacgccgcac ctgcccctac gtctacaagg ccctgggcat agtcgcgccg cgcgtcctct 14940

cgagccgcac cttctaaatg tccattctca tctcgcccag taataacacc ggttggggcc 15000

tgcgcgcgcc cagcaagatg tacggaggcg ctcgccaacg ctccacgcaa caccccgtgc 15060

gcgtgcgcgg gcacttccgc gctccctggg gcgccctcaa gggccgcgtg cggtcgcgca 15120

ccaccgtcga cgacgtgatc gaccaggtgg tggccgacgc gcgcaactac acccccgccg 15180

ccgcgcccgt ctccaccgtg gacgccgtca tcgacagcgt ggtggcggac gcgcgccggt 15240

acgcccgcgc caagagccgg cggcggcgca tcgcccggcg gcaccggagc acccccgcca 15300

tgcgcgcggc gcgagccttg ctgcgcaggg ccaggcgcac gggacgcagg gccatgctca 15360

gggcggccag acgcgcggct tcaggcgcca gcgccggcag gacccggaga cgcgcggcca 15420

cggcggcggc agcggccatc gccagcatgt cccgcccgcg gcgagggaac gtgtactggg 15480

tgcgcgacgc cgccaccggt gtgcgcgtgc ccgtgcgcac ccgcccccct cgcacttgaa 15540

gatgttcact tcgcgatgtt gatgtgtccc agcggcgagg aggatgtcca agcgcaaatt 15600

caaggaagag atgctccagg tcatcgcgcc tgagatctac ggccctgcgg tggtgaagga 15660

ggaaagaaag ccccgcaaaa tcaagcgggt caaaaaggac aaaaaggaag aagaaagtga 15720

tgtggacgga ttggtggagt ttgtgcgcga gttcgccccc cggcggcgcg tgcagtggcg 15780

cgggcggaag gtgcaaccgg tgctgagacc cggcaccacc gtggtcttca cgcccggcga 15840

gcgctccggc accgcttcca agcgctccta cgacgaggtg tacggggatg atgatattct 15900

ggagcaggcg gccgagcgcc tgggcgagtt tgcttacggc aagcgcagcc gttccgcacc 15960

gaaggaagag gcggtgtcca tcccgctgga ccacggcaac cccacgccga gcctcaagcc 16020

cgtgaccttg cagcaggtgc tgccgaccgc ggcgccgcgc cgggggttca agcgcgaggg 16080

cgaggatctg taccccacca tgcagctgat ggtgcccaag cgccagaagc tggaagacgt 16140

gctggagacc atgaaggtgg acccggacgt gcagcccgag gtcaaggtgc ggcccatcaa 16200

gcaggtggcc ccgggcctgg gcgtgcagac cgtggacatc aagattccca cggagcccat 16260

ggaaacgcag accgagccca tgatcaagcc cagcaccagc accatggagg tgcagacgga 16320

tccctggatg ccatcggctc ctagtcgaag accccggcgc aagtacggcg cggccagcct 16380

gctgatgccc aactacgcgc tgcatccttc catcatcccc acgccgggct accgcggcac 16440

gcgcttctac cgcggtcata ccagcagccg ccgccgcaag accaccactc gccgccgccg 16500

tcgccgcacc gccgctgcaa ccacccctgc cgccctggtg cggagagtgt accgccgcgg 16560

ccgcgcacct ctgaccctgc cgcgcgcgcg ctaccacccg agcatcgcca tttaaacttt 16620

cgccagcttt gcagatcaat ggccctcaca tgccgccttc gcgttcccat tacgggctac 16680

cgaggaagaa aaccgcgccg tagaaggctg gcggggaacg ggatgcgtcg ccaccaccac 16740

cggcggcggc gcgccatcag caagcggttg gggggaggct tcctgcccgc gctgatcccc 16800

atcatcgccg cggcgatcgg ggcgatcccc ggcattgctt ccgtggcggt gcaggcctct 16860

cagcgccact gagacacact tggaaacatc ttgtaataaa cccatggact ctgacgctcc 16920

tggtcctgtg atgtgttttc gtagacagat ggaagacatc aatttttcgt ccctggctcc 16980

gcgacacggc acgcggccgt tcatgggcac ctggagcgac atcggcacca gccaactgaa 17040

cgggggcgcc ttcaattgga gcagtctctg gagcgggctt aagaatttcg ggtccacgct 17100

taaaacctat ggcagcaagg cgtggaacag caccacaggg caggcgctga gggataagct 17160

gaaagagcag aacttccagc agaaggtggt cgatgggctc gcctcgggca tcaacggggt 17220

ggtggacctg gccaaccagg ccgtgcagcg gcagatcaac agccgcctgg acccggtgcc 17280

gcccgccggc tccgtggaga tgccgcaggt ggaggaggag ctgcctcccc tggacaagcg 17340

gggcgagaag cgaccccgcc ccgatgcgga ggagacgctg ctgacgcaca cggacgagcc 17400

gcccccgtac gaggaggcgg tgaaactggg tctgcccacc acgcggccca tcgcgcccct 17460

ggccaccggg gtgctgaaac ccgaaaagcc cgcgaccctg gacttgcctc ctccccagcc 17520

ttcccgcccc tctacagtgg ctaagcccct gccgccggtg gccgtggccc gcgcgcgacc 17580

cgggggcacc gcccgccctc atgcgaactg gcagagcact ctgaacagca tcgtgggtct 17640

gggagtgcag agtgtgaagc gccgccgctg ctattaaacc taccgtagcg cttaacttgc 17700

ttgtctgtgt gtgtatgtat tatgtcgccg ccgccgctgt ccaccagaag gaggagtgaa 17760

gaggcgcgtc gccgagttgc aagatggcca ccccatcgat gctgccccag tgggcgtaca 17820

tgcacatcgc cggacaggac gcttcggagt acctgagtcc gggtctggtg cagtttgccc 17880

gcgccacaga cacctacttc agtctgggga acaagtttag gaaccccacg gtggcgccca 17940

cgcacgatgt gaccaccgac cgcagccagc ggctgacgct gcgcttcgtg cccgtggacc 18000

gcgaggacaa cacctactcg tacaaagtgc gctacacgct ggccgtgggc gacaaccgcg 18060

tgctggacat ggccagcacc tactttgaca tccgcggcgt gctggatcgg ggccctagct 18120

tcaaacccta ctccggcacc gcctacaaca gtctggcccc caagggagca cccaacactt 18180

gtcagtggac atataaagcc gatggtgaaa ctgccacaga aaaaacctat acatatggaa 18240

atgcacccgt gcagggcatt aacatcacaa aagatggtat tcaacttgga actgacaccg 18300

atgatcagcc aatctacgca gataaaacct atcagcctga acctcaagtg ggtgatgctg 18360

aatggcatga catcactggt actgatgaaa agtatggagg cagagctctt aagcctgata 18420

ccaaaatgaa gccttgttat ggttcttttg ccaagcctac taataaagaa ggaggtcagg 18480

caaatgtgaa aacaggaaca ggcactacta aagaatatga catagacatg gctttctttg 18540

acaacagaag tgcggctgct gctggcctag ctccagaaat tgttttgtat actgaaaatg 18600

tggatttgga aactccagat acccatattg tatacaaagc aggcacagat gacagcagct 18660

cttctattaa tttgggtcag caagccatgc ccaacagacc taactacatt ggtttcagag 18720

acaactttat cgggctcatg tactacaaca gcactggcaa tatgggggtg ctggccggtc 18780

aggcttctca gctgaatgct gtggttgact tgcaagacag aaacaccgag ctgtcctacc 18840

agctcttgct tgactctctg ggtgacagaa cccggtattt cagtatgtgg aatcaggcgg 18900

tggacagcta tgatcctgat gtgcgcatta ttgaaaatca tggtgtggag gatgaacttc 18960

ccaactattg tttccctctg gatgctgttg gcagaacaga tacttatcag ggaattaagg 19020

ctaatggaac tgatcaaacc acatggacca aagatgacag tgtcaatgat gctaatgaga 19080

taggcaaggg taatccattc gccatggaaa tcaacatcca agccaacctg tggaggaact 19140

tcctctacgc caacgtggcc ctgtacctgc ccgactctta caagtacacg ccggccaatg 19200

ttaccctgcc caccaacacc aacacctacg attacatgaa cggccgggtg gtggcgccct 19260

cgctggtgga ctcctacatc aacatcgggg cgcgctggtc gctggatccc atggacaacg 19320

tgaacccctt caaccaccac cgcaatgcgg ggctgcgcta ccgctccatg ctcctgggca 19380

acgggcgcta cgtgcccttc cacatccagg tgccccagaa atttttcgcc atcaagagcc 19440

tcctgctcct gcccgggtcc tacacctacg agtggaactt ccgcaaggac gtcaacatga 19500

tcctgcagag ctccctcggc aacgacctgc gcacggacgg ggcctccatc tccttcacca 19560

gcatcaacct ctacgccacc ttcttcccca tggcgcacaa cacggcctcc acgctcgagg 19620

ccatgctgcg caacgacacc aacgaccagt ccttcaacga ctacctctcg gcggccaaca 19680

tgctctaccc catcccggcc aacgccacca acgtgcccat ctccatcccc tcgcgcaact 19740

gggccgcctt ccgcggctgg tccttcacgc gtctcaagac caaggagacg ccctcgctgg 19800

gctccgggtt cgacccctac ttcgtctact cgggctccat cccctacctc gacggcacct 19860

tctacctcaa ccacaccttc aagaaggtct ccatcacctt cgactcctcc gtcagctggc 19920

ccggcaacga ccggctcctg acgcccaacg agttcgaaat caagcgcacc gtcgacggcg 19980

agggctacaa cgtggcccag tgcaacatga ccaaggactg gttcctggtc cagatgctgg 20040

cccactacaa catcggctac cagggcttct acgtgcccga gggctacaag gaccgcatgt 20100

actccttctt ccgcaacttc cagcccatga gccgccaggt ggtggacgag gtcaactaca 20160

aggactacca ggccgtcacc ctggcctacc agcacaacaa ctcgggcttc gtcggctacc 20220

tcgcgcccac catgcgccag ggccagccct accccgccaa ctacccctac ccgctcatcg 20280

gcaagagcgc cgtcaccagc gtcacccaga aaaagttcct ctgcgacagg gtcatgtggc 20340

gcatcccctt ctccagcaac ttcatgtcca tgggcgcgct caccgacctc ggccagaaca 20400

tgctctatgc caactccgcc cacgcgctag acatgaattt cgaagtcgac cccatggatg 20460

agtccaccct tctctatgtt gtcttcgaag tcttcgacgt cgtccgagtg caccagcccc 20520

accgcggcgt catcgaggcc gtctacctgc gcaccccctt ctcggccggt aacgccacca 20580

cctaagctct tgcttcttgc aagccatggc cgcgggctcc ggcgagcagg agctcagggc 20640

catcatccgc gacctgggct gcgggcccta cttcctgggc accttcgata agcgcttccc 20700

gggattcatg gccccgcaca agctggcctg cgccatcgtc aacacggccg gccgcgagac 20760

cgggggcgag cactggctgg ccttcgcctg gaacccgcgc tcgaacacct gctacctctt 20820

cgaccccttc gggttctcgg acgagcgcct caagcagatc taccagttcg agtacgaggg 20880

cctgctgcgc cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg aaaagtccac 20940

ccagaccgtg cagggtccgc gctcggccgc ctgcgggctc ttctgctgca tgttcctgca 21000

cgccttcgtg cactggcccg accgccccat ggacaagaac cccaccatga acttgctgac 21060

gggggtgccc aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca 21120

ggaggcgctc taccgcttcc tcaactccca ctccgcctac tttcgctccc accgcgcgcg 21180

catcgagaag gccaccgcct tcgaccgcat gaatcaagac atgtaaaccg tgtgtgtatg 21240

ttaaatgtct ttaataaaca gcactttcat gttacacatg catctgagat gatttattta 21300

gaaatcgaaa gggttctgcc gggtctcggc atggcccgcg ggcagggaca cgttgcggaa 21360

ctggtacttg gccagccact tgaactcggg gatcagcagt ttgggcagcg gggtgtcggg 21420

gaaggagtcg gtccacagct tccgcgtcag ttgcagggcg cccagcaggt cgggcgcgga 21480

gatcttgaaa tcgcagttgg gacccgcgtt ctgcgcgcgg gagttgcggt acacggggtt 21540

gcagcactgg aacaccatca gggccgggtg cttcacgctc gccagcaccg tcgcgtcggt 21600

gatgctctcc acgtcgaggt cctcggcgtt ggccatcccg aagggggtca tcttgcaggt 21660

ctgccttccc atggtgggca cgcacccggg cttgtggttg caatcgcagt gcagggggat 21720

cagcatcatc tgggcctggt cggcgttcat ccccgggtac atggccttca tgaaagcctc 21780

caattgcctg aacgcctgct gggccttggc tccctcggtg aagaagaccc cgcaggactt 21840

gctagagaac tggttggtgg cgcacccggc gtcgtgcacg cagcagcgcg cgtcgttgtt 21900

ggccagctgc accacgctgc gcccccagcg gttctgggtg atcttggccc ggtcggggtt 21960

ctccttcagc gcgcgctgcc cgttctcgct cgccacatcc atctcgatca tgtgctcctt 22020

ctggatcatg gtggtcccgt gcaggcaccg cagcttgccc tcggcctcgg tgcacccgtg 22080

cagccacagc gcgcacccgg tgcactccca gttcttgtgg gcgatctggg aatgcgcgtg 22140

cacgaagccc tgcaggaagc ggcccatcat ggtggtcagg gtcttgttgc tagtgaaggt 22200

cagcggaatg ccgcggtgct cctcgttgat gtacaggtgg cagatgcggc ggtacacctc 22260

gccctgctcg ggcatcagct ggaagttggc tttcaggtcg gtctccacgc ggtagcggtc 22320

catcagcata gtcatgattt ccataccctt ctcccaggcc gagacgatgg gcaggctcat 22380

agggttcttc accatcatct tagcgctagc agccgcggcc agggggtcgc tctcgtccag 22440

ggtctcaaag ctccgcttgc cgtccttctc ggtgatccgc accggggggt agctgaagcc 22500

cacggccgcc agctcctcct cggcctgtct ttcgtcctcg ctgtcctggc tgacgtcctg 22560

caggaccaca tgcttggtct tgcggggttt cttcttgggc ggcagcggcg gcggagatgt 22620

tggagatggc gagggggagc gcgagttctc gctcaccact actatctctt cctcttcttg 22680

gtccgaggcc acgcggcggt aggtatgtct cttcgggggc agaggcggag gcgacgggct 22740

ctcgccgccg cgacttggcg gatggctggc agagcccctt ccgcgttcgg gggtgcgctc 22800

ccggcggcgc tctgactgac ttcctccgcg gccggccatt gtgttctcct agggaggaac 22860

aacaagcatg gagactcagc catcgccaac ctcgccatct gcccccaccg ccgacgagaa 22920

gcagcagcag cagaatgaaa gcttaaccgc cccgccgccc agccccgcca cctccgacgc 22980

ggccgtccca gacatgcaag agatggagga atccatcgag attgacctgg gctatgtgac 23040

gcccgcggag cacgaggagg agctggcagt gcgcttttca caagaagaga tacaccaaga 23100

acagccagag caggaagcag agaatgagca gagtcaggct gggctcgagc atgacggcga 23160

ctacctccac ctgagcgggg gggaggacgc gctcatcaag catctggccc ggcaggccac 23220

catcgtcaag gatgcgctgc tcgaccgcac cgaggtgccc ctcagcgtgg aggagctcag 23280

ccgcgcctac gagttgaacc tcttctcgcc gcgcgtgccc cccaagcgcc agcccaatgg 23340

cacctgcgag cccaacccgc gcctcaactt ctacccggtc ttcgcggtgc ccgaggccct 23400

ggccacctac cacatctttt tcaagaacca aaagatcccc gtctcctgcc gcgccaaccg 23460

cacccgcgcc gacgcccttt tcaacctggg tcccggcgcc cgcctacctg atatcgcctc 23520

cttggaagag gttcccaaga tcttcgaggg tctgggcagc gacgagactc gggccgcgaa 23580

cgctctgcaa ggagaaggag gagagcatga gcaccacagc gccctggtcg agttggaagg 23640

cgacaacgcg cggctggcgg tgctcaaacg cacggtcgag ctgacccatt tcgcctaccc 23700

ggctctgaac ctgcccccca aagtcatgag cgcggtcatg gaccaggtgc tcatcaagcg 23760

cgcgtcgccc atctccgagg acgagggcat gcaagactcc gaggagggca agcccgtggt 23820

cagcgacgag cagctggccc ggtggctggg tcctaatgct agtccccaga gtttggaaga 23880

gcggcgcaaa ctcatgatgg ccgtggtcct ggtgaccgtg gagctggagt gcctgcgccg 23940

cttcttcgcc gacgcggaga ccctgcgcaa ggtcgaggag aacctgcact acctcttcag 24000

gcacgggttc gtgcgccagg cctgcaagat ctccaacgtg gagctgacca acctggtctc 24060

ctacatgggc atcttgcacg agaaccgcct ggggcagaac gtgctgcaca ccaccctgcg 24120

cggggaggcc cggcgcgact acatccgcga ctgcgtctac ctctacctct gccacacctg 24180

gcagacgggc atgggcgtgt ggcagcagtg tctggaggag cagaacctga aagagctctg 24240

caagctcctg cagaagaacc tcaagggtct gtggaccggg ttcgacgagc gcaccaccgc 24300

ctcggacctg gccgacctca ttttccccga gcgcctcagg ctgacgctgc gcaacggcct 24360

gcccgacttt atgagccaaa gcatgttgca aaactttcgc tctttcatcc tcgaacgctc 24420

cggaatcctg cccgccacct gctccgcgct gccctcggac ttcgtgccgc tgaccttccg 24480

cgagtgcccc ccgccgctgt ggagccactg ctacctgctg cgcctggcca actacctggc 24540

ctaccactcg gacgtgatcg aggacgtcag cggcgagggc ctgctcgagt gccactgccg 24600

ctgcaacctc tgcacgccgc accgctccct ggcctgcaac ccccagctgc tgagcgagac 24660

ccagatcatc ggcaccttcg agttgcaagg gcccagcgaa ggcgagggtt cagccgccaa 24720

ggggggtctg aaactcaccc cggggctgtg gacctcggcc tacttgcgca agttcgtgcc 24780

cgaggactac catcccttcg agatcaggtt ctacgaggac caatcccatc cgcccaaggc 24840

cgagctgtcg gcctgcgtca tcacccaggg ggcgatcctg gcccaattgc aagccatcca 24900

gaaatcccgc caagaattct tgctgaaaaa gggccgcggg gtctacctcg acccccagac 24960

cggtgaggag ctcaaccccg gcttccccca ggatgccccg aggaaacaag aagctgaaag 25020

tggagctgcc gcccgtggag gatttggagg aagactggga gaacagcagt caggcagagg 25080

aggaggagat ggaggaagac tgggacagca ctcaggcaga ggaggacagc ctgcaagaca 25140

gtctggagga agacgaggag gaggcagagg aggaggtgga agaagcagcc gccgccagac 25200

cgtcgtcctc ggcgggggag aaagcaagca gcacggatac catctccgct ccgggtcggg 25260

gtcccgctcg accacacagt agatgggacg agaccggacg attcccgaac cccaccaccc 25320

agaccggtaa gaaggagcgg cagggataca agtcctggcg ggggcacaaa aacgccatcg 25380

tctcctgctt gcaggcctgc gggggcaaca tctccttcac ccggcgctac ctgctcttcc 25440

accgcggggt gaactttccc cgcaacatct tgcattacta ccgtcacctc cacagcccct 25500

actacttcca agaagaggca gcagcagcag aaaaagacca gcagaaaacc agcagctaga 25560

aaatccacag cggcggcagc aggtggactg aggatcgcgg cgaacgagcc ggcgcaaacc 25620

cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca gagtcggggg 25680

caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg cagttgtctg 25740

tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc tctcttcaac 25800

aagtactgcg cgctcactct taaagagtag cccgcgcccg cccagtcgca gaaaaaggcg 25860

ggaattacgt cacctgtgcc cttcgcccta gccgcctcca cccatcatca tgagcaaaga 25920

gattcccacg ccttacatgt ggagctacca gccccagatg ggcctggccg ccggtgccgc 25980

ccaggactac tccacccgca tgaattggct cagcgccggg cccgcgatga tctcacgggt 26040

gaatgacatc cgcgcccacc gaaaccagat actcctagaa cagtcagcgc tcaccgccac 26100

gccccgcaat cacctcaatc cgcgtaattg gcccgccgcc ctggtgtacc aggaaattcc 26160

ccagcccacg accgtactac ttccgcgaga cgcccaggcc gaagtccagc tgactaactc 26220

aggtgtccag ctggcgggcg gcgccaccct gtgtcgtcac cgccccgctc agggtataaa 26280

gcggctggtg atccggggca gaggcacaca gctcaacgac gaggtggtga gctcttcgct 26340

gggtctgcga cctgacggag tcttccaact cgccggatcg gggagatctt ccttcacgcc 26400

tcgtcaggcc gtcctgactt tggagagttc gtcctcgcag ccccgctcgg gtggcatcgg 26460

cactctccag ttcgtggagg agttcactcc ctcggtctac ttcaacccct tctccggctc 26520

ccccggccac tacccggacg agttcatccc gaacttcgac gccatcagcg agtcggtgga 26580

cggctacgat tgaatgtccc atggtggcgc agctgaccta gctcggcttc gacacctgga 26640

ccactgccgc cgcttccgct gcttcgctcg ggatctcgcc gagtttgcct actttgagct 26700

gcccgaggag caccctcagg gcccggccca cggagtgcgg atcgtcgtcg aagggggcct 26760

cgactcccac ctgcttcgga tcttcagcca gcgtccgatc ctggtcgagc gcgagcaagg 26820

acagaccctt ctgactctgt actgcatctg caaccacccc ggcctgcatg aaagtctttg 26880

ttgtctgctg tgtactgagt ataataaaag ctgagatcag cgactactcc ggacttccgt 26940

gtgttcctga atccatcaac cagtctttgt tcttcaccgg gaacgagacc gagctccagc 27000

tccagtgtaa gccccacaag aagtacctca cctggctgtt ccagggctcc ccgatcgccg 27060

ttgtcaacca ctgcgacaac gactatttaa atccacaata catgcccata ttagactatg 27120

aggccgagcc acagcgaccc atgctccccg ctattagtta cttcaatcta accggcggag 27180

atgactgacc cactggccaa caacaacgtc aacgaccttc tcctggacat ggacggccgc 27240

gcctcggagc agcgactcgc ccaacttcgc attcgccagc agcaggagag agccgtcaag 27300

gagctgcagg atgcggtggc catccaccag tgcaagagag gcatcttctg cctggtgaaa 27360

caggccaaga tctcctacga ggtcactcca aacgaccatc gcctctccta cgagctcctg 27420

cagcagcgcc agaagttcac ctgcctggtc ggagtcaacc ccatcgtcat cacccagcag 27480

tctggcgata ccaaggggtg catccactgc tcctgcgact cccccgactg cgtccacact 27540

ctgatcaaga ccctctgcgg cctccgcgac ctcctcccca tgaactaatc acccccttat 27600

ccagtgaaat aaagatcata ttgatgatga ttttacagaa ataaaaaata atcatttgat 27660

ttgaaataaa gatacaatca tattgatgat ttgagtttaa caaaaaaata aagaatcact 27720

tacttgaaat ctgataccag gtctctgtcc atgttttctg ccaacaccac ttcactcccc 27780

tcttcccagc tctggtactg caggccccgg cgggctgcaa acttcctcca cacgctgaag 27840

gggatgtcaa attcctcctg tccctcaatc ttcattttat cttctatcag atgtccaaaa 27900

agcgcgtccg ggtggatgat gacttcgacc ccgtctaccc ctacgatgca gacaacgcac 27960

cgaccgtgcc cttcatcaac ccccccttcg tctcttcaga tggattccaa gagaagcccc 28020

tgggggtgtt gtccctgcga ctggccgacc ccgtcaccac caagaacggg gaaatcaccc 28080

tcaagctggg agagggggtg gacctcgatt cctcgggaaa actcatctcc aacacggcca 28140

ccaaggccgc cgcccctctc agtttttcca acaacaccat ttcccttaac atggatcacc 28200

ccttttacac taaagatgga aaattatcct tacaagtttc tccaccatta aatatactga 28260

gaacaagcat tctaaacaca ctagctttag gttttggatc aggtttagga ctccgtggct 28320

ctgccttggc agtacagtta gtctctccac ttacatttga tactgatgga aacataaagc 28380

ttaccttaga cagaggtttg catgttacaa caggagatgc aattgaaagc aacataagct 28440

gggctaaagg tttaaaattt gaagatggag ccatagcaac caacattgga aatgggttag 28500

agtttggaag cagtagtaca gaaacaggtg ttgatgatgc ttacccaatc caagttaaac 28560

ttggatctgg ccttagcttt gacagtacag gagccataat ggctggtaac aaagaagacg 28620

ataaactcac tttgtggaca acacctgatc catcaccaaa ctgtcaaata ctcgcagaaa 28680

atgatgcaaa actaacactt tgcttgacta aatgtggtag tcaaatactg gccactgtgt 28740

cagtcttagt tgtaggaagt ggaaacctaa accccattac tggcaccgta agcagtgctc 28800

aggtgtttct acgttttgat gcaaacggtg ttcttttaac agaacattct acactaaaaa 28860

aatactgggg gtataggcag ggagatagca tagatggcac tccatatacc aatgctgtag 28920

gattcatgcc caatttaaaa gcttatccaa agtcacaaag ttctactact aaaaataata 28980

tagtagggca agtatacatg aatggagatg tttcaaaacc tatgcttctc actataaccc 29040

tcaatggtac tgatgacagc aacagtacat attcaatgtc attttcatac acctggacta 29100

atggaagcta tgttggagca acatttgggg ctaactctta taccttctca tacatcgccc 29160

aagaatgaac actgtatccc accctgcatg ccaacccttc ccaccccact ctgtggaaca 29220

aactctgaaa cacaaaataa aataaagttc aagtgtttta ttgattcaac agttaactta 29280

caggattcga gcagttattt ttcctccacc ctcccaggac atggaataca ccaccctctc 29340

cccccgcaca gccttgaaca tctgaatgcc attggtgatg gacatgcttt tggtctccac 29400

gttccacaca gtttcagagc gagccagtct cgggtcggtc agggagatga aaccctccgg 29460

gcactcccgc atctgcacct cacagctcaa cagctgagga ttgtcctcgg tggtcgggat 29520

cacggttatc tggaagaagc agaagagcgg cggtgggaat catagtccgc gaacgggatc 29580

ggccggtggt gtcgcatcag gccccgcagc agtcgctgcc gccgccgctc cgtcaagctg 29640

ctgctcaggg ggtccgggtc cagggactcc ctcagcatga tgcccacggc cctcagcatc 29700

agtcgtctgg tgcggcgggc gcagcagcgc atgcggatct cgctcaggtc gctgcagtac 29760

gtgcaacaca gaaccaccag gttgttcaac agtccatagt tcaacacgct ccagccgaaa 29820

ctcatcgcgg gaaggatgct acccacgtgg ccgtcgtacc agatcctcag gtaaatcaag 29880

tggtgccccc tccagaacac gctgcccacg tacatgatct ccttgggcat gtggcggttc 29940

accacctccc ggtaccacat caccctctgg ttgaacatgc agccccggat gatcctgcgg 30000

aaccacaggg ccagcaccgc cccgcccgcc atgcagcgaa gagaccccgg gtcccggcaa 30060

tggcaatgga ggacccaccg ctcgtacccg tggatcatct gggagctgaa caagtctatg 30120

ttggcacagc acaggcatat gctcatgcat ctcttcagca ctctcaactc ctcgggggtc 30180

aaaaccatat cccagggcac ggggaactct tgcaggacag cgaaccccgc agaacagggc 30240

aatcctcgca cagaacttac attgtgcatg gacagggtat cgcaatcagg cagcaccggg 30300

tgatcctcca ccagagaagc gcgggtctcg gtctcctcac agcgtggtaa gggggccggc 30360

cgatacgggt gatggcggga cgcggctgat cgtgttcgcg accgtgtcat gatgcagttg 30420

ctttcggaca ttttcgtact tgctgtagca gaacctggtc cgggcgctgc acaccgatcg 30480

ccggcggcgg tctcggcgct tggaacgctc ggtgttgaaa ttgtaaaaca gccactctct 30540

cagaccgtgc agcagatcta gggcctcagg agtgatgaag atcccatcat gcctgatggc 30600

tctgatcaca tcgaccaccg tggaatgggc cagacccagc cagatgatgc aattttgttg 30660

ggtttcggtg acggcggggg agggaagaac aggaagaacc atgattaact tttaatccaa 30720

acggtctcgg agtacttcaa aatgaagatc gcggagatgg cacctctcgc ccccgctgtg 30780

ttggtggaaa ataacagcca ggtcaaaggt gatacggttc tcgagatgtt ccacggtggc 30840

ttccagcaaa gcctccacgc gcacatccag aaacaagaca atagcgaaag cgggagggtt 30900

ctctaattcc tcaatcatca tgttacactc ctgcaccatc cccagataat tttcattttt 30960

ccagccttga atgattcgaa ctagttcgtg aggtaaatcc aagccagcca tgataaagag 31020

ctcgcgcaga gcgccctcca ccggcattct taagcacacc ctcataattc caagatattc 31080

tgctcctggt tcacctgcag cagattgaca agcggaatat caaaatctct gccgcgatcc 31140

ctgagctcct ccctcagcaa taactgtaag tactctttca tatcctctcc gaaattttta 31200

gccataggac caccaggaat aagattaggg caagccacag tacagataaa ccgaagtcct 31260

ccccagtgag cattgccaaa tgcaagactg ctataagcat gctggctaga cccggtgata 31320

tcttccagat aactggacag aaaatcgccc aggcaatttt taagaaaatc aacaaaagaa 31380

aaatcctcca ggtggacgtt tagagcctcg ggaacaacga tgaagtaaat gcaagcggtg 31440

cgttccagca tggttagtta gctgatctgt agaaaaaaca aaaatgaaca ttaaaccatg 31500

ctagcctggc gaacaggtgg gtaaatcgtt ctctccagca ccaggcaggc cacggggtct 31560

ccggcgcgac cctcgtaaaa attgtcgcta tgattgaaaa ccatcacaga gagacgttcc 31620

cggtggccgg cgtgaatgat tcgacaagat gaatacaccc ccggaacatt ggcgtccgcg 31680

agtgaaaaaa agcgcccgag gaagcaataa ggcactacaa tgctcagtct caagtccagc 31740

aaagcgatgc catgcggatg aagcacaaaa ttctcaggtg cgtacaaaat gtaattactc 31800

ccctcctgca caggcagcaa agcccccgat ccctccaggt acacatacaa agcctcagcg 31860

tccatgttaa cagcttaccg agcagcagca cacaacaggc gcaagagtca gagaaaggct 31920

gagctctaac ctgtccaccc gctctctgct caatatatag cccagatcta cactgacgta 31980

aaggccaaag tctaaaaata cccgccaaat aatcacacac gcccagcaca cgcccagaaa 32040

ccggtgacac actcaaaaaa atacgcgcac ttcctcaaac gcccaaaact gccgtcattt 32100

ccgggttccc acgctacgtc atcaaaacac gactttcaaa ttccgtcgac cgttaaaaac 32160

gtcacccgcc ccgcccctaa cggtcgcccg tctctcagcc aatcagcgcc ccgcatcccc 32220

aaattcaaac acctcatttg catattaacg cgcacaaaaa gtttgaggta tattattgat 32280

gatggttaat taagaattca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 32340

gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 32400

aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggcgcc 32460

tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc 32520

tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg 32580

ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg 32640

tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgagacgaa 32700

agggcctcgt gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga 32760

cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa 32820

tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt 32880

gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg 32940

cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag 33000

atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg 33060

agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg 33120

gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt 33180

ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga 33240

cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac 33300

ttctgacaac gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc 33360

atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc 33420

gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac 33480

tacttactct agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag 33540

gaccacttct gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg 33600

gtgagcgtgg gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta 33660

tcgtagttat ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg 33720

ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata 33780

tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt 33840

ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc 33900

ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct 33960

tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa 34020

ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact gttcttctag 34080

tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc 34140

tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg 34200

actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca 34260

cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag cgtgagcttt 34320

gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg 34380

tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc 34440

ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc 34500

ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc 34560

cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg 34620

cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga 34680

gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc 34740

attaatgcag ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa 34800

ttaatgtgag ttagctcact cattaggcac cccaggcttt acactttatg cttccggctc 34860

gtatgttgtg tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg 34920

attacgccaa gcttgcatgc ctgcaggttt aaacttaatt aaccatcttc aataatatac 34980

ctcaaacttt ttgtgcgcgt taatatgcaa atgaggcgtt tgaatttggg gaggaagggc 35040

ggtgattg 35048

<210> 2

<211> 1426

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<220>

<221> misc_feature

<222> (1)..(1426)

<223> CMV-EGFP nucleotide sequence

<400> 2

cgcgttgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc 60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac 120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa 180

tagggacttt ccattgacgt caatgggtgg actatttacg gtaaactgcc cacttggcag 240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc 300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct 360

acgtattagt catcgctatt accatggtga tgcggttttg gcagtacatc aatgggcgtg 420

gatagcggtt tgactcacgg ggatttccaa gtctccaccc cattgacgtc aatgggagtt 480

tgttttggca ccaaaatcaa cgggactttc caaaatgtcg taacaactcc gccccattga 540

cgcaaatggg cggtaggcgt gtacggtggg aggtctatat aagcagagct ctctggctaa 600

ctagagaacc cactgcttac tggcttatcg aaattaatac gactcactat agggagaccc 660

aagctggcta gcgtttaaac gggcccgccc gggcttataa gccaccatgg tgagcaaggg 720

cgaggagctg ttcaccgggg tggtgcccat cctggtcgag ctggacggcg acgtaaacgg 780

ccacaagttc agcgtgtccg gcgagggcga gggcgatgcc acctacggca agctgaccct 840

gaagttcatc tgcaccaccg gcaagctgcc cgtgccctgg cccaccctcg tgaccaccct 900

gacctacggc gtgcagtgct tcagccgcta ccccgaccac atgaagcagc acgacttctt 960

caagtccgcc atgcccgaag gctacgtcca ggagcgcacc atcttcttca aggacgacgg 1020

caactacaag acccgcgccg aggtgaagtt cgagggcgac accctggtga accgcatcga 1080

gctgaagggc atcgacttca aggaggacgg caacatcctg gggcacaagc tggagtacaa 1140

ctacaacagc cacaacgtct atatcatggc cgacaagcag aagaacggca tcaaggtgaa 1200

cttcaagatc cgccacaaca tcgaggacgg cagcgtgcag ctcgccgacc actaccagca 1260

gaacaccccc atcggcgacg gccccgtgct gctgcccgac aaccactacc tgagcaccca 1320

gtccgccctg agcaaagacc ccaacgagaa gcgcgatcac atggtcctgc tggagttcgt 1380

gaccgccgcc gggatcactc tcggcatgga cgagctgtac aagtaa 1426

<210> 3

<211> 2728

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<220>

<221> misc_feature

<222> (1)..(2728)

<223> AdHu5E4 nucleotide sequence

<400> 3

tcacagaacc ctagtattca acctgccacc tccctcccaa cacacagagt acacagtcct 60

ttctccccgg ctggccttaa aaagcatcat atcatgggta acagacatat tcttaggtgt 120

tatattccac acggtttcct gtcgagccaa acgctcatca gtgatattaa taaactcccc 180

gggcagctca cttaagttca tgtcgctgtc cagctgctga gccacaggct gctgtccaac 240

ttgcggttgc ttaacgggcg gcgaaggaga agtccacgcc tacatggggg tagagtcata 300

atcgtgcatc aggatagggc ggtggtgctg cagcagcgcg cgaataaact gctgccgccg 360

ccgctccgtc ctgcaggaat acaacatggc agtggtctcc tcagcgatga ttcgcaccgc 420

ccgcagcata aggcgccttg tcctccgggc acagcagcgc accctgatct cacttaaatc 480

agcacagtaa ctgcagcaca gcaccacaat attgttcaaa atcccacagt gcaaggcgct 540

gtatccaaag ctcatggcgg ggaccacaga acccacgtgg ccatcatacc acaagcgcag 600

gtagattaag tggcgacccc tcataaacac gctggacata aacattacct cttttggcat 660

gttgtaattc accacctccc ggtaccatat aaacctctga ttaaacatgg cgccatccac 720

caccatccta aaccagctgg ccaaaacctg cccgccggct atacactgca gggaaccggg 780

actggaacaa tgacagtgga gagcccagga ctcgtaacca tggatcatca tgctcgtcat 840

gatatcaatg ttggcacaac acaggcacac gtgcatacac ttcctcagga ttacaagctc 900

ctcccgcgtt agaaccatat cccagggaac aacccattcc tgaatcagcg taaatcccac 960

actgcaggga agacctcgca cgtaactcac gttgtgcatt gtcaaagtgt tacattcggg 1020

cagcagcgga tgatcctcca gtatggtagc gcgggtttct gtctcaaaag gaggtagacg 1080

atccctactg tacggagtgc gccgagacaa ccgagatcgt gttggtcgta gtgtcatgcc 1140

aaatggaacg ccggacgtag tcatatttcc tgaagcaaaa ccaggtgcgg gcgtgacaaa 1200

cagatctgcg tctccggtct cgccgcttag atcgctctgt gtagtagttg tagtatatcc 1260

actctctcaa agcatccagg cgccccctgg cttcgggttc tatgtaaact ccttcatgcg 1320

ccgctgccct gataacatcc accaccgcag aataagccac acccagccaa cctacacatt 1380

cgttctgcga gtcacacacg ggaggagcgg gaagagctgg aagaaccatg tttttttttt 1440

tattccaaaa gattatccaa aacctcaaaa tgaagatcta ttaagtgaac gcgctcccct 1500

ccggtggcgt ggtcaaactc tacagccaaa gaacagataa tggcatttgt aagatgttgc 1560

acaatggctt ccaaaaggca aacggccctc acgtccaagt ggacgtaaag gctaaaccct 1620

tcagggtgaa tctcctctat aaacattcca gcaccttcaa ccatgcccaa ataattctca 1680

tctcgccacc ttctcaatat atctctaagc aaatcccgaa tattaagtcc ggccattgta 1740

aaaatctgct ccagagcgcc ctccaccttc agcctcaagc agcgaatcat gattgcaaaa 1800

attcaggttc ctcacagacc tgtataagat tcaaaagcgg aacattaaca aaaataccgc 1860

gatcccgtag gtcccttcgc agggccagct gaacataatc gtgcaggtct gcacggacca 1920

gcgcggccac ttccccgcca ggaaccatga caaaagaacc cacactgatt atgacacgca 1980

tactcggagc tatgctaacc agcgtagccc cgatgtaagc ttgttgcatg ggcggcgata 2040

taaaatgcaa ggtgctgctc aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat 2100

cgtagtcatg ctcatgcaga taaaggcagg taagctccgg aaccaccaca gaaaaagaca 2160

ccatttttct ctcaaacatg tctgcgggtt tctgcataaa cacaaaataa aataacaaaa 2220

aaacatttaa acattagaag cctgtcttac aacaggaaaa acaaccctta taagcataag 2280

acggactacg gccatgccgg cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac 2340

caccgacagc tcctcggtca tgtccggagt cataatgtaa gactcggtaa acacatcagg 2400

ttgattcaca tcggtcagtg ctaaaaagcg accgaaatag cccgggggaa tacatacccg 2460

caggcgtaga gacaacatta cagcccccat aggaggtata acaaaattaa taggagagaa 2520

aaacacataa acacctgaaa aaccctcctg cctaggcaaa atagcaccct cccgctccag 2580

aacaacatac agcgcttcca cagcggcagc cataacagtc agccttacca gtaaaaaaga 2640

aaacctatta aaaaaacacc actcgacacg gcaccagctc aatcagtcac agtgtaaaaa 2700

agggccaagt gcagagcgag tatatata 2728

<210> 4

<211> 1164

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<220>

<221> misc_feature

<222> (1)..(1164)

<223> AdHu5E 4ORF 6/7 nucleotide sequence

<400> 4

tcacagaacc ctagtattca acctgccacc tccctcccaa cacacagagt acacagtcct 60

ttctccccgg ctggccttaa aaagcatcat atcatgggta acagacatat tcttaggtgt 120

tatattccac acggtttcct gtcgagccaa acgctcatca gtgatattaa taaactcccc 180

gggcagctca cttaagttca tgtcgctgtc cagctgctga gccacaggct gctgtccaac 240

ttgcggttgc ttaacgggcg gcgaaggaga agtccacgcc tacatggggg tagagtcata 300

atcgtgcatc aggatagggc ggtggtgctg cagcagcgcg cgaataaact gctgccgccg 360

ccgctccgtc ctgcaggaat acaacatggc agtggtctcc tcagcgatga ttcgcaccgc 420

ccgcagcata aggcgccttg tcctccgggc acagcagcgc accctgatct cacttaaatc 480

agcacagtaa ctgcagcaca gcaccacaat attgttcaaa atcccacagt gcaaggcgct 540

gtatccaaag ctcatggcgg ggaccacaga acccacgtgg ccatcatacc acaagcgcag 600

gtagattaag tggcgacccc tcataaacac gctggacata aacattacct cttttggcat 660

gttgtaattc accacctccc ggtaccatat aaacctctga ttaaacatgg cgccatccac 720

caccatccta aaccagctgg ccaaaacctg cccgccggct atacactgca gggaaccggg 780

actggaacaa tgacagtgga gagcccagga ctcgtaacca tggatcatca tgctcgtcat 840

gatatcaatg ttggcacaac acaggcacac gtgcatacac ttcctcagga ttacaagctc 900

ctcccgcgtt agaaccatat cccagggaac aacccattcc tgaatcagcg taaatcccac 960

actgcaggga agacctcgca cgtaactcac gttgtgcatt gtcaaagtgt tacattcggg 1020

cagcagcgga tgatcctcca gtatggtagc gcgggtttct gtctcaaaag gaggtagacg 1080

atccctactg tacggagtgc gccgagacaa ccgagatcgt gttggtcgta gtgtcatgcc 1140

aaatggaacg ccggacgtag tcat 1164

<210> 5

<211> 1511

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<220>

<221> misc_feature

<222> (1)..(1511)

<223> AdHu5E 4ORF 3-6 nucleotide sequence

<400> 5

ctacatgggg gtagagtcat aatcgtgcat caggataggg cggtggtgct gcagcagcgc 60

gcgaataaac tgctgccgcc gccgctccgt cctgcaggaa tacaacatgg cagtggtctc 120

ctcagcgatg attcgcaccg cccgcagcat aaggcgcctt gtcctccggg cacagcagcg 180

caccctgatc tcacttaaat cagcacagta actgcagcac agcaccacaa tattgttcaa 240

aatcccacag tgcaaggcgc tgtatccaaa gctcatggcg gggaccacag aacccacgtg 300

gccatcatac cacaagcgca ggtagattaa gtggcgaccc ctcataaaca cgctggacat 360

aaacattacc tcttttggca tgttgtaatt caccacctcc cggtaccata taaacctctg 420

attaaacatg gcgccatcca ccaccatcct aaaccagctg gccaaaacct gcccgccggc 480

tatacactgc agggaaccgg gactggaaca atgacagtgg agagcccagg actcgtaacc 540

atggatcatc atgctcgtca tgatatcaat gttggcacaa cacaggcaca cgtgcataca 600

cttcctcagg attacaagct cctcccgcgt tagaaccata tcccagggaa caacccattc 660

ctgaatcagc gtaaatccca cactgcaggg aagacctcgc acgtaactca cgttgtgcat 720

tgtcaaagtg ttacattcgg gcagcagcgg atgatcctcc agtatggtag cgcgggtttc 780

tgtctcaaaa ggaggtagac gatccctact gtacggagtg cgccgagaca accgagatcg 840

tgttggtcgt agtgtcatgc caaatggaac gccggacgta gtcatatttc ctgaagcaaa 900

accaggtgcg ggcgtgacaa acagatctgc gtctccggtc tcgccgctta gatcgctctg 960

tgtagtagtt gtagtatatc cactctctca aagcatccag gcgccccctg gcttcgggtt 1020

ctatgtaaac tccttcatgc gccgctgccc tgataacatc caccaccgca gaataagcca 1080

cacccagcca acctacacat tcgttctgcg agtcacacac gggaggagcg ggaagagctg 1140

gaagaaccat gttttttttt ttattccaaa agattatcca aaacctcaaa atgaagatct 1200

attaagtgaa cgcgctcccc tccggtggcg tggtcaaact ctacagccaa agaacagata 1260

atggcatttg taagatgttg cacaatggct tccaaaaggc aaacggccct cacgtccaag 1320

tggacgtaaa ggctaaaccc ttcagggtga atctcctcta taaacattcc agcaccttca 1380

accatgccca aataattctc atctcgccac cttctcaata tatctctaag caaatcccga 1440

atattaagtc cggccattgt aaaaatctgc tccagagcgc cctccacctt cagcctcaag 1500

cagcgaatca t 1511

36页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种高效装载特异性miRNA的功能性外泌体构建方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!