The Ebola Virus Genome and Proteome


The Ebola virus is a single-stranded, negative-sense RNA virus with a mini-genome. One strain, the Zaire Ebola virus, is responsible for the recent Ebola outbreak in West Africa. Ebola viruses belong to the family Filoviridae; together with the Paramyxoviridae, the Rhabdoviridae, and Borna disease virus, the Filoviridae belong to the taxonomic order Mononegavirales. The Ebola virus genome, like all Filoviridae genomes, is similar to the Marburg virus genome. Filoviridae can cause hemorrhagic fever of varying severity in humans as well as in non-human primates.


Mononegavirales is the term used for "nonsegmented negative-strand RNA viruses" (NNSV). These are enveloped viruses that have mini-genomes consisting of a single RNA molecule of negative, or anti-mRNA, sense. Hence the Ebola virus genome, like all filovirus genomes, is considered to be a mini-genome. A synthetic Ebola virus genome without its proteins appears to be non-infectious.

Nucleic acids isolated from negative-strand RNA viruses or from virus-infected cells cannot initiate an infection cycle when introduced into a host cell. This criterion was used to distinguish “positive”- from “negative”-strand RNA viruses. The viral genome must first be transcribed to produce mRNAs, so the purified virion RNA is not infectious by itself. For the virus to be infective, a viral RNA polymerase must therefore be part of the viral particle (virion) and enter the cell together with the genome.
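The relationship between the negative-sense genome and the mRNA transcribed from it can be sketched in a few lines of Python. This is only an illustration of strand complementarity, not a model of the viral polymerase; the 12-nt fragment and the function name `to_mrna_sense` are hypothetical and are not taken from any real genome or library.

```python
# Complement table for RNA bases.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def to_mrna_sense(negative_rna: str) -> str:
    """Return the positive-sense strand, written 5'->3', for a
    negative-sense RNA that is also written 5'->3'."""
    # Complement every base, then reverse so the result reads 5'->3'.
    return negative_rna.upper().translate(_COMPLEMENT)[::-1]


# Hypothetical negative-sense fragment (illustration only).
negative_strand = "CAUUUCGAGCAU"
positive_strand = to_mrna_sense(negative_strand)
print(positive_strand)  # AUGCUCGAAAUG
```

Only the positive-sense strand contains a readable start codon (AUG) in this toy example, which mirrors why the virion RNA cannot be translated directly by the host cell.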

The use of non-infectious synthetic viral RNA allows for the design of PCR primers or probes as well as peptides and recombinant proteins for molecular diagnostics. Similarly, these molecules may lend themselves to the design and production of vaccines against the virus.
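As a rough sketch of what a first screening step in primer design might look like, the following Python snippet computes the GC content and an approximate melting temperature using the Wallace rule (Tm = 2 x (A+T) + 4 x (G+C), a rule of thumb for short oligonucleotides). The 18-mer shown is a hypothetical sequence chosen for illustration; it is not a validated Ebola primer, and real assay design would use more accurate thermodynamic models and specificity checks.

```python
def gc_content(primer: str) -> float:
    """Fraction of G and C bases in a DNA primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def wallace_tm(primer: str) -> int:
    """Approximate melting temperature in degrees Celsius (Wallace rule).
    Intended only for short oligonucleotides of roughly 14-20 nt."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


# Hypothetical 18-nt primer (illustration only, not a validated sequence).
primer = "ATGGATTCTCGTCCTCAG"
print(f"length={len(primer)} GC={gc_content(primer):.0%} Tm~{wallace_tm(primer)} C")
```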

A visual model of the Ebola virus is available at the "Visual Science" website:

http://visual-science.com/projects/ebola/poster/



EBOLA Model

Features of the unsegmented genome of negative-stranded RNA viruses are:

  • Negative sense RNA in the virion
  • Virion-associated RNA polymerase mediates transcription and replication
  • Genome transcribed into 6-10 separate mRNAs from a single promoter
  • Replication occurs by synthesis of a complete positive-sense RNA antigenome
  • Nucleoprotein-encapsidated RNA (the nucleocapsid) is the functional template for synthesis of replicative RNA and mRNA
  • Independently assembled nucleocapsids are enveloped at the cell surface at sites containing virus proteins
  • Are mainly cytoplasmic
  • Can occur in invertebrates, vertebrates and plants
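The point about separate mRNAs can be made concrete with the gene coordinates listed in the Zaire ebolavirus reference record NC_002549.1, which is reproduced later in this article. The sketch below copies the seven `gene` ranges from that record and classifies each gene junction as an intergenic region or an overlap; the function name `junctions` is an assumption made for this example.

```python
# Gene coordinates (1-based, inclusive) copied from the "gene" features
# of NCBI record NC_002549.1, in the orientation used by that record.
GENES = [
    ("NP", 56, 3026),
    ("VP35", 3032, 4407),
    ("VP40", 4390, 5894),
    ("GP", 5900, 8305),
    ("VP30", 8288, 9740),
    ("VP24", 9885, 11518),
    ("L", 11501, 18282),
]


def junctions(genes):
    """Classify each pair of neighbouring genes as separated by an
    intergenic region or as overlapping, and report the size in nt."""
    out = []
    for (left, _, left_end), (right, right_start, _) in zip(genes, genes[1:]):
        gap = right_start - left_end - 1
        kind = "overlap" if gap < 0 else "intergenic"
        out.append((left, right, kind, abs(gap)))
    return out


for left, right, kind, size in junctions(GENES):
    print(f"{left:>4} -> {right:<4} {kind:<10} {size:>3} nt")
```

With these coordinates, three junctions (VP35/VP40, GP/VP30 and VP24/L) come out as 18-nt overlaps, and the other three as intergenic regions of 5, 5 and 144 nt, in agreement with the "intergenic region" features annotated in the record.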

Features of the family Filoviridae are:

  • Filamentous forms with branching; sometimes U-shaped, 6-shaped or circular 
  • Uniform diameter of 80 nm and varying lengths up to 14,000 nm. Infectious particle length is 790 nm for Marburg virus and 970 nm for Ebola virus
  • Surface spikes of 10 nm length
  • Helical nucleocapsid; 50 nm diameter, with an axial space of 20 nm diameter and helical periodicity of about 5 nm
  • Linear, negative-sense, single-stranded RNA mini-genome of ~13-19 kb with a molecular weight (Mr) of 4.2 x 10^6
  • At least five (5) proteins; a large (polymerase) protein, a surface glycoprotein, two (2) nucleocapsid-associated proteins, and at least one other protein of unknown function
  • Biology enigmatic; only two antigenically unrelated viruses known; blood borne infection of humans and monkeys

Filoviruses are responsible for newly emerging infections. Filoviruses are classified as Biosafety Level 4 agents; in comparison, HIV is only considered a Biosafety Level 2+ agent. Filoviruses can infect mice, hamsters, guinea pigs and monkeys. However, it is not known at present where the virus originates in the wild.

Most human epidemics appear to spread via blood: in hospitals, often through contaminated needles, and otherwise through close contact with infected persons or their body fluids. Primary infections with Marburg and Ebola viruses are usually 25 to 90% fatal. Death is thought to result from necrosis of visceral organs, for example the liver, caused by viral infection of tissue parenchymal cells.

Viral RNA is not infectious by itself. Therefore, the use of cloned or synthetic viral RNA can be very useful for the development and production of diagnostic tests or the development of vaccines against filoviruses, for example, the Ebola virus. 

Research aimed at developing a vaccine for Ebola has been under way for several years. The first successful immunization against Ebola virus infection was reported in 1998.

“Abstract: Infection by Ebola virus causes rapidly progressive, often fatal, symptoms of fever, hemorrhage and hypotension. Previous attempts to elicit protective immunity for this disease have not met with success. We report here that protection against the lethal effects of Ebola virus can be achieved in an animal model by immunizing with plasmids encoding viral proteins. We analyzed immune responses to the viral nucleoprotein (NP) and the secreted or transmembrane forms of the glycoprotein (sGP or GP) and their ability to protect against infection in a guinea pig infection model analogous to the human disease. Protection was achieved and correlated with antibody titer and antigen-specific T-cell responses to sGP or GP. Immunity to Ebola virus can therefore be developed through genetic vaccination and may facilitate efforts to limit the spread of this disease.” 

{Xu L, Sanchez A, Yang Z, Zaki SR, Nabel EG, Nichol ST, Nabel GJ.; Immunization for Ebola virus infection. Nat Med. 1998 Jan;4(1):37-42.}

The result: a DNA vaccine encoding the glycoprotein (sGP or GP) of the Ebola virus evoked a T-cell based immune response in guinea pigs and protected the animals against infection. Further studies indicated that DNA vaccines are useful for vaccination. DNA immunization combined with adenovirus vectors encoding viral proteins protected crab-eating, or cynomolgus, macaques (Macaca fascicularis) from the lethal wild-type Zaire virus.

“Abstract: Outbreaks of haemorrhagic fever caused by the Ebola virus are associated with high mortality rates that are a distinguishing feature of this human pathogen. The highest lethality is associated with the Zaire subtype, one of four strains identified to date. Its rapid progression allows little opportunity to develop natural immunity, and there is currently no effective anti-viral therapy. Therefore, vaccination offers a promising intervention to prevent infection and limit spread. Here we describe a highly effective vaccine strategy for Ebola virus infection in non-human primates. A combination of DNA immunization and boosting with adenoviral vectors that encode viral proteins generated cellular and humoral immunity in cynomolgus macaques. Challenge with a lethal dose of the highly pathogenic, wild-type, 1976 Mayinga strain of Ebola Zaire virus resulted in uniform infection in controls, who progressed to a moribund state and death in less than one week. In contrast, all vaccinated animals were asymptomatic for more than six months, with no detectable virus after the initial challenge. These findings demonstrate that it is possible to develop a preventive vaccine against Ebola virus infection in primates.”

 {Sullivan NJ, Sanchez A, Rollin PE, Yang ZY, Nabel GJ.; Development of a preventive vaccine for Ebola virus infection in primates. Nature. 2000 Nov 30;408(6812):605-9.}

Sequence analysis of Ebola viruses from the outbreaks in 1976 and 1995 showed a high degree of genetic conservation for this virus type. One possible explanation is that Ebola viruses have coevolved with their natural host reservoirs and change little in the wild.
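Genetic conservation of this kind is commonly summarized as percent identity between aligned sequences. The following is a minimal sketch of that calculation, assuming the two sequences have already been aligned to equal length with "-" marking gaps. The two short sequences are hypothetical and do not represent the 1976 or 1995 isolates.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity over aligned columns in which neither
    sequence has a gap character ('-')."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical aligned 20-nt fragments differing at one position.
isolate_a = "AUGGAUUCUCGUCCUCAGAA"
isolate_b = "AUGGAUUCUCGUCCACAGAA"
print(percent_identity(isolate_a, isolate_b))  # 95.0
```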

References

Kawaoka Y (Ed.). Biology of Negative Strand RNA Viruses: The Power of Reverse Genetics. Springer-Verlag Berlin Heidelberg, 2004.

Ebihara H, Takada A, Kobasa D, Jones S, Neumann G, et al. Molecular determinants of Ebola virus virulence in mice. PLoS Pathog. 2006;2(7):e73. DOI: 10.1371/journal.ppat.0020073.

Gibbs AJ, Calisher CH, Garcia-Arenal F (Eds.). Molecular Basis of Virus Evolution. Cambridge University Press, 1995.

http://www.ncbi.nlm.nih.gov/pubmed/?term=ebola+virus+review

 

The Zaire Ebola Virus Genome and Proteome

Graphical display and GenBank record from NCBI.

Zaire ebolavirus isolate Ebola virus H.sapiens-tc/COD/1976/Yambuku-Mayinga, complete genome. NCBI Reference Sequence: NC_002549.1. 

Source

http://www.ncbi.nlm.nih.gov/nuccore/10313991?report=graph
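The CDS locations in the feature table of this record can be turned into expected protein lengths with a little arithmetic. The sketch below handles only the simple location forms that appear in the CDS features of NC_002549.1 (`start..end` and `join(...)`); it is not a general GenBank parser, and the helper names are assumptions made for this example. In the GP location `join(6039..6923,6923..8068)`, position 6923 is counted twice, which corresponds to the additional A residue that the record says is inserted during transcription.

```python
import re


def parse_location(location: str) -> list:
    """Extract (start, end) pairs (1-based, inclusive) from a simple
    GenBank location such as '470..2689' or 'join(6039..6923,6923..8068)'."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location: str) -> int:
    """Total number of nucleotides covered by all segments."""
    return sum(end - start + 1 for start, end in parse_location(location))


def protein_length(location: str) -> int:
    """Number of amino acids, assuming the CDS ends in a stop codon."""
    nt = cds_length(location)
    if nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nt // 3 - 1  # the final codon is the stop codon


# CDS locations copied from the feature table of NC_002549.1.
CDS = {
    "NP": "470..2689",
    "GP": "join(6039..6923,6923..8068)",
    "sGP": "6039..7133",
    "ssGP": "join(6039..6922,6924..6933)",
}

for name, loc in CDS.items():
    print(f"{name:>5}: {cds_length(loc):>5} nt, {protein_length(loc):>4} aa")
```

Under these assumptions the three glycoprotein products share the same start at position 6039 but differ in length (676, 364 and 297 amino acids for GP, sGP and ssGP), which illustrates how editing at a single site changes the reading frame.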


LOCUS       NC_002549              18959 bp    cRNA    linear   VRL 27-AUG-2014
DEFINITION  Zaire ebolavirus isolate Ebola virus
            H.sapiens-tc/COD/1976/Yambuku-Mayinga, complete genome.
ACCESSION   NC_002549
VERSION     NC_002549.1  GI:10313991
DBLINK      BioProject: PRJNA14703
KEYWORDS    RefSeq.
SOURCE      Zaire ebolavirus (ZEBOV)
  ORGANISM  Zaire ebolavirus
            Viruses; ssRNA negative-strand viruses; Mononegavirales;
            Filoviridae; Ebolavirus.
REFERENCE   1  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Volchkova,V.A., Chepurnov,A.A., Blinov,V.M.,
            Dolnik,O., Netesov,S.V. and Feldmann,H.
  TITLE     Characterization of the L gene and 5' trailer region of Ebola virus
  JOURNAL   J. Gen. Virol. 80 (Pt 2), 355-362 (1999)
   PUBMED   10073695
REFERENCE   2  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Volchkova,V.A., Slenczka,W., Klenk,H.D. and
            Feldmann,H.
  TITLE     Release of viral glycoproteins during Ebola virus infection
  JOURNAL   Virology 245 (1), 110-119 (1998)
   PUBMED   9614872
REFERENCE   3  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Feldmann,H., Volchkova,V.A. and Klenk,H.D.
  TITLE     Processing of the Ebola virus glycoprotein by the proprotein
            convertase furin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 95 (10), 5762-5767 (1998)
   PUBMED   9576958
REFERENCE   4  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Becker,S., Volchkova,V.A., Ternovoj,V.A.,
            Kotov,A.N., Netesov,S.V. and Klenk,H.D.
  TITLE     GP mRNA of Ebola virus is edited by the Ebola virus polymerase and
            by T7 and vaccinia virus polymerases
  JOURNAL   Virology 214 (2), 421-430 (1995)
   PUBMED   8553543
REFERENCE   5  (bases 1 to 18959)
  AUTHORS   Bukreyev,A.A., Volchkov,V.E., Blinov,V.M. and Netesov,S.V.
  TITLE     The VP35 and VP40 proteins of filoviruses. Homology between Marburg
            and Ebola viruses
  JOURNAL   FEBS Lett. 322 (1), 41-46 (1993)
   PUBMED   8482365
REFERENCE   6  (bases 1 to 18959)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (27-SEP-2000) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   7  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUN-2000) Institute of Virology, Philipps-University
            Marburg, Robert-Koch-Str. 17, Marburg 35037, Germany
  REMARK    Sequence update by submitter
REFERENCE   8  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Institute of Virology, Philipps-University
            Marburg, Robert-Koch-Str. 17, Marburg 35037, Germany
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence is identical to AF086833.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..18959
                     /organism="Zaire ebolavirus"
                     /mol_type="viral cRNA"
                     /isolate="Ebola virus
                     H.sapiens-tc/COD/1976/Yambuku-Mayinga"
                     /db_xref="taxon:186538"
     5'UTR           1..55
                     /note="putative leader region"
                     /citation=[1]
                     /function="regulation or initiation of RNA replication"
     gene            56..3026
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /db_xref="GeneID:911830"
     mRNA            56..3026
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /product="nucleoprotein"
                     /db_xref="GeneID:911830"
     misc_signal     56..67
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /note="putative; transcription start signal"
                     /citation=[1]
     CDS             470..2689
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /function="encapsidation of genomic RNA"
                     /codon_start=1
                     /product="nucleoprotein"
                     /protein_id="NP_066243.1"
                     /db_xref="GI:10314000"
                     /db_xref="GeneID:911830"
                     /translation="MDSRPQKIWMAPSLTESDMDYHKILTAGLSVQQGIVRQRVIPVY
                     QVNNLEEICQLIIQAFEAGVDFQESADSFLLMLCLHHAYQGDYKLFLESGAVKYLEGH
                     GFRFEVKKRDGVKRLEELLPAVSSGKNIKRTLAAMPEEETTEANAGQFLSFASLFLPK
                     LVVGEKACLEKVQRQIQVHAEQGLIQYPTAWQSVGHMMVIFRLMRTNFLIKFLLIHQG
                     MHMVAGHDANDAVISNSVAQARFSGLLIVKTVLDHILQKTERGVRLHPLARTAKVKNE
                     VNSFKAALSSLAKHGEYAPFARLLNLSGVNNLEHGLFPQLSAIALGVATAHGSTLAGV
                     NVGEQYQQLREAATEAEKQLQQYAESRELDHLGLDDQEKKILMNFHQKKNEISFQQTN
                     AMVTLRKERLAKLTEAITAASLPKTSGHYDDDDDIPFPGPINDDDNPGHQDDDPTDSQ
                     DTTIPDVVVDPDDGSYGEYQSYSENGMNAPDDLVLFDLDEDDEDTKPVPNRSTKGGQQ
                     KNSQKGQHIEGRQTQSRPIQNVPGPHRTIHHASAPLTDNDRRNEPSGSTSPRMLTPIN
                     EEADPLDDADDETSSLPPLESDDEEQDRDGTSNRTPTVAPPAPVYRDHSEKKELPQDE
                     QQDQDHTQEARNQDSDNTQSEHSFEEMYRHILRSQGPFDAVLYYHMMKDEPVVFSTSD
                     GKEYTYPDSLEEEYPPWLTEKEAMNEENRFVTLDGQQFYWPVMNHKNKFMAILQHHQ"
     misc_feature    524..2671
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /note="Ebola nucleoprotein; Region: Ebola_NP; pfam05505"
                     /db_xref="CDD:147601"
     polyA_signal    3015..3026
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
     misc_feature    3027..3031
                     /note="intergenic region"
     gene            3032..4407
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /db_xref="GeneID:911827"
     mRNA            3032..4407
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /product="VP35"
                     /citation=[5]
                     /db_xref="GeneID:911827"
     misc_signal     3032..3043
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /note="putative; transcription start signal"
                     /citation=[5]
     CDS             3129..4151
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /function="polymerase complex protein"
                     /citation=[5]
                     /codon_start=1
                     /product="polymerase complex protein"
                     /protein_id="NP_066244.1"
                     /db_xref="GI:10313992"
                     /db_xref="GeneID:911827"
                     /translation="MTTRTKGRGHTAATTQNDRMPGPELSGWISEQLMTGRIPVSDIF
                     CDIENNPGLCYASQMQQTKPNPKTRNSQTQTDPICNHSFEEVVQTLASLATVVQQQTI
                     ASESLEQRITSLENGLKPVYDMAKTISSLNRVCAEMVAKYDLLVMTTGRATATAAATE
                     AYWAEHGQPPPGPSLYEESAIRGKIESRDETVPQSVREAFNNLNSTTSLTEENFGKPD
                     ISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCA
                     LIQITKRVPIFQDAAPPVIHIRSRGDIPRACQKSLRPVPPSPKIDRGWVCVFQLQDGK
                     TLGLKI"
     misc_feature    3186..4148
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /note="Filoviridae VP35; Region: Filo_VP35; pfam02097"
                     /db_xref="CDD:145320"
     gene            4390..5894
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /db_xref="GeneID:911825"
     mRNA            4390..5894
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /product="VP40"
                     /citation=[5]
                     /db_xref="GeneID:911825"
     misc_signal     4390..4401
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /note="transcription start signal"
                     /citation=[5]
     polyA_signal    4397..4407
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /citation=[5]
     CDS             4479..5459
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /citation=[5]
                     /codon_start=1
                     /product="matrix protein"
                     /protein_id="NP_066245.1"
                     /db_xref="GI:10313993"
                     /db_xref="GeneID:911825"
                     /translation="MRRVILPTAPPEYMEAIYPVRSNSTIARGGNSNTGFLTPESVNG
                     DTPSNPLRPIADDTIDHASHTPGSVSSAFILEAMVNVISGPKVLMKQIPIWLPLGVAD
                     QKTYSFDSTTAAIMLASYTITHFGKATNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQE
                     FVLPPVQLPQYFTFDLTALKLITQPLPAATWTDDTPTGSNGALRPGISFHPKLRPILL
                     PNKSGKKGNSADLTSPEKIQAIMTSLQDFKIVPIDPTKNIMGIEVPETLVHKLTGKKV
                     TSKNGQPIIPVLLPKYIGLDPVAPGDLTMVITQDCDTCHSPASLPAVIEK"
     misc_feature    4479..5363
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /note="Matrix protein VP40; Region: VP40; pfam07447"
                     /db_xref="CDD:116068"
     polyA_signal    5883..5894
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /citation=[5]
     misc_feature    5895..5899
                     /note="intergenic region"
     gene            5900..8305
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /db_xref="GeneID:911829"
     mRNA            5900..8305
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /product="sGP"
                     /note="unedited mRNA"
                     /citation=[4]
                     /db_xref="GeneID:911829"
     misc_signal     5900..5911
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="putative; transcription start signal"
                     /citation=[4]
     CDS             join(6039..6923,6923..8068)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /function="receptor binding and fusion"
                     /artificial_location="low-quality sequence region"
                     /note="virion spike glycoprotein precursor; an addition A
                     residue is inserted during transcription; encodes two
                     disulfide linked subunits GP1 and GP2"
                     /citation=[2]
                     /citation=[3]
                     /citation=[4]
                     /codon_start=1
                     /product="spike glycoprotein"
                     /protein_id="NP_066246.1"
                     /db_xref="GI:10313995"
                     /db_xref="GeneID:911829"
                     /translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
                     VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSATKRWGFRSGVPPKVVNYEAG
                     EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
                     LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
                     RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWK
                     VNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTVVSNGAKNISGQSPARTSSDPGTNT
                     TTEDHKIMASENSSAMVQVHSQGREAAVSHLTTLATISTSPQSLTTKPGPDNSTHNTP
                     VYKLDISEATQVEQHHRRTDNDSTASDTPSATTAAGPPKAENTNTSKSTDFLDPATTT
                     SPQNHSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRTRREAIVNAQ
                     PKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYIEGLMHNQDGLICGLRQLANETT
                     QALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKID
                     QIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF"
     misc_feature    7529..7540
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="encodes the glycoprotein cleavage site, precursor
                     GP is cleaved by subtilisin-like cellular protease furin
                     into subunits GP1 and GP2 that are linked by a disulfide
                     bond"
                     /citation=[3]
     misc_feature    7793..7870
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="immunosuppressive motif; other site"
     misc_feature    7988..8053
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="transmembrane anchor; transmembrane region"
     misc_feature    7706..7924
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="heptad repeat 1-heptad repeat 2 region of the
                     transmembrane subunit of Filoviridae viruses, Ebola virus
                     and Marburg virus, and related domains; Region:
                     Ebola-like_HR1-HR2; cd09850"
                     /db_xref="CDD:197367"
     misc_feature    join(6081..6923,6923..7153)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Filovirus glycoprotein; Region: Filo_glycop;
                     pfam01611"
                     /db_xref="CDD:110602"
     misc_feature    7706..7732
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1A; other site"
                     /db_xref="CDD:197367"
     misc_feature    7733..7762
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1B; other site"
                     /db_xref="CDD:197367"
     misc_feature    7763..7783
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1C; other site"
                     /db_xref="CDD:197367"
     misc_feature    7784..7831
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1D; other site"
                     /db_xref="CDD:197367"
     misc_feature    7787..7837
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="immunosuppressive region; other site"
                     /db_xref="CDD:197367"
     misc_feature    order(7838..7858,7859..7861)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="CX(6,7)C motif; other site"
                     /db_xref="CDD:197367"
     misc_feature    7886..7924
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR2; other site"
                     /db_xref="CDD:197367"
     misc_feature    order(7784..7786,7793..7795)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Cl binding site [ion binding]; other site"
                     /db_xref="CDD:197367"
     misc_feature    order(7706..7714,7718..7723,7727..7732,7736..7744,
                     7748..7756,7760..7765,7769..7777,7781..7807,7811..7819,
                     7823..7828,7844..7849,7856..7858,7865..7876,7880..7882,
                     7889..7894,7901..7903,7910..7915,7922..7924)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:197367"
     misc_feature    order(7706..7714,7718..7726,7730..7735,7739..7747,
                     7754..7768,7772..7783,7787..7792,7796..7804,7808..7813,
                     7817..7819)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1-GP1 interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:197367"
     CDS             6039..7133
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="sGP, small non-structural, secreted glycoprotein;
                     sGP secreted as a anti-parallel oriented homodimer"
                     /citation=[4]
                     /codon_start=1
                     /product="small secreted glycoprotein"
                     /protein_id="NP_066247.1"
                     /db_xref="GI:10313994"
                     /db_xref="GeneID:911829"
                     /translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
                     VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSATKRWGFRSGVPPKVVNYEAG
                     EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
                     LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
                     RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWK
                     VNPEIDTTIGEWAFWETKKTSLEKFAVKSCLSQLYQTEPKTSVVRVRRELLPTQGPTQ
                     QLKTTKSWLQKIPLQWFKCTVKEGKLQCRI"
     misc_feature    6081..7130
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Filovirus glycoprotein; Region: Filo_glycop;
                     pfam01611"
                     /db_xref="CDD:110602"
     CDS             join(6039..6922,6924..6933)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /artificial_location="low-quality sequence region"
                     /note="ssGP; second non-structural secreted glycoprotein;
                     secreted in a monomeric form; one A residue is deleted or
                     two additional A residues are inserted at the editing site
                     during transcription of the GP gene"
                     /citation=[4]
                     /codon_start=1
                     /product="second secreted glycoprotein"
                     /protein_id="NP_066248.1"
                     /db_xref="GI:10313996"
                     /db_xref="GeneID:911829"
                     /translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
                     VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSATKRWGFRSGVPPKVVNYEAG
                     EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
                     LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
                     RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWK
                     VNPEIDTTIGEWAFWETKKPH"
     misc_feature    join(6081..6922,6924..>6924)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Filovirus glycoprotein; Region: Filo_glycop;
                     pfam01611"
                     /db_xref="CDD:110602"
     misc_signal     6918..6924
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="additional A residues are inserted or deleted
                     during transcription of the GP gene by the viral
                     polymerase"
                     /citation=[4]
                     /function="RNA editing"
     gene            8288..9740
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /db_xref="GeneID:911826"
     mRNA            8288..9740
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /product="VP30"
                     /db_xref="GeneID:911826"
     misc_signal     8288..8299
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="putative; transcription start signal"
     polyA_signal    8295..8305
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /citation=[4]
     CDS             8509..9375
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="polymerase complex protein"
                     /codon_start=1
                     /product="minor nucleoprotein"
                     /protein_id="NP_066249.1"
                     /db_xref="GI:10313997"
                     /db_xref="GeneID:911826"
                     /translation="MEASYERGRPRAARQHSRDGHDHHVRARSSSRENYRGEYRQSRS
                     ASQVRVPTVFHKKRVEPLTVPPAPKDICPTLKKGFLCDSSFCKKDHQLESLTDRELLL
                     LIARKTCGSVEQQLNITAPKDSRLANPTADDFQQEEGPKITLLTLIKTAEHWARQDIR
                     TIEDSKLRALLTLCAVMTRKFSKSQLSLLCETHLRREGLGQDQAEPVLEVYQRLHSDK
                     GGSFEAALWQQWDRQSLIMFITAFLNIALQLPCESSAVVVSGLRTLVPQSDNEEASTN
                     PGTCSWSDEGTP"
     misc_feature    8932..9321
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="Ebola virus-specific transcription factor VP30;
                     Region: Transcript_VP30; pfam11507"
                     /db_xref="CDD:151944"
     polyA_signal    9730..9740
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="putative"
     misc_feature    9741..9884
                     /note="intergenic region"
     gene            9885..11518
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="putative"
                     /db_xref="GeneID:911828"
     mRNA            9885..11496
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /product="VP24"
                     /db_xref="GeneID:911828"
     misc_signal     9885..9896
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="transcription start signal"
     CDS             10345..11100
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /codon_start=1
                     /product="membrane-associated protein"
                     /protein_id="NP_066250.1"
                     /db_xref="GI:10313998"
                     /db_xref="GeneID:911828"
                     /translation="MAKATGRYNLISPKKDLEKGVVLSDLCNFLVSQTIQGWKVYWAG
                     IEFDVTHKGMALLHRLKTNDFAPAWSMTRNLFPHLFQNPNSTIESPLWALRVILAAGI
                     QDQLIDQSLIEPLAGALGLISDWLLTTNTNHFNMRTQRVKEQLSLKMLSLIRSNILKF
                     INKLDALHVVNYNGLLSSIEIGTQNHTIIITRTNMGFLVELQEPDKSAMNRMKPGPAK
                     FSLLHESTLKAFTQGSSTRMQSLILEFNSSLAI"
     misc_feature    10369..11040
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="Filovirus membrane-associated protein VP24; Region:
                     Filo_VP24; pfam06389"
                     /db_xref="CDD:253701"
     polyA_signal    11485..11496
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="putative"
     misc_feature    11497..11500
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="intergenic region"
     gene            11501..18282
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /db_xref="GeneID:911824"
     mRNA            11501..18282
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /product="polymerase"
                     /citation=[1]
                     /db_xref="GeneID:911824"
     misc_signal     11501..11512
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /note="transcription start signal"
                     /citation=[1]
     polyA_signal    11508..11518
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
     CDS             11581..18219
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /function="synthesis of viral RNAs; transcriptional RNA
                     editing"
                     /note="polymerase"
                     /citation=[1]
                     /codon_start=1
                     /product="RNA-dependent RNA polymerase"
                     /protein_id="NP_066251.1"
                     /db_xref="GI:10313999"
                     /db_xref="GeneID:911824"
                     /translation="MATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNC
                     KLPKHIYRLKYDVTVTKFLSDVPVATLPIDFIVPVLLKALSGNGFCPVEPRCQQFLDE
                     IIKYTMQDALFLKYYLKNVGAQEDCVDEHFQEKILSSIQGNEFLHQMFFWYDLAILTR
                     RGRLNRGNSRSTWFVHDDLIDILGYGDYVFWKIPISMLPLNTQGIPHAAMDWYQASVF
                     KEAVQGHTHIVSVSTADVLIMCKDLITCRFNTTLISKIAEIEDPVCSDYPNFKIVSML
                     YQSGDYLLSILGSDGYKIIKFLEPLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEI
                     TEMRALKPSQAQKIREFHRTLIRLEMTPQQLCELFSIQKHWGHPVLHSETAIQKVKKH
                     ATVLKALRPIVIFETYCVFKYSIAKHYFDSQGSWYSVTSDRNLTPGLNSYIKRNQFPP
                     LPMIKELLWEFYHLDHPPLFSTKIISDLSIFIKDRATAVERTCWDAVFEPNVLGYNPP
                     HKFSTKRVPEQFLEQENFSIENVLSYAQKLEYLLPQYRNFSFSLKEKELNVGRTFGKL
                     PYPTRNVQTLCEALLADGLAKAFPSNMMVVTEREQKESLLHQASWHHTSDDFGEHATV
                     RGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKNVFNWMHYTIPQCYMHVSDYYNP
                     PHNLTLENRDNPPEGPSSYRGHMGGIEGLQQKLWTSISCAQISLVEIKTGFKLRSAVM
                     GDNQCITVLSVFPLETDADEQEQSAEDNAARVAASLAKVTSACGIFLKPDETFVHSGF
                     IYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASIGTAFERSISETRHIFP
                     CRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTISLALAVPQVLGGLSF
                     LNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNCTAIDFVLNPSGL
                     NVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWLLSSTPVMSRF
                     AADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQRWSLWFSY
                     LDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFKVFWLKP
                     YEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTEDKIGQ
                     PAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTPSH
                     YSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI
                     NYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENE
                     LIYDSNPLKGGLNCNISFDNPFFQGKRLNIIEDDLIRLPHLSGWELAKTIMQSIISDS
                     NNSSTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYY
                     LTTQIHNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLR
                     TSISSFLTFVKEWIINRGTIVPLWIVYPLEGQNPTPVNNFLYQIVELLVHDSSRQQAF
                     KTTISDHVHPHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKYLARDSSTGSSTNNS
                     DGHIERSQEQTTRDPHDGTERNLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGT
                     ANPKLNFDRSRHNVKFQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQ
                     DEISKYLRQLRSVIDTTVYCRFTGIVSSMHYKLDEVLWEIESFKSAVTLAEGEGAGAL
                     LLIQKYQVKTLFFNTLATESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQ
                     ITDITNPTWFKDQRARLPKQVEVITMDAETTENINRSKLYEAVYKLILHHIDPSVLKA
                     VVLKVFLSDTEGMLWLNDNLAPFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPH
                     QNHLSCKQVILTALQLQIQRSPYWLSHLTQYADCELHLSYIRLGFPSLEKVLYHRYNL
                     VDSKRGPLVSITQHLAHLRAEIRELTNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYL
                     KFFLIVQALKHNGTWQAEFKKLPELISVCNRFYHIRDCNCEERFLVQTLYLHRMQDSE
                     VKLIERLTGLLSLFPDGLYRFD"
     misc_feature    11608..14853
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /note="Mononegavirales RNA dependent RNA polymerase;
                     Region: Mononeg_RNA_pol; pfam00946"
                     /db_xref="CDD:250248"
     misc_feature    15223..18192
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /note="mRNA capping enzyme, paramyxovirus family; Region:
                     paramyx_RNAcap; TIGR04198"
                     /db_xref="CDD:234496"
     polyA_signal    18272..18282
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /citation=[1]
     3'UTR           18283..18959
                     /note="putative trailer region"
                     /citation=[1]
                     /function="regulation or initiation of RNA replication"
ORIGIN      
        1 cggacacaca aaaagaaaga agaattttta ggatcttttg tgtgcgaata actatgagga
       61 agattaataa ttttcctctc attgaaattt atatcggaat ttaaattgaa attgttactg
      121 taatcacacc tggtttgttt cagagccaca tcacaaagat agagaacaac ctaggtctcc
      181 gaagggagca agggcatcag tgtgctcagt tgaaaatccc ttgtcaacac ctaggtctta
      241 tcacatcaca agttccacct cagactctgc agggtgatcc aacaacctta atagaaacat
      301 tattgttaaa ggacagcatt agttcacagt caaacaagca agattgagaa ttaaccttgg
      361 ttttgaactt gaacacttag gggattgaag attcaacaac cctaaagctt ggggtaaaac
      421 attggaaata gttaaaagac aaattgctcg gaatcacaaa attccgagta tggattctcg
      481 tcctcagaaa atctggatgg cgccgagtct cactgaatct gacatggatt accacaagat
      541 cttgacagca ggtctgtccg ttcaacaggg gattgttcgg caaagagtca tcccagtgta
      601 tcaagtaaac aatcttgaag aaatttgcca acttatcata caggcctttg aagcaggtgt
      661 tgattttcaa gagagtgcgg acagtttcct tctcatgctt tgtcttcatc atgcgtacca
      721 gggagattac aaacttttct tggaaagtgg cgcagtcaag tatttggaag ggcacgggtt
      781 ccgttttgaa gtcaagaagc gtgatggagt gaagcgcctt gaggaattgc tgccagcagt
      841 atctagtgga aaaaacatta agagaacact tgctgccatg ccggaagagg agacaactga
      901 agctaatgcc ggtcagtttc tctcctttgc aagtctattc cttccgaaat tggtagtagg
      961 agaaaaggct tgccttgaga aggttcaaag gcaaattcaa gtacatgcag agcaaggact
     1021 gatacaatat ccaacagctt ggcaatcagt aggacacatg atggtgattt tccgtttgat
     1081 gcgaacaaat tttctgatca aatttctcct aatacaccaa gggatgcaca tggttgccgg
     1141 gcatgatgcc aacgatgctg tgatttcaaa ttcagtggct caagctcgtt tttcaggctt
     1201 attgattgtc aaaacagtac ttgatcatat cctacaaaag acagaacgag gagttcgtct
     1261 ccatcctctt gcaaggaccg ccaaggtaaa aaatgaggtg aactccttta aggctgcact
     1321 cagctccctg gccaagcatg gagagtatgc tcctttcgcc cgacttttga acctttctgg
     1381 agtaaataat cttgagcatg gtcttttccc tcaactatcg gcaattgcac tcggagtcgc
     1441 cacagcacac gggagtaccc tcgcaggagt aaatgttgga gaacagtatc aacaactcag
     1501 agaggctgcc actgaggctg agaagcaact ccaacaatat gcagagtctc gcgaacttga
     1561 ccatcttgga cttgatgatc aggaaaagaa aattcttatg aacttccatc agaaaaagaa
     1621 cgaaatcagc ttccagcaaa caaacgctat ggtaactcta agaaaagagc gcctggccaa
     1681 gctgacagaa gctatcactg ctgcgtcact gcccaaaaca agtggacatt acgatgatga
     1741 tgacgacatt ccctttccag gacccatcaa tgatgacgac aatcctggcc atcaagatga
     1801 tgatccgact gactcacagg atacgaccat tcccgatgtg gtggttgatc ccgatgatgg
     1861 aagctacggc gaataccaga gttactcgga aaacggcatg aatgcaccag atgacttggt
     1921 cctattcgat ctagacgagg acgacgagga cactaagcca gtgcctaata gatcgaccaa
     1981 gggtggacaa cagaagaaca gtcaaaaggg ccagcatata gagggcagac agacacaatc
     2041 caggccaatt caaaatgtcc caggccctca cagaacaatc caccacgcca gtgcgccact
     2101 cacggacaat gacagaagaa atgaaccctc cggctcaacc agccctcgca tgctgacacc
     2161 aattaacgaa gaggcagacc cactggacga tgccgacgac gagacgtcta gccttccgcc
     2221 cttggagtca gatgatgaag agcaggacag ggacggaact tccaaccgca cacccactgt
     2281 cgccccaccg gctcccgtat acagagatca ctctgaaaag aaagaactcc cgcaagacga
     2341 gcaacaagat caggaccaca ctcaagaggc caggaaccag gacagtgaca acacccagtc
     2401 agaacactct tttgaggaga tgtatcgcca cattctaaga tcacaggggc catttgatgc
     2461 tgttttgtat tatcatatga tgaaggatga gcctgtagtt ttcagtacca gtgatggcaa
     2521 agagtacacg tatccagact cccttgaaga ggaatatcca ccatggctca ctgaaaaaga
     2581 ggctatgaat gaagagaata gatttgttac attggatggt caacaatttt attggccggt
     2641 gatgaatcac aagaataaat tcatggcaat cctgcaacat catcagtgaa tgagcatgga
     2701 acaatgggat gattcaaccg acaaatagct aacattaagt agtcaaggaa cgaaaacagg
     2761 aagaattttt gatgtctaag gtgtgaatta ttatcacaat aaaagtgatt cttatttttg
     2821 aatttaaagc tagcttatta ttactagccg tttttcaaag ttcaatttga gtcttaatgc
     2881 aaataggcgt taagccacag ttatagccat aattgtaact caatattcta actagcgatt
     2941 tatctaaatt aaattacatt atgcttttat aacttaccta ctagcctgcc caacatttac
     3001 acgatcgttt tataattaag aaaaaactaa tgatgaagat taaaaccttc atcatcctta
     3061 cgtcaattga attctctagc actcgaagct tattgtcttc aatgtaaaag aaaagctggt
     3121 ctaacaagat gacaactaga acaaagggca ggggccatac tgcggccacg actcaaaacg
     3181 acagaatgcc aggccctgag ctttcgggct ggatctctga gcagctaatg accggaagaa
     3241 ttcctgtaag cgacatcttc tgtgatattg agaacaatcc aggattatgc tacgcatccc
     3301 aaatgcaaca aacgaagcca aacccgaaga cgcgcaacag tcaaacccaa acggacccaa
     3361 tttgcaatca tagttttgag gaggtagtac aaacattggc ttcattggct actgttgtgc
     3421 aacaacaaac catcgcatca gaatcattag aacaacgcat tacgagtctt gagaatggtc
     3481 taaagccagt ttatgatatg gcaaaaacaa tctcctcatt gaacagggtt tgtgctgaga
     3541 tggttgcaaa atatgatctt ctggtgatga caaccggtcg ggcaacagca accgctgcgg
     3601 caactgaggc ttattgggcc gaacatggtc aaccaccacc tggaccatca ctttatgaag
     3661 aaagtgcgat tcggggtaag attgaatcta gagatgagac cgtccctcaa agtgttaggg
     3721 aggcattcaa caatctaaac agtaccactt cactaactga ggaaaatttt gggaaacctg
     3781 acatttcggc aaaggatttg agaaacatta tgtatgatca cttgcctggt tttggaactg
     3841 ctttccacca attagtacaa gtgatttgta aattgggaaa agatagcaac tcattggaca
     3901 tcattcatgc tgagttccag gccagcctgg ctgaaggaga ctctcctcaa tgtgccctaa
     3961 ttcaaattac aaaaagagtt ccaatcttcc aagatgctgc tccacctgtc atccacatcc
     4021 gctctcgagg tgacattccc cgagcttgcc agaaaagctt gcgtccagtc ccaccatcgc
     4081 ccaagattga tcgaggttgg gtatgtgttt ttcagcttca agatggtaaa acacttggac
     4141 tcaaaatttg agccaatctc ccttccctcc gaaagaggcg aataatagca gaggcttcaa
     4201 ctgctgaact atagggtacg ttacattaat gatacacttg tgagtatcag ccctggataa
     4261 tataagtcaa ttaaacgacc aagataaaat tgttcatatc tcgctagcag cttaaaatat
     4321 aaatgtaata ggagctatat ctctgacagt attataatca attgttatta agtaacccaa
     4381 accaaaagtg atgaagatta agaaaaacct acctcggctg agagagtgtt ttttcattaa
     4441 ccttcatctt gtaaacgttg agcaaaattg ttaaaaatat gaggcgggtt atattgccta
     4501 ctgctcctcc tgaatatatg gaggccatat accctgtcag gtcaaattca acaattgcta
     4561 gaggtggcaa cagcaataca ggcttcctga caccggagtc agtcaatggg gacactccat
     4621 cgaatccact caggccaatt gccgatgaca ccatcgacca tgccagccac acaccaggca
     4681 gtgtgtcatc agcattcatc cttgaagcta tggtgaatgt catatcgggc cccaaagtgc
     4741 taatgaagca aattccaatt tggcttcctc taggtgtcgc tgatcaaaag acctacagct
     4801 ttgactcaac tacggccgcc atcatgcttg cttcatacac tatcacccat ttcggcaagg
     4861 caaccaatcc acttgtcaga gtcaatcggc tgggtcctgg aatcccggat catcccctca
     4921 ggctcctgcg aattggaaac caggctttcc tccaggagtt cgttcttccg ccagtccaac
     4981 taccccagta tttcaccttt gatttgacag cactcaaact gatcacccaa ccactgcctg
     5041 ctgcaacatg gaccgatgac actccaacag gatcaaatgg agcgttgcgt ccaggaattt
     5101 catttcatcc aaaacttcgc cccattcttt tacccaacaa aagtgggaag aaggggaaca
     5161 gtgccgatct aacatctccg gagaaaatcc aagcaataat gacttcactc caggacttta
     5221 agatcgttcc aattgatcca accaaaaata tcatgggaat cgaagtgcca gaaactctgg
     5281 tccacaagct gaccggtaag aaggtgactt ctaaaaatgg acaaccaatc atccctgttc
     5341 ttttgccaaa gtacattggg ttggacccgg tggctccagg agacctcacc atggtaatca
     5401 cacaggattg tgacacgtgt cattctcctg caagtcttcc agctgtgatt gagaagtaat
     5461 tgcaataatt gactcagatc cagttttata gaatcttctc agggatagtg ataacatcta
     5521 tttagtaatc cgtccattag aggagacact tttaattgat caatatacta aaggtgcttt
     5581 acaccattgt cttttttctc tcctaaatgt agaacttaac aaaagactca taatatactt
     5641 gtttttaaag gattgattga tgaaagatca taactaataa cattacaaat aatcctacta
     5701 taatcaatac ggtgattcaa atgttaatct ttctcattgc acatactttt tgcccttatc
     5761 ctcaaattgc ctgcatgctt acatctgagg atagccagtg tgacttggat tggaaatgtg
     5821 gagaaaaaat cgggacccat ttctaggttg ttcacaatcc aagtacagac attgcccttc
     5881 taattaagaa aaaatcggcg atgaagatta agccgacagt gagcgtaatc ttcatctctc
     5941 ttagattatt tgttttccag agtaggggtc gtcaggtcct tttcaatcgt gtaaccaaaa
     6001 taaactccac tagaaggata ttgtggggca acaacacaat gggcgttaca ggaatattgc
     6061 agttacctcg tgatcgattc aagaggacat cattctttct ttgggtaatt atccttttcc
     6121 aaagaacatt ttccatccca cttggagtca tccacaatag cacattacag gttagtgatg
     6181 tcgacaaact agtttgtcgt gacaaactgt catccacaaa tcaattgaga tcagttggac
     6241 tgaatctcga agggaatgga gtggcaactg acgtgccatc tgcaactaaa agatggggct
     6301 tcaggtccgg tgtcccacca aaggtggtca attatgaagc tggtgaatgg gctgaaaact
     6361 gctacaatct tgaaatcaaa aaacctgacg ggagtgagtg tctaccagca gcgccagacg
     6421 ggattcgggg cttcccccgg tgccggtatg tgcacaaagt atcaggaacg ggaccgtgtg
     6481 ccggagactt tgccttccat aaagagggtg ctttcttcct gtatgatcga cttgcttcca
     6541 cagttatcta ccgaggaacg actttcgctg aaggtgtcgt tgcatttctg atactgcccc
     6601 aagctaagaa ggacttcttc agctcacacc ccttgagaga gccggtcaat gcaacggagg
     6661 acccgtctag tggctactat tctaccacaa ttagatatca ggctaccggt tttggaacca
     6721 atgagacaga gtacttgttc gaggttgaca atttgaccta cgtccaactt gaatcaagat
     6781 tcacaccaca gtttctgctc cagctgaatg agacaatata tacaagtggg aaaaggagca
     6841 ataccacggg aaaactaatt tggaaggtca accccgaaat tgatacaaca atcggggagt
     6901 gggccttctg ggaaactaaa aaaacctcac tagaaaaatt cgcagtgaag agttgtcttt
     6961 cacagttgta tcaaacggag ccaaaaacat cagtggtcag agtccggcgc gaacttcttc
     7021 cgacccaggg accaacacaa caactgaaga ccacaaaatc atggcttcag aaaattcctc
     7081 tgcaatggtt caagtgcaca gtcaaggaag ggaagctgca gtgtcgcatc taacaaccct
     7141 tgccacaatc tccacgagtc cccaatccct cacaaccaaa ccaggtccgg acaacagcac
     7201 ccataataca cccgtgtata aacttgacat ctctgaggca actcaagttg aacaacatca
     7261 ccgcagaaca gacaacgaca gcacagcctc cgacactccc tctgccacga ccgcagccgg
     7321 acccccaaaa gcagagaaca ccaacacgag caagagcact gacttcctgg accccgccac
     7381 cacaacaagt ccccaaaacc acagcgagac cgctggcaac aacaacactc atcaccaaga
     7441 taccggagaa gagagtgcca gcagcgggaa gctaggctta attaccaata ctattgctgg
     7501 agtcgcagga ctgatcacag gcgggagaag aactcgaaga gaagcaattg tcaatgctca
     7561 acccaaatgc aaccctaatt tacattactg gactactcag gatgaaggtg ctgcaatcgg
     7621 actggcctgg ataccatatt tcgggccagc agccgaggga atttacatag aggggctaat
     7681 gcacaatcaa gatggtttaa tctgtgggtt gagacagctg gccaacgaga cgactcaagc
     7741 tcttcaactg ttcctgagag ccacaactga gctacgcacc ttttcaatcc tcaaccgtaa
     7801 ggcaattgat ttcttgctgc agcgatgggg cggcacatgc cacattctgg gaccggactg
     7861 ctgtatcgaa ccacatgatt ggaccaagaa cataacagac aaaattgatc agattattca
     7921 tgattttgtt gataaaaccc ttccggacca gggggacaat gacaattggt ggacaggatg
     7981 gagacaatgg ataccggcag gtattggagt tacaggcgtt ataattgcag ttatcgcttt
     8041 attctgtata tgcaaatttg tcttttagtt tttcttcaga ttgcttcatg gaaaagctca
     8101 gcctcaaatc aatgaaacca ggatttaatt atatggatta cttgaatcta agattacttg
     8161 acaaatgata atataataca ctggagcttt aaacatagcc aatgtgattc taactccttt
     8221 aaactcacag ttaatcataa acaaggtttg acatcaatct agttatctct ttgagaatga
     8281 taaacttgat gaagattaag aaaaaggtaa tctttcgatt atctttaatc ttcatccttg
     8341 attctacaat catgacagtt gtctttagtg acaagggaaa gaagcctttt tattaagttg
     8401 taataatcag atctgcgaac cggtagagtt tagttgcaac ctaacacaca taaagcattg
     8461 gtcaaaaagt caatagaaat ttaaacagtg agtggagaca acttttaaat ggaagcttca
     8521 tatgagagag gacgcccacg agctgccaga cagcattcaa gggatggaca cgaccaccat
     8581 gttcgagcac gatcatcatc cagagagaat tatcgaggtg agtaccgtca atcaaggagc
     8641 gcctcacaag tgcgcgttcc tactgtattt cataagaaga gagttgaacc attaacagtt
     8701 cctccagcac ctaaagacat atgtccgacc ttgaaaaaag gatttttgtg tgacagtagt
     8761 ttttgcaaaa aagatcacca gttggagagt ttaactgata gggaattact cctactaatc
     8821 gcccgtaaga cttgtggatc agtagaacaa caattaaata taactgcacc caaggactcg
     8881 cgcttagcaa atccaacggc tgatgatttc cagcaagagg aaggtccaaa aattaccttg
     8941 ttgacactga tcaagacggc agaacactgg gcgagacaag acatcagaac catagaggat
     9001 tcaaaattaa gagcattgtt gactctatgt gctgtgatga cgaggaaatt ctcaaaatcc
     9061 cagctgagtc ttttatgtga gacacaccta aggcgcgagg ggcttgggca agatcaggca
     9121 gaacccgttc tcgaagtata tcaacgatta cacagtgata aaggaggcag ttttgaagct
     9181 gcactatggc aacaatggga ccgacaatcc ctaattatgt ttatcactgc attcttgaat
     9241 attgctctcc agttaccgtg tgaaagttct gctgtcgttg tttcagggtt aagaacattg
     9301 gttcctcaat cagataatga ggaagcttca accaacccgg ggacatgctc atggtctgat
     9361 gagggtaccc cttaataagg ctgactaaaa cactatataa ccttctactt gatcacaata
     9421 ctccgtatac ctatcatcat atatttaatc aagacgatat cctttaaaac ttattcagta
     9481 ctataatcac tctcgtttca aattaataag atgtgcatga ttgccctaat atatgaagag
     9541 gtatgataca accctaacag tgatcaaaga aaatcataat ctcgtatcgc tcgtaatata
     9601 acctgccaag catacctctt gcacaaagtg attcttgtac acaaataatg ttttactcta
     9661 caggaggtag caacgatcca tcccatcaaa aaataagtat ttcatgactt actaatgatc
     9721 tcttaaaata ttaagaaaaa ctgacggaac ataaattctt tatgcttcaa gctgtggagg
     9781 aggtgtttgg tattggctat tgttatatta caatcaataa caagcttgta aaaatattgt
     9841 tcttgtttca agaggtagat tgtgaccgga aatgctaaac taatgatgaa gattaatgcg
     9901 gaggtctgat aagaataaac cttattattc agattaggcc ccaagaggca ttcttcatct
     9961 ccttttagca aagtactatt tcagggtagt ccaattagtg gcacgtcttt tagctgtata
    10021 tcagtcgccc ctgagatacg ccacaaaagt gtctctaagc taaattggtc tgtacacatc
    10081 ccatacattg tattaggggc aataatatct aattgaactt agccgtttaa aatttagtgc
    10141 ataaatctgg gctaacacca ccaggtcaac tccattggct gaaaagaagc ttacctacaa
    10201 cgaacatcac tttgagcgcc ctcacaatta aaaaatagga acgtcgttcc aacaatcgag
    10261 cgcaaggttt caaggttgaa ctgagagtgt ctagacaaca aaatattgat actccagaca
    10321 ccaagcaaga cctgagaaaa aaccatggct aaagctacgg gacgatacaa tctaatatcg
    10381 cccaaaaagg acctggagaa aggggttgtc ttaagcgacc tctgtaactt cttagttagc
    10441 caaactattc aggggtggaa ggtttattgg gctggtattg agtttgatgt gactcacaaa
    10501 ggaatggccc tattgcatag actgaaaact aatgactttg cccctgcatg gtcaatgaca
    10561 aggaatctct ttcctcattt atttcaaaat ccgaattcca caattgaatc accgctgtgg
    10621 gcattgagag tcatccttgc agcagggata caggaccagc tgattgacca gtctttgatt
    10681 gaacccttag caggagccct tggtctgatc tctgattggc tgctaacaac caacactaac
    10741 catttcaaca tgcgaacaca acgtgtcaag gaacaattga gcctaaaaat gctgtcgttg
    10801 attcgatcca atattctcaa gtttattaac aaattggatg ctctacatgt cgtgaactac
    10861 aacggattgt tgagcagtat tgaaattgga actcaaaatc atacaatcat cataactcga
    10921 actaacatgg gttttctggt ggagctccaa gaacccgaca aatcggcaat gaaccgcatg
    10981 aagcctgggc cggcgaaatt ttccctcctt catgagtcca cactgaaagc atttacacaa
    11041 ggatcctcga cacgaatgca aagtttgatt cttgaattta atagctctct tgctatctaa
    11101 ctaaggtaga atacttcata ttgagctaac tcatatatgc tgactcaata gttatcttga
    11161 catctctgct ttcataatca gatatataag cataataaat aaatactcat atttcttgat
    11221 aatttgttta accacagata aatcctcact gtaagccagc ttccaagttg acacccttac
    11281 aaaaaccagg actcagaatc cctcaaacaa gagattccaa gacaacatca tagaattgct
    11341 ttattatatg aataagcatt ttatcaccag aaatcctata tactaaatgg ttaattgtaa
    11401 ctgaacccgc aggtcacatg tgttaggttt cacagattct atatattact aactctatac
    11461 tcgtaattaa cattagataa gtagattaag aaaaaagcct gaggaagatt aagaaaaact
    11521 gcttattggg tctttccgtg ttttagatga agcagttgaa attcttcctc ttgatattaa
    11581 atggctacac aacataccca atacccagac gctaggttat catcaccaat tgtattggac
    11641 caatgtgacc tagtcactag agcttgcggg ttatattcat catactccct taatccgcaa
    11701 ctacgcaact gtaaactccc gaaacatatc taccgtttga aatacgatgt aactgttacc
    11761 aagttcttga gtgatgtacc agtggcgaca ttgcccatag atttcatagt cccagttctt
    11821 ctcaaggcac tgtcaggcaa tggattctgt cctgttgagc cgcggtgcca acagttctta
    11881 gatgaaatca ttaagtacac aatgcaagat gctctcttct tgaaatatta tctcaaaaat
    11941 gtgggtgctc aagaagactg tgttgatgaa cactttcaag agaaaatctt atcttcaatt
    12001 cagggcaatg aatttttaca tcaaatgttt ttctggtatg atctggctat tttaactcga
    12061 aggggtagat taaatcgagg aaactctaga tcaacatggt ttgttcatga tgatttaata
    12121 gacatcttag gctatgggga ctatgttttt tggaagatcc caatttcaat gttaccactg
    12181 aacacacaag gaatccccca tgctgctatg gactggtatc aggcatcagt attcaaagaa
    12241 gcggttcaag ggcatacaca cattgtttct gtttctactg ccgacgtctt gataatgtgc
    12301 aaagatttaa ttacatgtcg attcaacaca actctaatct caaaaatagc agagattgag
    12361 gatccagttt gttctgatta tcccaatttt aagattgtgt ctatgcttta ccagagcgga
    12421 gattacttac tctccatatt agggtctgat gggtataaaa ttattaagtt cctcgaacca
    12481 ttgtgcttgg ccaaaattca attatgctca aagtacactg agaggaaggg ccgattctta
    12541 acacaaatgc atttagctgt aaatcacacc ctagaagaaa ttacagaaat gcgtgcacta
    12601 aagccttcac aggctcaaaa gatccgtgaa ttccatagaa cattgataag gctggagatg
    12661 acgccacaac aactttgtga gctattttcc attcaaaaac actgggggca tcctgtgcta
    12721 catagtgaaa cagcaatcca aaaagttaaa aaacatgcta cggtgctaaa agcattacgc
    12781 cctatagtga ttttcgagac atactgtgtt tttaaatata gtattgccaa acattatttt
    12841 gatagtcaag gatcttggta cagtgttact tcagatagga atctaacacc gggtcttaat
    12901 tcttatatca aaagaaatca attccctccg ttgccaatga ttaaagaact actatgggaa
    12961 ttttaccacc ttgaccaccc tccacttttc tcaaccaaaa ttattagtga cttaagtatt
    13021 tttataaaag acagagctac cgcagtagaa aggacatgct gggatgcagt attcgagcct
    13081 aatgttctag gatataatcc acctcacaaa tttagtacta aacgtgtacc ggaacaattt
    13141 ttagagcaag aaaacttttc tattgagaat gttctttcct acgcacaaaa actcgagtat
    13201 ctactaccac aatatcggaa cttttctttc tcattgaaag agaaagagtt gaatgtaggt
    13261 agaaccttcg gaaaattgcc ttatccgact cgcaatgttc aaacactttg tgaagctctg
    13321 ttagctgatg gtcttgctaa agcatttcct agcaatatga tggtagttac ggaacgtgag
    13381 caaaaagaaa gcttattgca tcaagcatca tggcaccaca caagtgatga ttttggtgaa
    13441 catgccacag ttagagggag tagctttgta actgatttag agaaatacaa tcttgcattt
    13501 agatatgagt ttacagcacc ttttatagaa tattgcaacc gttgctatgg tgttaagaat
    13561 gtttttaatt ggatgcatta tacaatccca cagtgttata tgcatgtcag tgattattat
    13621 aatccaccac ataacctcac actggagaat cgagacaacc cccccgaagg gcctagttca
    13681 tacaggggtc atatgggagg gattgaagga ctgcaacaaa aactctggac aagtatttca
    13741 tgtgctcaaa tttctttagt tgaaattaag actggtttta agttacgctc agctgtgatg
    13801 ggtgacaatc agtgcattac tgttttatca gtcttcccct tagagactga cgcagacgag
    13861 caggaacaga gcgccgaaga caatgcagcg agggtggccg ccagcctagc aaaagttaca
    13921 agtgcctgtg gaatcttttt aaaacctgat gaaacatttg tacattcagg ttttatctat
    13981 tttggaaaaa aacaatattt gaatggggtc caattgcctc agtcccttaa aacggctaca
    14041 agaatggcac cattgtctga tgcaattttt gatgatcttc aagggaccct ggctagtata
    14101 ggcactgctt ttgagcgatc catctctgag acacgacata tctttccttg caggataacc
    14161 gcagctttcc atacgttttt ttcggtgaga atcttgcaat atcatcatct cgggttcaat
    14221 aaaggttttg accttggaca gttaacactc ggcaaacctc tggatttcgg aacaatatca
    14281 ttggcactag cggtaccgca ggtgcttgga gggttatcct tcttgaatcc tgagaaatgt
    14341 ttctaccgga atctaggaga tccagttacc tcaggcttat tccagttaaa aacttatctc
    14401 cgaatgattg agatggatga tttattctta cctttaattg cgaagaaccc tgggaactgc
    14461 actgccattg actttgtgct aaatcctagc ggattaaatg tccctgggtc gcaagactta
    14521 acttcatttc tgcgccagat tgtacgcagg accatcaccc taagtgcgaa aaacaaactt
    14581 attaatacct tatttcatgc gtcagctgac ttcgaagacg aaatggtttg taaatggcta
    14641 ttatcatcaa ctcctgttat gagtcgtttt gcggccgata tcttttcacg cacgccgagc
    14701 gggaagcgat tgcaaattct aggatacctg gaaggaacac gcacattatt agcctctaag
    14761 atcatcaaca ataatacaga gacaccggtt ttggacagac tgaggaaaat aacattgcaa
    14821 aggtggagcc tatggtttag ttatcttgat cattgtgata atatcctggc ggaggcttta
    14881 acccaaataa cttgcacagt tgatttagca cagattctga gggaatattc atgggctcat
    14941 attttagagg gaagacctct tattggagcc acactcccat gtatgattga gcaattcaaa
    15001 gtgttttggc tgaaacccta cgaacaatgt ccgcagtgtt caaatgcaaa gcaaccaggt
    15061 gggaaaccat tcgtgtcagt ggcagtcaag aaacatattg ttagtgcatg gccgaacgca
    15121 tcccgaataa gctggactat cggggatgga atcccataca ttggatcaag gacagaagat
    15181 aagataggac aacctgctat taaaccaaaa tgtccttccg cagccttaag agaggccatt
    15241 gaattggcgt cccgtttaac atgggtaact caaggcagtt cgaacagtga cttgctaata
    15301 aaaccatttt tggaagcacg agtaaattta agtgttcaag aaatacttca aatgacccct
    15361 tcacattact caggaaatat tgttcacagg tacaacgatc aatacagtcc tcattctttc
    15421 atggccaatc gtatgagtaa ttcagcaacg cgattgattg tttctacaaa cactttaggt
    15481 gagttttcag gaggtggcca gtctgcacgc gacagcaata ttattttcca gaatgttata
    15541 aattatgcag ttgcactgtt cgatattaaa tttagaaaca ctgaggctac agatatccaa
    15601 tataatcgtg ctcaccttca tctaactaag tgttgcaccc gggaagtacc agctcagtat
    15661 ttaacataca catctacatt ggatttagat ttaacaagat accgagaaaa cgaattgatt
    15721 tatgacagta atcctctaaa aggaggactc aattgcaata tctcattcga taatccattt
    15781 ttccaaggta aacggctgaa cattatagaa gatgatctta ttcgactgcc tcacttatct
    15841 ggatgggagc tagccaagac catcatgcaa tcaattattt cagatagcaa caattcatct
    15901 acagacccaa ttagcagtgg agaaacaaga tcattcacta cccatttctt aacttatccc
    15961 aagataggac ttctgtacag ttttggggcc tttgtaagtt attatcttgg caatacaatt
    16021 cttcggacta agaaattaac acttgacaat tttttatatt acttaactac tcaaattcat
    16081 aatctaccac atcgctcatt gcgaatactt aagccaacat tcaaacatgc aagcgttatg
    16141 tcacggttaa tgagtattga tcctcatttt tctatttaca taggcggtgc tgcaggtgac
    16201 agaggactct cagatgcggc caggttattt ttgagaacgt ccatttcatc ttttcttaca
    16261 tttgtaaaag aatggataat taatcgcgga acaattgtcc ctttatggat agtatatccg
    16321 ctagagggtc aaaacccaac acctgtgaat aattttctct atcagatcgt agaactgctg
    16381 gtgcatgatt catcaagaca acaggctttt aaaactacca taagtgatca tgtacatcct
    16441 cacgacaatc ttgtttacac atgtaagagt acagccagca atttcttcca tgcatcattg
    16501 gcgtactgga ggagcagaca cagaaacagc aaccgaaaat acttggcaag agactcttca
    16561 actggatcaa gcacaaacaa cagtgatggt catattgaga gaagtcaaga acaaaccacc
    16621 agagatccac atgatggcac tgaacggaat ctagtcctac aaatgagcca tgaaataaaa
    16681 agaacgacaa ttccacaaga aaacacgcac cagggtccgt cgttccagtc ctttctaagt
    16741 gactctgctt gtggtacagc aaatccaaaa ctaaatttcg atcgatcgag acacaatgtg
    16801 aaatttcagg atcataactc ggcatccaag agggaaggtc atcaaataat ctcacaccgt
    16861 ctagtcctac ctttctttac attatctcaa gggacacgcc aattaacgtc atccaatgag
    16921 tcacaaaccc aagacgagat atcaaagtac ttacggcaat tgagatccgt cattgatacc
    16981 acagtttatt gtagatttac cggtatagtc tcgtccatgc attacaaact tgatgaggtc
    17041 ctttgggaaa tagagagttt caagtcggct gtgacgctag cagagggaga aggtgctggt
    17101 gccttactat tgattcagaa ataccaagtt aagaccttat ttttcaacac gctagctact
    17161 gagtccagta tagagtcaga aatagtatca ggaatgacta ctcctaggat gcttctacct
    17221 gttatgtcaa aattccataa tgaccaaatt gagattattc ttaacaactc agcaagccaa
    17281 ataacagaca taacaaatcc tacttggttt aaagaccaaa gagcaaggct acctaagcaa
    17341 gtcgaggtta taaccatgga tgcagagaca acagagaata taaacagatc gaaattgtac
    17401 gaagctgtat ataaattgat cttacaccat attgatccta gcgtattgaa agcagtggtc
    17461 cttaaagtct ttctaagtga tactgagggt atgttatggc taaatgataa tttagccccg
    17521 ttttttgcca ctggttattt aattaagcca ataacgtcaa gtgctagatc tagtgagtgg
    17581 tatctttgtc tgacgaactt cttatcaact acacgtaaga tgccacacca aaaccatctc
    17641 agttgtaaac aggtaatact tacggcattg caactgcaaa ttcaacgaag cccatactgg
    17701 ctaagtcatt taactcagta tgctgactgt gagttacatt taagttatat ccgccttggt
    17761 tttccatcat tagagaaagt actataccac aggtataacc tcgtcgattc aaaaagaggt
    17821 ccactagtct ctatcactca gcacttagca catcttagag cagagattcg agaattaact
    17881 aatgattata atcaacagcg acaaagtcgg actcaaacat atcactttat tcgtactgca
    17941 aaaggacgaa tcacaaaact agtcaatgat tatttaaaat tctttcttat tgtgcaagca
    18001 ttaaaacata atgggacatg gcaagctgag tttaagaaat taccagagtt gattagtgtg
    18061 tgcaataggt tctaccatat tagagattgc aattgtgaag aacgtttctt agttcaaacc
    18121 ttatatttac atagaatgca ggattctgaa gttaagctta tcgaaaggct gacagggctt
    18181 ctgagtttat ttccggatgg tctctacagg tttgattgaa ttaccgtgca tagtatcctg
    18241 atacttgcaa aggttggtta ttaacataca gattataaaa aactcataaa ttgctctcat
    18301 acatcatatt gatctaatct caataaacaa ctatttaaat aacgaaagga gtccctatat
    18361 tatatactat atttagcctc tctccctgcg tgataatcaa aaaattcaca atgcagcatg
    18421 tgtgacatat tactgccgca atgaatttaa cgcaacataa taaactctgc actctttata
    18481 attaagcttt aacgaaaggt ctgggctcat attgttattg atataataat gttgtatcaa
    18541 tatcctgtca gatggaatag tgttttggtt gataacacaa cttcttaaaa caaaattgat
    18601 ctttaagatt aagtttttta taattatcat tactttaatt tgtcgtttta aaaacggtga
    18661 tagccttaat ctttgtgtaa aataagagat taggtgtaat aaccttaaca tttttgtcta
    18721 gtaagctact atttcataca gaatgataaa attaaaagaa aaggcaggac tgtaaaatca
    18781 gaaatacctt ctttacaata tagcagacta gataataatc ttcgtgttaa tgataattaa
    18841 gacattgacc acgctcatca gaaggctcgc cagaataaac gttgcaaaaa ggattcctgg
    18901 aaaaatggtc gcacacaaaa atttaaaaat aaatctattt cttctttttt gtgtgtcca

 

//

Ebola Genome

Zaire ebolavirus isolate Ebola virus H.sapiens-tc/COD/1976/Yambuku-Mayinga, complete genome. NCBI Reference Sequence: NC_002549.1. 

http://www.ncbi.nlm.nih.gov/nuccore/10313991?report=graph
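The coordinates in the feature table are 1-based and inclusive, and they index directly into the ORIGIN block of the record. As a minimal sketch of how a coding region can be checked against its /translation qualifier, the Python below parses two ORIGIN lines copied from the record above (positions 10321 to 10440), slices the start of the VP24 CDS (10345..11100), and translates it with the standard genetic code. The helper names `parse_origin`, `slice_feature` and `translate` are illustrative assumptions and not part of any NCBI tool; the sketch assumes contiguous ORIGIN lines and a plain `start..end` location on the given strand.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Two ORIGIN lines copied from the record above (positions 10321-10440).
ORIGIN_LINES = [
    "    10321 ccaagcaaga cctgagaaaa aaccatggct aaagctacgg gacgatacaa tctaatatcg",
    "    10381 cccaaaaagg acctggagaa aggggttgtc ttaagcgacc tctgtaactt cttagttagc",
]


def parse_origin(lines):
    """Return (first_position, sequence) for contiguous ORIGIN lines."""
    first_position = None
    chunks = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue
        if first_position is None:
            first_position = int(fields[0])  # leading number = position of first base
        chunks.extend(fields[1:])            # remaining fields are 10-base groups
    # GenBank writes RNA genomes with DNA letters; normalise case and any U to T.
    return first_position, "".join(chunks).upper().replace("U", "T")


def slice_feature(first_position, sequence, start, end):
    """Extract a 1-based, inclusive start..end range from the parsed fragment."""
    lo = start - first_position
    hi = end - first_position + 1
    if lo < 0 or hi > len(sequence) or lo >= hi:
        raise ValueError("range lies outside the parsed fragment")
    return sequence[lo:hi]


def translate(nucleotides):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nucleotides) - 2, 3):
        amino_acid = CODON_TABLE[nucleotides[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_position, fragment = parse_origin(ORIGIN_LINES)
vp24_start = slice_feature(first_position, fragment, 10345, 10440)
print(translate(vp24_start))  # MAKATGRYNLISPKKDLEKGVVLSDLCNFLVS
```

The printed residues match the beginning of the VP24 /translation qualifier in the record. The same arithmetic applies to the full feature: 10345..11100 spans 756 nucleotides, that is 252 codons, which corresponds to the 251 listed residues plus the terminating stop codon.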

Biosynthesis Inc.

LOCUS       NC_002549              18959 bp    cRNA    linear   VRL 27-AUG-2014
DEFINITION  Zaire ebolavirus isolate Ebola virus
            H.sapiens-tc/COD/1976/Yambuku-Mayinga, complete genome.
ACCESSION   NC_002549
VERSION     NC_002549.1  GI:10313991
DBLINK      BioProject: PRJNA14703
KEYWORDS    RefSeq.
SOURCE      Zaire ebolavirus (ZEBOV)
  ORGANISM  Zaire ebolavirus
            Viruses; ssRNA negative-strand viruses; Mononegavirales;
            Filoviridae; Ebolavirus.
REFERENCE   1  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Volchkova,V.A., Chepurnov,A.A., Blinov,V.M.,
            Dolnik,O., Netesov,S.V. and Feldmann,H.
  TITLE     Characterization of the L gene and 5' trailer region of Ebola virus
  JOURNAL   J. Gen. Virol. 80 (Pt 2), 355-362 (1999)
   PUBMED   10073695
REFERENCE   2  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Volchkova,V.A., Slenczka,W., Klenk,H.D. and
            Feldmann,H.
  TITLE     Release of viral glycoproteins during Ebola virus infection
  JOURNAL   Virology 245 (1), 110-119 (1998)
   PUBMED   9614872
REFERENCE   3  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Feldmann,H., Volchkova,V.A. and Klenk,H.D.
  TITLE     Processing of the Ebola virus glycoprotein by the proprotein
            convertase furin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 95 (10), 5762-5767 (1998)
   PUBMED   9576958
REFERENCE   4  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E., Becker,S., Volchkova,V.A., Ternovoj,V.A.,
            Kotov,A.N., Netesov,S.V. and Klenk,H.D.
  TITLE     GP mRNA of Ebola virus is edited by the Ebola virus polymerase and
            by T7 and vaccinia virus polymerases
  JOURNAL   Virology 214 (2), 421-430 (1995)
   PUBMED   8553543
REFERENCE   5  (bases 1 to 18959)
  AUTHORS   Bukreyev,A.A., Volchkov,V.E., Blinov,V.M. and Netesov,S.V.
  TITLE     The VP35 and VP40 proteins of filoviruses. Homology between Marburg
            and Ebola viruses
  JOURNAL   FEBS Lett. 322 (1), 41-46 (1993)
   PUBMED   8482365
REFERENCE   6  (bases 1 to 18959)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (27-SEP-2000) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   7  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUN-2000) Institute of Virology, Philipps-University
            Marburg, Robert-Koch-Str. 17, Marburg 35037, Germany
  REMARK    Sequence update by submitter
REFERENCE   8  (bases 1 to 18959)
  AUTHORS   Volchkov,V.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Institute of Virology, Philipps-University
            Marburg, Robert-Koch-Str. 17, Marburg 35037, Germany
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence is identical to AF086833.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..18959
                     /organism="Zaire ebolavirus"
                     /mol_type="viral cRNA"
                     /isolate="Ebola virus
                     H.sapiens-tc/COD/1976/Yambuku-Mayinga"
                     /db_xref="taxon:186538"
     5'UTR           1..55
                     /note="putative leader region"
                     /citation=[1]
                     /function="regulation or initiation of RNA replication"
     gene            56..3026
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /db_xref="GeneID:911830"
     mRNA            56..3026
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /product="nucleoprotein"
                     /db_xref="GeneID:911830"
     misc_signal     56..67
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /note="putative; transcription start signal"
                     /citation=[1]
     CDS             470..2689
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /function="encapsidation of genomic RNA"
                     /codon_start=1
                     /product="nucleoprotein"
                     /protein_id="NP_066243.1"
                     /db_xref="GI:10314000"
                     /db_xref="GeneID:911830"
                     /translation="MDSRPQKIWMAPSLTESDMDYHKILTAGLSVQQGIVRQRVIPVY
                     QVNNLEEICQLIIQAFEAGVDFQESADSFLLMLCLHHAYQGDYKLFLESGAVKYLEGH
                     GFRFEVKKRDGVKRLEELLPAVSSGKNIKRTLAAMPEEETTEANAGQFLSFASLFLPK
                     LVVGEKACLEKVQRQIQVHAEQGLIQYPTAWQSVGHMMVIFRLMRTNFLIKFLLIHQG
                     MHMVAGHDANDAVISNSVAQARFSGLLIVKTVLDHILQKTERGVRLHPLARTAKVKNE
                     VNSFKAALSSLAKHGEYAPFARLLNLSGVNNLEHGLFPQLSAIALGVATAHGSTLAGV
                     NVGEQYQQLREAATEAEKQLQQYAESRELDHLGLDDQEKKILMNFHQKKNEISFQQTN
                     AMVTLRKERLAKLTEAITAASLPKTSGHYDDDDDIPFPGPINDDDNPGHQDDDPTDSQ
                     DTTIPDVVVDPDDGSYGEYQSYSENGMNAPDDLVLFDLDEDDEDTKPVPNRSTKGGQQ
                     KNSQKGQHIEGRQTQSRPIQNVPGPHRTIHHASAPLTDNDRRNEPSGSTSPRMLTPIN
                     EEADPLDDADDETSSLPPLESDDEEQDRDGTSNRTPTVAPPAPVYRDHSEKKELPQDE
                     QQDQDHTQEARNQDSDNTQSEHSFEEMYRHILRSQGPFDAVLYYHMMKDEPVVFSTSD
                     GKEYTYPDSLEEEYPPWLTEKEAMNEENRFVTLDGQQFYWPVMNHKNKFMAILQHHQ"
     misc_feature    524..2671
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
                     /note="Ebola nucleoprotein; Region: Ebola_NP; pfam05505"
                     /db_xref="CDD:147601"
     polyA_signal    3015..3026
                     /gene="NP"
                     /locus_tag="ZEBOVgp1"
     misc_feature    3027..3031
                     /note="intergenic region"
     gene            3032..4407
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /db_xref="GeneID:911827"
     mRNA            3032..4407
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /product="VP35"
                     /citation=[5]
                     /db_xref="GeneID:911827"
     misc_signal     3032..3043
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /note="putative; transcription start signal"
                     /citation=[5]
     CDS             3129..4151
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /function="polymerase complex protein"
                     /citation=[5]
                     /codon_start=1
                     /product="polymerase complex protein"
                     /protein_id="NP_066244.1"
                     /db_xref="GI:10313992"
                     /db_xref="GeneID:911827"
                     /translation="MTTRTKGRGHTAATTQNDRMPGPELSGWISEQLMTGRIPVSDIF
                     CDIENNPGLCYASQMQQTKPNPKTRNSQTQTDPICNHSFEEVVQTLASLATVVQQQTI
                     ASESLEQRITSLENGLKPVYDMAKTISSLNRVCAEMVAKYDLLVMTTGRATATAAATE
                     AYWAEHGQPPPGPSLYEESAIRGKIESRDETVPQSVREAFNNLNSTTSLTEENFGKPD
                     ISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCA
                     LIQITKRVPIFQDAAPPVIHIRSRGDIPRACQKSLRPVPPSPKIDRGWVCVFQLQDGK
                     TLGLKI"
     misc_feature    3186..4148
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /note="Filoviridae VP35; Region: Filo_VP35; pfam02097"
                     /db_xref="CDD:145320"
     gene            4390..5894
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /db_xref="GeneID:911825"
     mRNA            4390..5894
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /product="VP40"
                     /citation=[5]
                     /db_xref="GeneID:911825"
     misc_signal     4390..4401
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /note="transcription start signal"
                     /citation=[5]
     polyA_signal    4397..4407
                     /gene="VP35"
                     /locus_tag="ZEBOVgp2"
                     /citation=[5]
     CDS             4479..5459
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /citation=[5]
                     /codon_start=1
                     /product="matrix protein"
                     /protein_id="NP_066245.1"
                     /db_xref="GI:10313993"
                     /db_xref="GeneID:911825"
                     /translation="MRRVILPTAPPEYMEAIYPVRSNSTIARGGNSNTGFLTPESVNG
                     DTPSNPLRPIADDTIDHASHTPGSVSSAFILEAMVNVISGPKVLMKQIPIWLPLGVAD
                     QKTYSFDSTTAAIMLASYTITHFGKATNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQE
                     FVLPPVQLPQYFTFDLTALKLITQPLPAATWTDDTPTGSNGALRPGISFHPKLRPILL
                     PNKSGKKGNSADLTSPEKIQAIMTSLQDFKIVPIDPTKNIMGIEVPETLVHKLTGKKV
                     TSKNGQPIIPVLLPKYIGLDPVAPGDLTMVITQDCDTCHSPASLPAVIEK"
     misc_feature    4479..5363
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /note="Matrix protein VP40; Region: VP40; pfam07447"
                     /db_xref="CDD:116068"
     polyA_signal    5883..5894
                     /gene="VP40"
                     /locus_tag="ZEBOVgp3"
                     /citation=[5]
     misc_feature    5895..5899
                     /note="intergenic region"
     gene            5900..8305
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /db_xref="GeneID:911829"
     mRNA            5900..8305
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /product="sGP"
                     /note="unedited mRNA"
                     /citation=[4]
                     /db_xref="GeneID:911829"
     misc_signal     5900..5911
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="putative; transcription start signal"
                     /citation=[4]
     CDS             join(6039..6923,6923..8068)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /function="receptor binding and fusion"
                     /artificial_location="low-quality sequence region"
                     /note="virion spike glycoprotein precursor; an additional A
                     residue is inserted during transcription; encodes two
                     disulfide linked subunits GP1 and GP2"
                     /citation=[2]
                     /citation=[3]
                     /citation=[4]
                     /codon_start=1
                     /product="spike glycoprotein"
                     /protein_id="NP_066246.1"
                     /db_xref="GI:10313995"
                     /db_xref="GeneID:911829"
                     /translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
                     VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSATKRWGFRSGVPPKVVNYEAG
                     EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
                     LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
                     RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWK
                     VNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTVVSNGAKNISGQSPARTSSDPGTNT
                     TTEDHKIMASENSSAMVQVHSQGREAAVSHLTTLATISTSPQSLTTKPGPDNSTHNTP
                     VYKLDISEATQVEQHHRRTDNDSTASDTPSATTAAGPPKAENTNTSKSTDFLDPATTT
                     SPQNHSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRTRREAIVNAQ
                     PKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYIEGLMHNQDGLICGLRQLANETT
                     QALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKID
                     QIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF"
     misc_feature    7529..7540
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="encodes the glycoprotein cleavage site, precursor
                     GP is cleaved by subtilisin-like cellular protease furin
                     into subunits GP1 and GP2 that are linked by a disulfide
                     bond"
                     /citation=[3]
     misc_feature    7793..7870
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="immunosuppressive motif; other site"
     misc_feature    7988..8053
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="transmembrane anchor; transmembrane region"
     misc_feature    7706..7924
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="heptad repeat 1-heptad repeat 2 region of the
                     transmembrane subunit of Filoviridae viruses, Ebola virus
                     and Marburg virus, and related domains; Region:
                     Ebola-like_HR1-HR2; cd09850"
                     /db_xref="CDD:197367"
     misc_feature    join(6081..6923,6923..7153)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Filovirus glycoprotein; Region: Filo_glycop;
                     pfam01611"
                     /db_xref="CDD:110602"
     misc_feature    7706..7732
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1A; other site"
                     /db_xref="CDD:197367"
     misc_feature    7733..7762
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1B; other site"
                     /db_xref="CDD:197367"
     misc_feature    7763..7783
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1C; other site"
                     /db_xref="CDD:197367"
     misc_feature    7784..7831
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1D; other site"
                     /db_xref="CDD:197367"
     misc_feature    7787..7837
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="immunosuppressive region; other site"
                     /db_xref="CDD:197367"
     misc_feature    order(7838..7858,7859..7861)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="CX(6,7)C motif; other site"
                     /db_xref="CDD:197367"
     misc_feature    7886..7924
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR2; other site"
                     /db_xref="CDD:197367"
     misc_feature    order(7784..7786,7793..7795)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Cl binding site [ion binding]; other site"
                     /db_xref="CDD:197367"
     misc_feature    order(7706..7714,7718..7723,7727..7732,7736..7744,
                     7748..7756,7760..7765,7769..7777,7781..7807,7811..7819,
                     7823..7828,7844..7849,7856..7858,7865..7876,7880..7882,
                     7889..7894,7901..7903,7910..7915,7922..7924)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:197367"
     misc_feature    order(7706..7714,7718..7726,7730..7735,7739..7747,
                     7754..7768,7772..7783,7787..7792,7796..7804,7808..7813,
                     7817..7819)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="HR1-GP1 interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:197367"
     CDS             6039..7133
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="sGP, small non-structural, secreted glycoprotein;
                     sGP is secreted as an anti-parallel oriented homodimer"
                     /citation=[4]
                     /codon_start=1
                     /product="small secreted glycoprotein"
                     /protein_id="NP_066247.1"
                     /db_xref="GI:10313994"
                     /db_xref="GeneID:911829"
                     /translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
                     VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSATKRWGFRSGVPPKVVNYEAG
                     EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
                     LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
                     RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWK
                     VNPEIDTTIGEWAFWETKKTSLEKFAVKSCLSQLYQTEPKTSVVRVRRELLPTQGPTQ
                     QLKTTKSWLQKIPLQWFKCTVKEGKLQCRI"
     misc_feature    6081..7130
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Filovirus glycoprotein; Region: Filo_glycop;
                     pfam01611"
                     /db_xref="CDD:110602"
     CDS             join(6039..6922,6924..6933)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /artificial_location="low-quality sequence region"
                     /note="ssGP; second non-structural secreted glycoprotein;
                     secreted in a monomeric form; one A residue is deleted or
                     two additional A residues are inserted at the editing site
                     during transcription of the GP gene"
                     /citation=[4]
                     /codon_start=1
                     /product="second secreted glycoprotein"
                     /protein_id="NP_066248.1"
                     /db_xref="GI:10313996"
                     /db_xref="GeneID:911829"
                     /translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
                     VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSATKRWGFRSGVPPKVVNYEAG
                     EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
                     LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
                     RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWK
                     VNPEIDTTIGEWAFWETKKPH"
     misc_feature    join(6081..6922,6924..>6924)
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="Filovirus glycoprotein; Region: Filo_glycop;
                     pfam01611"
                     /db_xref="CDD:110602"
     misc_signal     6918..6924
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /note="additional A residues are inserted or deleted
                     during transcription of the GP gene by the viral
                     polymerase"
                     /citation=[4]
                     /function="RNA editing"
     gene            8288..9740
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /db_xref="GeneID:911826"
     mRNA            8288..9740
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /product="VP30"
                     /db_xref="GeneID:911826"
     misc_signal     8288..8299
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="putative; transcription start signal"
     polyA_signal    8295..8305
                     /gene="GP"
                     /locus_tag="ZEBOVgp4"
                     /citation=[4]
     CDS             8509..9375
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="polymerase complex protein"
                     /codon_start=1
                     /product="minor nucleoprotein"
                     /protein_id="NP_066249.1"
                     /db_xref="GI:10313997"
                     /db_xref="GeneID:911826"
                     /translation="MEASYERGRPRAARQHSRDGHDHHVRARSSSRENYRGEYRQSRS
                     ASQVRVPTVFHKKRVEPLTVPPAPKDICPTLKKGFLCDSSFCKKDHQLESLTDRELLL
                     LIARKTCGSVEQQLNITAPKDSRLANPTADDFQQEEGPKITLLTLIKTAEHWARQDIR
                     TIEDSKLRALLTLCAVMTRKFSKSQLSLLCETHLRREGLGQDQAEPVLEVYQRLHSDK
                     GGSFEAALWQQWDRQSLIMFITAFLNIALQLPCESSAVVVSGLRTLVPQSDNEEASTN
                     PGTCSWSDEGTP"
     misc_feature    8932..9321
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="Ebola virus-specific transcription factor VP30;
                     Region: Transcript_VP30; pfam11507"
                     /db_xref="CDD:151944"
     polyA_signal    9730..9740
                     /gene="VP30"
                     /locus_tag="ZEBOVgp5"
                     /note="putative"
     misc_feature    9741..9884
                     /note="intergenic region"
     gene            9885..11518
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="putative"
                     /db_xref="GeneID:911828"
     mRNA            9885..11496
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /product="VP24"
                     /db_xref="GeneID:911828"
     misc_signal     9885..9896
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="transcription start signal"
     CDS             10345..11100
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /codon_start=1
                     /product="membrane-associated protein"
                     /protein_id="NP_066250.1"
                     /db_xref="GI:10313998"
                     /db_xref="GeneID:911828"
                     /translation="MAKATGRYNLISPKKDLEKGVVLSDLCNFLVSQTIQGWKVYWAG
                     IEFDVTHKGMALLHRLKTNDFAPAWSMTRNLFPHLFQNPNSTIESPLWALRVILAAGI
                     QDQLIDQSLIEPLAGALGLISDWLLTTNTNHFNMRTQRVKEQLSLKMLSLIRSNILKF
                     INKLDALHVVNYNGLLSSIEIGTQNHTIIITRTNMGFLVELQEPDKSAMNRMKPGPAK
                     FSLLHESTLKAFTQGSSTRMQSLILEFNSSLAI"
     misc_feature    10369..11040
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="Filovirus membrane-associated protein VP24; Region:
                     Filo_VP24; pfam06389"
                     /db_xref="CDD:253701"
     polyA_signal    11485..11496
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="putative"
     misc_feature    11497..11500
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
                     /note="intergenic region"
     gene            11501..18282
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /db_xref="GeneID:911824"
     mRNA            11501..18282
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /product="polymerase"
                     /citation=[1]
                     /db_xref="GeneID:911824"
     misc_signal     11501..11512
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /note="transcription start signal"
                     /citation=[1]
     polyA_signal    11508..11518
                     /gene="VP24"
                     /locus_tag="ZEBOVgp6"
     CDS             11581..18219
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /function="synthesis of viral RNAs; transcriptional RNA
                     editing"
                     /note="polymerase"
                     /citation=[1]
                     /codon_start=1
                     /product="RNA-dependent RNA polymerase"
                     /protein_id="NP_066251.1"
                     /db_xref="GI:10313999"
                     /db_xref="GeneID:911824"
                     /translation="MATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNC
                     KLPKHIYRLKYDVTVTKFLSDVPVATLPIDFIVPVLLKALSGNGFCPVEPRCQQFLDE
                     IIKYTMQDALFLKYYLKNVGAQEDCVDEHFQEKILSSIQGNEFLHQMFFWYDLAILTR
                     RGRLNRGNSRSTWFVHDDLIDILGYGDYVFWKIPISMLPLNTQGIPHAAMDWYQASVF
                     KEAVQGHTHIVSVSTADVLIMCKDLITCRFNTTLISKIAEIEDPVCSDYPNFKIVSML
                     YQSGDYLLSILGSDGYKIIKFLEPLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEI
                     TEMRALKPSQAQKIREFHRTLIRLEMTPQQLCELFSIQKHWGHPVLHSETAIQKVKKH
                     ATVLKALRPIVIFETYCVFKYSIAKHYFDSQGSWYSVTSDRNLTPGLNSYIKRNQFPP
                     LPMIKELLWEFYHLDHPPLFSTKIISDLSIFIKDRATAVERTCWDAVFEPNVLGYNPP
                     HKFSTKRVPEQFLEQENFSIENVLSYAQKLEYLLPQYRNFSFSLKEKELNVGRTFGKL
                     PYPTRNVQTLCEALLADGLAKAFPSNMMVVTEREQKESLLHQASWHHTSDDFGEHATV
                     RGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKNVFNWMHYTIPQCYMHVSDYYNP
                     PHNLTLENRDNPPEGPSSYRGHMGGIEGLQQKLWTSISCAQISLVEIKTGFKLRSAVM
                     GDNQCITVLSVFPLETDADEQEQSAEDNAARVAASLAKVTSACGIFLKPDETFVHSGF
                     IYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASIGTAFERSISETRHIFP
                     CRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTISLALAVPQVLGGLSF
                     LNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNCTAIDFVLNPSGL
                     NVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWLLSSTPVMSRF
                     AADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQRWSLWFSY
                     LDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFKVFWLKP
                     YEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTEDKIGQ
                     PAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTPSH
                     YSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI
                     NYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENE
                     LIYDSNPLKGGLNCNISFDNPFFQGKRLNIIEDDLIRLPHLSGWELAKTIMQSIISDS
                     NNSSTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYY
                     LTTQIHNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLR
                     TSISSFLTFVKEWIINRGTIVPLWIVYPLEGQNPTPVNNFLYQIVELLVHDSSRQQAF
                     KTTISDHVHPHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKYLARDSSTGSSTNNS
                     DGHIERSQEQTTRDPHDGTERNLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGT
                     ANPKLNFDRSRHNVKFQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQ
                     DEISKYLRQLRSVIDTTVYCRFTGIVSSMHYKLDEVLWEIESFKSAVTLAEGEGAGAL
                     LLIQKYQVKTLFFNTLATESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQ
                     ITDITNPTWFKDQRARLPKQVEVITMDAETTENINRSKLYEAVYKLILHHIDPSVLKA
                     VVLKVFLSDTEGMLWLNDNLAPFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPH
                     QNHLSCKQVILTALQLQIQRSPYWLSHLTQYADCELHLSYIRLGFPSLEKVLYHRYNL
                     VDSKRGPLVSITQHLAHLRAEIRELTNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYL
                     KFFLIVQALKHNGTWQAEFKKLPELISVCNRFYHIRDCNCEERFLVQTLYLHRMQDSE
                     VKLIERLTGLLSLFPDGLYRFD"
     misc_feature    11608..14853
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /note="Mononegavirales RNA dependent RNA polymerase;
                     Region: Mononeg_RNA_pol; pfam00946"
                     /db_xref="CDD:250248"
     misc_feature    15223..18192
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /note="mRNA capping enzyme, paramyxovirus family; Region:
                     paramyx_RNAcap; TIGR04198"
                     /db_xref="CDD:234496"
     polyA_signal    18272..18282
                     /gene="L"
                     /locus_tag="ZEBOVgp7"
                     /citation=[1]
     3'UTR           18283..18959
                     /note="putative trailer region"
                     /citation=[1]
                     /function="regulation or initiation of RNA replication"
ORIGIN      
        1 cggacacaca aaaagaaaga agaattttta ggatcttttg tgtgcgaata actatgagga
       61 agattaataa ttttcctctc attgaaattt atatcggaat ttaaattgaa attgttactg
      121 taatcacacc tggtttgttt cagagccaca tcacaaagat agagaacaac ctaggtctcc
      181 gaagggagca agggcatcag tgtgctcagt tgaaaatccc ttgtcaacac ctaggtctta
      241 tcacatcaca agttccacct cagactctgc agggtgatcc aacaacctta atagaaacat
      301 tattgttaaa ggacagcatt agttcacagt caaacaagca agattgagaa ttaaccttgg
      361 ttttgaactt gaacacttag gggattgaag attcaacaac cctaaagctt ggggtaaaac
      421 attggaaata gttaaaagac aaattgctcg gaatcacaaa attccgagta tggattctcg
      481 tcctcagaaa atctggatgg cgccgagtct cactgaatct gacatggatt accacaagat
      541 cttgacagca ggtctgtccg ttcaacaggg gattgttcgg caaagagtca tcccagtgta
      601 tcaagtaaac aatcttgaag aaatttgcca acttatcata caggcctttg aagcaggtgt
      661 tgattttcaa gagagtgcgg acagtttcct tctcatgctt tgtcttcatc atgcgtacca
      721 gggagattac aaacttttct tggaaagtgg cgcagtcaag tatttggaag ggcacgggtt
      781 ccgttttgaa gtcaagaagc gtgatggagt gaagcgcctt gaggaattgc tgccagcagt
      841 atctagtgga aaaaacatta agagaacact tgctgccatg ccggaagagg agacaactga
      901 agctaatgcc ggtcagtttc tctcctttgc aagtctattc cttccgaaat tggtagtagg
      961 agaaaaggct tgccttgaga aggttcaaag gcaaattcaa gtacatgcag agcaaggact
     1021 gatacaatat ccaacagctt ggcaatcagt aggacacatg atggtgattt tccgtttgat
     1081 gcgaacaaat tttctgatca aatttctcct aatacaccaa gggatgcaca tggttgccgg
     1141 gcatgatgcc aacgatgctg tgatttcaaa ttcagtggct caagctcgtt tttcaggctt
     1201 attgattgtc aaaacagtac ttgatcatat cctacaaaag acagaacgag gagttcgtct
     1261 ccatcctctt gcaaggaccg ccaaggtaaa aaatgaggtg aactccttta aggctgcact
     1321 cagctccctg gccaagcatg gagagtatgc tcctttcgcc cgacttttga acctttctgg
     1381 agtaaataat cttgagcatg gtcttttccc tcaactatcg gcaattgcac tcggagtcgc
     1441 cacagcacac gggagtaccc tcgcaggagt aaatgttgga gaacagtatc aacaactcag
     1501 agaggctgcc actgaggctg agaagcaact ccaacaatat gcagagtctc gcgaacttga
     1561 ccatcttgga cttgatgatc aggaaaagaa aattcttatg aacttccatc agaaaaagaa
     1621 cgaaatcagc ttccagcaaa caaacgctat ggtaactcta agaaaagagc gcctggccaa
     1681 gctgacagaa gctatcactg ctgcgtcact gcccaaaaca agtggacatt acgatgatga
     1741 tgacgacatt ccctttccag gacccatcaa tgatgacgac aatcctggcc atcaagatga
     1801 tgatccgact gactcacagg atacgaccat tcccgatgtg gtggttgatc ccgatgatgg
     1861 aagctacggc gaataccaga gttactcgga aaacggcatg aatgcaccag atgacttggt
     1921 cctattcgat ctagacgagg acgacgagga cactaagcca gtgcctaata gatcgaccaa
     1981 gggtggacaa cagaagaaca gtcaaaaggg ccagcatata gagggcagac agacacaatc
     2041 caggccaatt caaaatgtcc caggccctca cagaacaatc caccacgcca gtgcgccact
     2101 cacggacaat gacagaagaa atgaaccctc cggctcaacc agccctcgca tgctgacacc
     2161 aattaacgaa gaggcagacc cactggacga tgccgacgac gagacgtcta gccttccgcc
     2221 cttggagtca gatgatgaag agcaggacag ggacggaact tccaaccgca cacccactgt
     2281 cgccccaccg gctcccgtat acagagatca ctctgaaaag aaagaactcc cgcaagacga
     2341 gcaacaagat caggaccaca ctcaagaggc caggaaccag gacagtgaca acacccagtc
     2401 agaacactct tttgaggaga tgtatcgcca cattctaaga tcacaggggc catttgatgc
     2461 tgttttgtat tatcatatga tgaaggatga gcctgtagtt ttcagtacca gtgatggcaa
     2521 agagtacacg tatccagact cccttgaaga ggaatatcca ccatggctca ctgaaaaaga
     2581 ggctatgaat gaagagaata gatttgttac attggatggt caacaatttt attggccggt
     2641 gatgaatcac aagaataaat tcatggcaat cctgcaacat catcagtgaa tgagcatgga
     2701 acaatgggat gattcaaccg acaaatagct aacattaagt agtcaaggaa cgaaaacagg
     2761 aagaattttt gatgtctaag gtgtgaatta ttatcacaat aaaagtgatt cttatttttg
     2821 aatttaaagc tagcttatta ttactagccg tttttcaaag ttcaatttga gtcttaatgc
     2881 aaataggcgt taagccacag ttatagccat aattgtaact caatattcta actagcgatt
     2941 tatctaaatt aaattacatt atgcttttat aacttaccta ctagcctgcc caacatttac
     3001 acgatcgttt tataattaag aaaaaactaa tgatgaagat taaaaccttc atcatcctta
     3061 cgtcaattga attctctagc actcgaagct tattgtcttc aatgtaaaag aaaagctggt
     3121 ctaacaagat gacaactaga acaaagggca ggggccatac tgcggccacg actcaaaacg
     3181 acagaatgcc aggccctgag ctttcgggct ggatctctga gcagctaatg accggaagaa
     3241 ttcctgtaag cgacatcttc tgtgatattg agaacaatcc aggattatgc tacgcatccc
     3301 aaatgcaaca aacgaagcca aacccgaaga cgcgcaacag tcaaacccaa acggacccaa
     3361 tttgcaatca tagttttgag gaggtagtac aaacattggc ttcattggct actgttgtgc
     3421 aacaacaaac catcgcatca gaatcattag aacaacgcat tacgagtctt gagaatggtc
     3481 taaagccagt ttatgatatg gcaaaaacaa tctcctcatt gaacagggtt tgtgctgaga
     3541 tggttgcaaa atatgatctt ctggtgatga caaccggtcg ggcaacagca accgctgcgg
     3601 caactgaggc ttattgggcc gaacatggtc aaccaccacc tggaccatca ctttatgaag
     3661 aaagtgcgat tcggggtaag attgaatcta gagatgagac cgtccctcaa agtgttaggg
     3721 aggcattcaa caatctaaac agtaccactt cactaactga ggaaaatttt gggaaacctg
     3781 acatttcggc aaaggatttg agaaacatta tgtatgatca cttgcctggt tttggaactg
     3841 ctttccacca attagtacaa gtgatttgta aattgggaaa agatagcaac tcattggaca
     3901 tcattcatgc tgagttccag gccagcctgg ctgaaggaga ctctcctcaa tgtgccctaa
     3961 ttcaaattac aaaaagagtt ccaatcttcc aagatgctgc tccacctgtc atccacatcc
     4021 gctctcgagg tgacattccc cgagcttgcc agaaaagctt gcgtccagtc ccaccatcgc
     4081 ccaagattga tcgaggttgg gtatgtgttt ttcagcttca agatggtaaa acacttggac
     4141 tcaaaatttg agccaatctc ccttccctcc gaaagaggcg aataatagca gaggcttcaa
     4201 ctgctgaact atagggtacg ttacattaat gatacacttg tgagtatcag ccctggataa
     4261 tataagtcaa ttaaacgacc aagataaaat tgttcatatc tcgctagcag cttaaaatat
     4321 aaatgtaata ggagctatat ctctgacagt attataatca attgttatta agtaacccaa
     4381 accaaaagtg atgaagatta agaaaaacct acctcggctg agagagtgtt ttttcattaa
     4441 ccttcatctt gtaaacgttg agcaaaattg ttaaaaatat gaggcgggtt atattgccta
     4501 ctgctcctcc tgaatatatg gaggccatat accctgtcag gtcaaattca acaattgcta
     4561 gaggtggcaa cagcaataca ggcttcctga caccggagtc agtcaatggg gacactccat
     4621 cgaatccact caggccaatt gccgatgaca ccatcgacca tgccagccac acaccaggca
     4681 gtgtgtcatc agcattcatc cttgaagcta tggtgaatgt catatcgggc cccaaagtgc
     4741 taatgaagca aattccaatt tggcttcctc taggtgtcgc tgatcaaaag acctacagct
     4801 ttgactcaac tacggccgcc atcatgcttg cttcatacac tatcacccat ttcggcaagg
     4861 caaccaatcc acttgtcaga gtcaatcggc tgggtcctgg aatcccggat catcccctca
     4921 ggctcctgcg aattggaaac caggctttcc tccaggagtt cgttcttccg ccagtccaac
     4981 taccccagta tttcaccttt gatttgacag cactcaaact gatcacccaa ccactgcctg
     5041 ctgcaacatg gaccgatgac actccaacag gatcaaatgg agcgttgcgt ccaggaattt
     5101 catttcatcc aaaacttcgc cccattcttt tacccaacaa aagtgggaag aaggggaaca
     5161 gtgccgatct aacatctccg gagaaaatcc aagcaataat gacttcactc caggacttta
     5221 agatcgttcc aattgatcca accaaaaata tcatgggaat cgaagtgcca gaaactctgg
     5281 tccacaagct gaccggtaag aaggtgactt ctaaaaatgg acaaccaatc atccctgttc
     5341 ttttgccaaa gtacattggg ttggacccgg tggctccagg agacctcacc atggtaatca
     5401 cacaggattg tgacacgtgt cattctcctg caagtcttcc agctgtgatt gagaagtaat
     5461 tgcaataatt gactcagatc cagttttata gaatcttctc agggatagtg ataacatcta
     5521 tttagtaatc cgtccattag aggagacact tttaattgat caatatacta aaggtgcttt
     5581 acaccattgt cttttttctc tcctaaatgt agaacttaac aaaagactca taatatactt
     5641 gtttttaaag gattgattga tgaaagatca taactaataa cattacaaat aatcctacta
     5701 taatcaatac ggtgattcaa atgttaatct ttctcattgc acatactttt tgcccttatc
     5761 ctcaaattgc ctgcatgctt acatctgagg atagccagtg tgacttggat tggaaatgtg
     5821 gagaaaaaat cgggacccat ttctaggttg ttcacaatcc aagtacagac attgcccttc
     5881 taattaagaa aaaatcggcg atgaagatta agccgacagt gagcgtaatc ttcatctctc
     5941 ttagattatt tgttttccag agtaggggtc gtcaggtcct tttcaatcgt gtaaccaaaa
     6001 taaactccac tagaaggata ttgtggggca acaacacaat gggcgttaca ggaatattgc
     6061 agttacctcg tgatcgattc aagaggacat cattctttct ttgggtaatt atccttttcc
     6121 aaagaacatt ttccatccca cttggagtca tccacaatag cacattacag gttagtgatg
     6181 tcgacaaact agtttgtcgt gacaaactgt catccacaaa tcaattgaga tcagttggac
     6241 tgaatctcga agggaatgga gtggcaactg acgtgccatc tgcaactaaa agatggggct
     6301 tcaggtccgg tgtcccacca aaggtggtca attatgaagc tggtgaatgg gctgaaaact
     6361 gctacaatct tgaaatcaaa aaacctgacg ggagtgagtg tctaccagca gcgccagacg
     6421 ggattcgggg cttcccccgg tgccggtatg tgcacaaagt atcaggaacg ggaccgtgtg
     6481 ccggagactt tgccttccat aaagagggtg ctttcttcct gtatgatcga cttgcttcca
     6541 cagttatcta ccgaggaacg actttcgctg aaggtgtcgt tgcatttctg atactgcccc
     6601 aagctaagaa ggacttcttc agctcacacc ccttgagaga gccggtcaat gcaacggagg
     6661 acccgtctag tggctactat tctaccacaa ttagatatca ggctaccggt tttggaacca
     6721 atgagacaga gtacttgttc gaggttgaca atttgaccta cgtccaactt gaatcaagat
     6781 tcacaccaca gtttctgctc cagctgaatg agacaatata tacaagtggg aaaaggagca
     6841 ataccacggg aaaactaatt tggaaggtca accccgaaat tgatacaaca atcggggagt
     6901 gggccttctg ggaaactaaa aaaacctcac tagaaaaatt cgcagtgaag agttgtcttt
     6961 cacagttgta tcaaacggag ccaaaaacat cagtggtcag agtccggcgc gaacttcttc
     7021 cgacccaggg accaacacaa caactgaaga ccacaaaatc atggcttcag aaaattcctc
     7081 tgcaatggtt caagtgcaca gtcaaggaag ggaagctgca gtgtcgcatc taacaaccct
     7141 tgccacaatc tccacgagtc cccaatccct cacaaccaaa ccaggtccgg acaacagcac
     7201 ccataataca cccgtgtata aacttgacat ctctgaggca actcaagttg aacaacatca
     7261 ccgcagaaca gacaacgaca gcacagcctc cgacactccc tctgccacga ccgcagccgg
     7321 acccccaaaa gcagagaaca ccaacacgag caagagcact gacttcctgg accccgccac
     7381 cacaacaagt ccccaaaacc acagcgagac cgctggcaac aacaacactc atcaccaaga
     7441 taccggagaa gagagtgcca gcagcgggaa gctaggctta attaccaata ctattgctgg
     7501 agtcgcagga ctgatcacag gcgggagaag aactcgaaga gaagcaattg tcaatgctca
     7561 acccaaatgc aaccctaatt tacattactg gactactcag gatgaaggtg ctgcaatcgg
     7621 actggcctgg ataccatatt tcgggccagc agccgaggga atttacatag aggggctaat
     7681 gcacaatcaa gatggtttaa tctgtgggtt gagacagctg gccaacgaga cgactcaagc
     7741 tcttcaactg ttcctgagag ccacaactga gctacgcacc ttttcaatcc tcaaccgtaa
     7801 ggcaattgat ttcttgctgc agcgatgggg cggcacatgc cacattctgg gaccggactg
     7861 ctgtatcgaa ccacatgatt ggaccaagaa cataacagac aaaattgatc agattattca
     7921 tgattttgtt gataaaaccc ttccggacca gggggacaat gacaattggt ggacaggatg
     7981 gagacaatgg ataccggcag gtattggagt tacaggcgtt ataattgcag ttatcgcttt
     8041 attctgtata tgcaaatttg tcttttagtt tttcttcaga ttgcttcatg gaaaagctca
     8101 gcctcaaatc aatgaaacca ggatttaatt atatggatta cttgaatcta agattacttg
     8161 acaaatgata atataataca ctggagcttt aaacatagcc aatgtgattc taactccttt
     8221 aaactcacag ttaatcataa acaaggtttg acatcaatct agttatctct ttgagaatga
     8281 taaacttgat gaagattaag aaaaaggtaa tctttcgatt atctttaatc ttcatccttg
     8341 attctacaat catgacagtt gtctttagtg acaagggaaa gaagcctttt tattaagttg
     8401 taataatcag atctgcgaac cggtagagtt tagttgcaac ctaacacaca taaagcattg
     8461 gtcaaaaagt caatagaaat ttaaacagtg agtggagaca acttttaaat ggaagcttca
     8521 tatgagagag gacgcccacg agctgccaga cagcattcaa gggatggaca cgaccaccat
     8581 gttcgagcac gatcatcatc cagagagaat tatcgaggtg agtaccgtca atcaaggagc
     8641 gcctcacaag tgcgcgttcc tactgtattt cataagaaga gagttgaacc attaacagtt
     8701 cctccagcac ctaaagacat atgtccgacc ttgaaaaaag gatttttgtg tgacagtagt
     8761 ttttgcaaaa aagatcacca gttggagagt ttaactgata gggaattact cctactaatc
     8821 gcccgtaaga cttgtggatc agtagaacaa caattaaata taactgcacc caaggactcg
     8881 cgcttagcaa atccaacggc tgatgatttc cagcaagagg aaggtccaaa aattaccttg
     8941 ttgacactga tcaagacggc agaacactgg gcgagacaag acatcagaac catagaggat
     9001 tcaaaattaa gagcattgtt gactctatgt gctgtgatga cgaggaaatt ctcaaaatcc
     9061 cagctgagtc ttttatgtga gacacaccta aggcgcgagg ggcttgggca agatcaggca
     9121 gaacccgttc tcgaagtata tcaacgatta cacagtgata aaggaggcag ttttgaagct
     9181 gcactatggc aacaatggga ccgacaatcc ctaattatgt ttatcactgc attcttgaat
     9241 attgctctcc agttaccgtg tgaaagttct gctgtcgttg tttcagggtt aagaacattg
     9301 gttcctcaat cagataatga ggaagcttca accaacccgg ggacatgctc atggtctgat
     9361 gagggtaccc cttaataagg ctgactaaaa cactatataa ccttctactt gatcacaata
     9421 ctccgtatac ctatcatcat atatttaatc aagacgatat cctttaaaac ttattcagta
     9481 ctataatcac tctcgtttca aattaataag atgtgcatga ttgccctaat atatgaagag
     9541 gtatgataca accctaacag tgatcaaaga aaatcataat ctcgtatcgc tcgtaatata
     9601 acctgccaag catacctctt gcacaaagtg attcttgtac acaaataatg ttttactcta
     9661 caggaggtag caacgatcca tcccatcaaa aaataagtat ttcatgactt actaatgatc
     9721 tcttaaaata ttaagaaaaa ctgacggaac ataaattctt tatgcttcaa gctgtggagg
     9781 aggtgtttgg tattggctat tgttatatta caatcaataa caagcttgta aaaatattgt
     9841 tcttgtttca agaggtagat tgtgaccgga aatgctaaac taatgatgaa gattaatgcg
     9901 gaggtctgat aagaataaac cttattattc agattaggcc ccaagaggca ttcttcatct
     9961 ccttttagca aagtactatt tcagggtagt ccaattagtg gcacgtcttt tagctgtata
    10021 tcagtcgccc ctgagatacg ccacaaaagt gtctctaagc taaattggtc tgtacacatc
    10081 ccatacattg tattaggggc aataatatct aattgaactt agccgtttaa aatttagtgc
    10141 ataaatctgg gctaacacca ccaggtcaac tccattggct gaaaagaagc ttacctacaa
    10201 cgaacatcac tttgagcgcc ctcacaatta aaaaatagga acgtcgttcc aacaatcgag
    10261 cgcaaggttt caaggttgaa ctgagagtgt ctagacaaca aaatattgat actccagaca
    10321 ccaagcaaga cctgagaaaa aaccatggct aaagctacgg gacgatacaa tctaatatcg
    10381 cccaaaaagg acctggagaa aggggttgtc ttaagcgacc tctgtaactt cttagttagc
    10441 caaactattc aggggtggaa ggtttattgg gctggtattg agtttgatgt gactcacaaa
    10501 ggaatggccc tattgcatag actgaaaact aatgactttg cccctgcatg gtcaatgaca
    10561 aggaatctct ttcctcattt atttcaaaat ccgaattcca caattgaatc accgctgtgg
    10621 gcattgagag tcatccttgc agcagggata caggaccagc tgattgacca gtctttgatt
    10681 gaacccttag caggagccct tggtctgatc tctgattggc tgctaacaac caacactaac
    10741 catttcaaca tgcgaacaca acgtgtcaag gaacaattga gcctaaaaat gctgtcgttg
    10801 attcgatcca atattctcaa gtttattaac aaattggatg ctctacatgt cgtgaactac
    10861 aacggattgt tgagcagtat tgaaattgga actcaaaatc atacaatcat cataactcga
    10921 actaacatgg gttttctggt ggagctccaa gaacccgaca aatcggcaat gaaccgcatg
    10981 aagcctgggc cggcgaaatt ttccctcctt catgagtcca cactgaaagc atttacacaa
    11041 ggatcctcga cacgaatgca aagtttgatt cttgaattta atagctctct tgctatctaa
    11101 ctaaggtaga atacttcata ttgagctaac tcatatatgc tgactcaata gttatcttga
    11161 catctctgct ttcataatca gatatataag cataataaat aaatactcat atttcttgat
    11221 aatttgttta accacagata aatcctcact gtaagccagc ttccaagttg acacccttac
    11281 aaaaaccagg actcagaatc cctcaaacaa gagattccaa gacaacatca tagaattgct
    11341 ttattatatg aataagcatt ttatcaccag aaatcctata tactaaatgg ttaattgtaa
    11401 ctgaacccgc aggtcacatg tgttaggttt cacagattct atatattact aactctatac
    11461 tcgtaattaa cattagataa gtagattaag aaaaaagcct gaggaagatt aagaaaaact
    11521 gcttattggg tctttccgtg ttttagatga agcagttgaa attcttcctc ttgatattaa
    11581 atggctacac aacataccca atacccagac gctaggttat catcaccaat tgtattggac
    11641 caatgtgacc tagtcactag agcttgcggg ttatattcat catactccct taatccgcaa
    11701 ctacgcaact gtaaactccc gaaacatatc taccgtttga aatacgatgt aactgttacc
    11761 aagttcttga gtgatgtacc agtggcgaca ttgcccatag atttcatagt cccagttctt
    11821 ctcaaggcac tgtcaggcaa tggattctgt cctgttgagc cgcggtgcca acagttctta
    11881 gatgaaatca ttaagtacac aatgcaagat gctctcttct tgaaatatta tctcaaaaat
    11941 gtgggtgctc aagaagactg tgttgatgaa cactttcaag agaaaatctt atcttcaatt
    12001 cagggcaatg aatttttaca tcaaatgttt ttctggtatg atctggctat tttaactcga
    12061 aggggtagat taaatcgagg aaactctaga tcaacatggt ttgttcatga tgatttaata
    12121 gacatcttag gctatgggga ctatgttttt tggaagatcc caatttcaat gttaccactg
    12181 aacacacaag gaatccccca tgctgctatg gactggtatc aggcatcagt attcaaagaa
    12241 gcggttcaag ggcatacaca cattgtttct gtttctactg ccgacgtctt gataatgtgc
    12301 aaagatttaa ttacatgtcg attcaacaca actctaatct caaaaatagc agagattgag
    12361 gatccagttt gttctgatta tcccaatttt aagattgtgt ctatgcttta ccagagcgga
    12421 gattacttac tctccatatt agggtctgat gggtataaaa ttattaagtt cctcgaacca
    12481 ttgtgcttgg ccaaaattca attatgctca aagtacactg agaggaaggg ccgattctta
    12541 acacaaatgc atttagctgt aaatcacacc ctagaagaaa ttacagaaat gcgtgcacta
    12601 aagccttcac aggctcaaaa gatccgtgaa ttccatagaa cattgataag gctggagatg
    12661 acgccacaac aactttgtga gctattttcc attcaaaaac actgggggca tcctgtgcta
    12721 catagtgaaa cagcaatcca aaaagttaaa aaacatgcta cggtgctaaa agcattacgc
    12781 cctatagtga ttttcgagac atactgtgtt tttaaatata gtattgccaa acattatttt
    12841 gatagtcaag gatcttggta cagtgttact tcagatagga atctaacacc gggtcttaat
    12901 tcttatatca aaagaaatca attccctccg ttgccaatga ttaaagaact actatgggaa
    12961 ttttaccacc ttgaccaccc tccacttttc tcaaccaaaa ttattagtga cttaagtatt
    13021 tttataaaag acagagctac cgcagtagaa aggacatgct gggatgcagt attcgagcct
    13081 aatgttctag gatataatcc acctcacaaa tttagtacta aacgtgtacc ggaacaattt
    13141 ttagagcaag aaaacttttc tattgagaat gttctttcct acgcacaaaa actcgagtat
    13201 ctactaccac aatatcggaa cttttctttc tcattgaaag agaaagagtt gaatgtaggt
    13261 agaaccttcg gaaaattgcc ttatccgact cgcaatgttc aaacactttg tgaagctctg
    13321 ttagctgatg gtcttgctaa agcatttcct agcaatatga tggtagttac ggaacgtgag
    13381 caaaaagaaa gcttattgca tcaagcatca tggcaccaca caagtgatga ttttggtgaa
    13441 catgccacag ttagagggag tagctttgta actgatttag agaaatacaa tcttgcattt
    13501 agatatgagt ttacagcacc ttttatagaa tattgcaacc gttgctatgg tgttaagaat
    13561 gtttttaatt ggatgcatta tacaatccca cagtgttata tgcatgtcag tgattattat
    13621 aatccaccac ataacctcac actggagaat cgagacaacc cccccgaagg gcctagttca
    13681 tacaggggtc atatgggagg gattgaagga ctgcaacaaa aactctggac aagtatttca
    13741 tgtgctcaaa tttctttagt tgaaattaag actggtttta agttacgctc agctgtgatg
    13801 ggtgacaatc agtgcattac tgttttatca gtcttcccct tagagactga cgcagacgag
    13861 caggaacaga gcgccgaaga caatgcagcg agggtggccg ccagcctagc aaaagttaca
    13921 agtgcctgtg gaatcttttt aaaacctgat gaaacatttg tacattcagg ttttatctat
    13981 tttggaaaaa aacaatattt gaatggggtc caattgcctc agtcccttaa aacggctaca
    14041 agaatggcac cattgtctga tgcaattttt gatgatcttc aagggaccct ggctagtata
    14101 ggcactgctt ttgagcgatc catctctgag acacgacata tctttccttg caggataacc
    14161 gcagctttcc atacgttttt ttcggtgaga atcttgcaat atcatcatct cgggttcaat
    14221 aaaggttttg accttggaca gttaacactc ggcaaacctc tggatttcgg aacaatatca
    14281 ttggcactag cggtaccgca ggtgcttgga gggttatcct tcttgaatcc tgagaaatgt
    14341 ttctaccgga atctaggaga tccagttacc tcaggcttat tccagttaaa aacttatctc
    14401 cgaatgattg agatggatga tttattctta cctttaattg cgaagaaccc tgggaactgc
    14461 actgccattg actttgtgct aaatcctagc ggattaaatg tccctgggtc gcaagactta
    14521 acttcatttc tgcgccagat tgtacgcagg accatcaccc taagtgcgaa aaacaaactt
    14581 attaatacct tatttcatgc gtcagctgac ttcgaagacg aaatggtttg taaatggcta
    14641 ttatcatcaa ctcctgttat gagtcgtttt gcggccgata tcttttcacg cacgccgagc
    14701 gggaagcgat tgcaaattct aggatacctg gaaggaacac gcacattatt agcctctaag
    14761 atcatcaaca ataatacaga gacaccggtt ttggacagac tgaggaaaat aacattgcaa
    14821 aggtggagcc tatggtttag ttatcttgat cattgtgata atatcctggc ggaggcttta
    14881 acccaaataa cttgcacagt tgatttagca cagattctga gggaatattc atgggctcat
    14941 attttagagg gaagacctct tattggagcc acactcccat gtatgattga gcaattcaaa
    15001 gtgttttggc tgaaacccta cgaacaatgt ccgcagtgtt caaatgcaaa gcaaccaggt
    15061 gggaaaccat tcgtgtcagt ggcagtcaag aaacatattg ttagtgcatg gccgaacgca
    15121 tcccgaataa gctggactat cggggatgga atcccataca ttggatcaag gacagaagat
    15181 aagataggac aacctgctat taaaccaaaa tgtccttccg cagccttaag agaggccatt
    15241 gaattggcgt cccgtttaac atgggtaact caaggcagtt cgaacagtga cttgctaata
    15301 aaaccatttt tggaagcacg agtaaattta agtgttcaag aaatacttca aatgacccct
    15361 tcacattact caggaaatat tgttcacagg tacaacgatc aatacagtcc tcattctttc
    15421 atggccaatc gtatgagtaa ttcagcaacg cgattgattg tttctacaaa cactttaggt
    15481 gagttttcag gaggtggcca gtctgcacgc gacagcaata ttattttcca gaatgttata
    15541 aattatgcag ttgcactgtt cgatattaaa tttagaaaca ctgaggctac agatatccaa
    15601 tataatcgtg ctcaccttca tctaactaag tgttgcaccc gggaagtacc agctcagtat
    15661 ttaacataca catctacatt ggatttagat ttaacaagat accgagaaaa cgaattgatt
    15721 tatgacagta atcctctaaa aggaggactc aattgcaata tctcattcga taatccattt
    15781 ttccaaggta aacggctgaa cattatagaa gatgatctta ttcgactgcc tcacttatct
    15841 ggatgggagc tagccaagac catcatgcaa tcaattattt cagatagcaa caattcatct
    15901 acagacccaa ttagcagtgg agaaacaaga tcattcacta cccatttctt aacttatccc
    15961 aagataggac ttctgtacag ttttggggcc tttgtaagtt attatcttgg caatacaatt
    16021 cttcggacta agaaattaac acttgacaat tttttatatt acttaactac tcaaattcat
    16081 aatctaccac atcgctcatt gcgaatactt aagccaacat tcaaacatgc aagcgttatg
    16141 tcacggttaa tgagtattga tcctcatttt tctatttaca taggcggtgc tgcaggtgac
    16201 agaggactct cagatgcggc caggttattt ttgagaacgt ccatttcatc ttttcttaca
    16261 tttgtaaaag aatggataat taatcgcgga acaattgtcc ctttatggat agtatatccg
    16321 ctagagggtc aaaacccaac acctgtgaat aattttctct atcagatcgt agaactgctg
    16381 gtgcatgatt catcaagaca acaggctttt aaaactacca taagtgatca tgtacatcct
    16441 cacgacaatc ttgtttacac atgtaagagt acagccagca atttcttcca tgcatcattg
    16501 gcgtactgga ggagcagaca cagaaacagc aaccgaaaat acttggcaag agactcttca
    16561 actggatcaa gcacaaacaa cagtgatggt catattgaga gaagtcaaga acaaaccacc
    16621 agagatccac atgatggcac tgaacggaat ctagtcctac aaatgagcca tgaaataaaa
    16681 agaacgacaa ttccacaaga aaacacgcac cagggtccgt cgttccagtc ctttctaagt
    16741 gactctgctt gtggtacagc aaatccaaaa ctaaatttcg atcgatcgag acacaatgtg
    16801 aaatttcagg atcataactc ggcatccaag agggaaggtc atcaaataat ctcacaccgt
    16861 ctagtcctac ctttctttac attatctcaa gggacacgcc aattaacgtc atccaatgag
    16921 tcacaaaccc aagacgagat atcaaagtac ttacggcaat tgagatccgt cattgatacc
    16981 acagtttatt gtagatttac cggtatagtc tcgtccatgc attacaaact tgatgaggtc
    17041 ctttgggaaa tagagagttt caagtcggct gtgacgctag cagagggaga aggtgctggt
    17101 gccttactat tgattcagaa ataccaagtt aagaccttat ttttcaacac gctagctact
    17161 gagtccagta tagagtcaga aatagtatca ggaatgacta ctcctaggat gcttctacct
    17221 gttatgtcaa aattccataa tgaccaaatt gagattattc ttaacaactc agcaagccaa
    17281 ataacagaca taacaaatcc tacttggttt aaagaccaaa gagcaaggct acctaagcaa
    17341 gtcgaggtta taaccatgga tgcagagaca acagagaata taaacagatc gaaattgtac
    17401 gaagctgtat ataaattgat cttacaccat attgatccta gcgtattgaa agcagtggtc
    17461 cttaaagtct ttctaagtga tactgagggt atgttatggc taaatgataa tttagccccg
    17521 ttttttgcca ctggttattt aattaagcca ataacgtcaa gtgctagatc tagtgagtgg
    17581 tatctttgtc tgacgaactt cttatcaact acacgtaaga tgccacacca aaaccatctc
    17641 agttgtaaac aggtaatact tacggcattg caactgcaaa ttcaacgaag cccatactgg
    17701 ctaagtcatt taactcagta tgctgactgt gagttacatt taagttatat ccgccttggt
    17761 tttccatcat tagagaaagt actataccac aggtataacc tcgtcgattc aaaaagaggt
    17821 ccactagtct ctatcactca gcacttagca catcttagag cagagattcg agaattaact
    17881 aatgattata atcaacagcg acaaagtcgg actcaaacat atcactttat tcgtactgca
    17941 aaaggacgaa tcacaaaact agtcaatgat tatttaaaat tctttcttat tgtgcaagca
    18001 ttaaaacata atgggacatg gcaagctgag tttaagaaat taccagagtt gattagtgtg
    18061 tgcaataggt tctaccatat tagagattgc aattgtgaag aacgtttctt agttcaaacc
    18121 ttatatttac atagaatgca ggattctgaa gttaagctta tcgaaaggct gacagggctt
    18181 ctgagtttat ttccggatgg tctctacagg tttgattgaa ttaccgtgca tagtatcctg
    18241 atacttgcaa aggttggtta ttaacataca gattataaaa aactcataaa ttgctctcat
    18301 acatcatatt gatctaatct caataaacaa ctatttaaat aacgaaagga gtccctatat
    18361 tatatactat atttagcctc tctccctgcg tgataatcaa aaaattcaca atgcagcatg
    18421 tgtgacatat tactgccgca atgaatttaa cgcaacataa taaactctgc actctttata
    18481 attaagcttt aacgaaaggt ctgggctcat attgttattg atataataat gttgtatcaa
    18541 tatcctgtca gatggaatag tgttttggtt gataacacaa cttcttaaaa caaaattgat
    18601 ctttaagatt aagtttttta taattatcat tactttaatt tgtcgtttta aaaacggtga
    18661 tagccttaat ctttgtgtaa aataagagat taggtgtaat aaccttaaca tttttgtcta
    18721 gtaagctact atttcataca gaatgataaa attaaaagaa aaggcaggac tgtaaaatca
    18781 gaaatacctt ctttacaata tagcagacta gataataatc ttcgtgttaa tgataattaa
    18841 gacattgacc acgctcatca gaaggctcgc cagaataaac gttgcaaaaa ggattcctgg
    18901 aaaaatggtc gcacacaaaa atttaaaaat aaatctattt cttctttttt gtgtgtcca
//
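
The listing above follows the GenBank flat-file layout: each line begins with the 1-based coordinate of its first base, followed by space-separated blocks of up to ten bases, and the record ends with `//`. To design primers or probes from this text, the lines first have to be joined into one continuous string. The following is a minimal Python sketch of that step, assuming only the layout just described. The function name `parse_origin_lines` is hypothetical and not part of any library, and the two sample lines are copied from the end of the listing.

```python
def parse_origin_lines(lines):
    """Join GenBank-style sequence lines into (start_coordinate, sequence).

    Assumes each data line is '<coordinate> <block> <block> ...' and that
    the coordinate is the 1-based position of the first base on the line.
    """
    start = None      # coordinate of the first base seen
    expected = None   # coordinate the next line should start at
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line or line == "//":   # skip blanks and the record terminator
            continue
        fields = line.split()
        position = int(fields[0])
        bases = "".join(fields[1:]).lower()
        if start is None:
            start = position
        elif position != expected:
            # A mismatch means a line was lost or damaged during copying.
            raise ValueError(f"expected coordinate {expected}, found {position}")
        chunks.append(bases)
        expected = position + len(bases)
    return start, "".join(chunks)


# The last two sequence lines of the listing, plus the terminator.
sample = [
    "18841 gacattgacc acgctcatca gaaggctcgc cagaataaac gttgcaaaaa ggattcctgg",
    "18901 aaaaatggtc gcacacaaaa atttaaaaat aaatctattt cttctttttt gtgtgtcca",
    "//",
]
start, seq = parse_origin_lines(sample)
end = start + len(seq) - 1   # coordinate of the final base
print(start, end, len(seq))
```

Checking each line's coordinate against the running base count is a cheap way to catch lines dropped or duplicated when the listing is copied from a web page. For real work, an established parser such as Biopython's `SeqIO` module on the complete GenBank record is likely the safer choice.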