LOCUS       NZ_CP024199            20846 bp    DNA     circular CON 03-MAY-2024
DEFINITION  Thalassospira marina strain CSC3H3 chromosome, complete genome.
ACCESSION   NZ_CP024199
VERSION     NZ_CP024199.1
KEYWORDS    RefSeq.
SOURCE      Thalassospira marina
  ORGANISM  Thalassospira marina
            Bacteria; Pseudomonadota; Alphaproteobacteria; Rhodospirillales;
            Thalassospiraceae; Thalassospira.
REFERENCE   1  (bases 1 to 4530245)
  AUTHORS   Dong,C., Liu,R. and Shao,Z.
  TITLE     Biodiversity and function of Thalassospira species in the
            particle-attached aromatic-hydrocarbon-degrading consortia from the
            surface seawater of the China South Sea
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 4530245)
  AUTHORS   Dong,C., Liu,R. and Shao,Z.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-OCT-2017) State Oceanic Administration, The Third
            Institute, Room 329, Daxue Road 184, Xiamen, Fujian 361005, P.R.
            China
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method                   :: SMRT Analysis v. 2.3
            Genome Coverage                   :: 300
            Sequencing Technology             :: PacBio
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI RefSeq
            Annotation Name                   :: GCF_002844375.1-RS_2024_05_03
            Annotation Date                   :: 05/03/2024 04:06:34
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            (PGAP)
            Annotation Method :: Best-placed reference protein set; GeneMarkS-2+
            Annotation Software revision      :: 6.7
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA
            Genes (total)                     :: 4,760
            CDSs (total)                      :: 4,670
            Genes (coding)                    :: 4,656
            CDSs (with protein)               :: 4,656
            Genes (RNA)                       :: 90
            rRNAs                             :: 7, 6, 6 (5S, 16S, 23S)
            complete rRNAs                    :: 7, 6, 6 (5S, 16S, 23S)
            tRNAs                             :: 67
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 14
            CDSs (without protein)            :: 14
            Pseudo Genes (ambiguous residues) :: 0 of 14
            Pseudo Genes (frameshifted)       :: 2 of 14
            Pseudo Genes (incomplete)         :: 13 of 14
            Pseudo Genes (internal stop)      :: 4 of 14
            Pseudo Genes (multiple problems)  :: 4 of 14
            ##Genome-Annotation-Data-END##
            ##antiSMASH-Data-START##
            Version                           :: 8.dev-cf2fc5ee(changed)
            Run date                          :: 2025-09-12 22:03:19
            NOTE :: This is a single region extracted from a larger record!
            Orig. start                       :: 2014066
            Orig. end                         :: 2034912
            ##antiSMASH-Data-END##
            REFSEQ INFORMATION: The reference sequence is identical to
            CP024199.1.
            Bacteria and source DNA available from Dr. Zongze Shao; The Third
            Institute of State Oceanic Administration, Daxue Road 184#, Xiamen,
            Fujian, 361005, China, Tel: 86-592-2195321.
            The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     region          1..20846
                     /candidate_cluster_numbers="1"
                     /contig_edge="False"
                     /product="terpene"
                     /region_number="3"
                     /rules="(PT_phytoene_like or phytoene_synt or Lycopene_cycl
                     or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA
                     or TS_Pyr4)"
                     /tool="antismash"
     cand_cluster    1..20846
                     /candidate_cluster_number="1"
                     /contig_edge="False"
                     /detection_rules="(PT_phytoene_like or phytoene_synt or
                     Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or
                     T2TS or TS_UbiA or TS_Pyr4)"
                     /kind="single"
                     /product="terpene"
                     /protoclusters="1"
                     /tool="antismash"
     protocluster    1..20846
                     /aStool="rule-based-clusters"
                     /category="terpene"
                     /contig_edge="False"
                     /core_location="[10000:10846](+)"
                     /cutoff="20000"
                     /detection_rule="(PT_phytoene_like or phytoene_synt or
                     Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or
                     T2TS or TS_UbiA or TS_Pyr4)"
                     /neighbourhood="10000"
                     /product="terpene"
                     /protocluster_number="1"
                     /tool="antismash"
     proto_core      10001..10846
                     /aStool="rule-based-clusters"
                     /tool="antismash"
                     /cutoff="20000"
                     /detection_rule="(PT_phytoene_like or phytoene_synt or
                     Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or
                     T2TS or TS_UbiA or TS_Pyr4)"
                     /neighbourhood="10000"
                     /product="terpene"
                     /protocluster_number="1"
     gene            1026..1793
                     /gene="surE"
                     /locus_tag="CSC3H3_RS09200"
                     /old_locus_tag="CSC3H3_09195"
     CDS             1026..1793
                     /EC_number="3.1.3.6"
                     /GO_function="GO:0008252 - nucleotidase activity [Evidence
                     IEA]"
                     /codon_start=1
                     /gene="surE"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089629.1"
                     /locus_tag="CSC3H3_RS09200"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09195"
                     /product="5'/3'-nucleotidase SurE"
                     /protein_id="WP_101284648.1"
                     /transl_table=11
                     /translation="MIDLHNARILICNDDGIDAPGIKLLESLARQFSDDVWVVAPSIEQ
                     SGAGHSLTLRRPLRIHQRDERHFAVDGTPTDCILLALQVIMRETPPDIVFSGINRGGNL
                     GEDVTYSGTVAAAMESALLNVPAIAFSQYFSQKMISWDIARAHLTNVVTSLVKEEWPRN
                     VLINVNFPEKEREGGAKIRITRQGQRKIGDHVAERQDPHGEPYYWIGAIRSELPEDENA
                     DLHVIEKGDISVTPIGLDFTDNVTLEKLTKAFQ"
     gene            1805..2443
                     /locus_tag="CSC3H3_RS09205"
                     /old_locus_tag="CSC3H3_09200"
     CDS             1805..2443
                     /EC_number="2.1.1.77"
                     /GO_function="GO:0004719 - protein-L-isoaspartate
                     (D-aspartate) O-methyltransferase activity [Evidence IEA]"
                     /GO_process="GO:0036211 - protein modification process
                     [Evidence IEA]"
                     /codon_start=1
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) PCMT"
                     /gene_kind="biosynthetic-additional"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011384900.1"
                     /locus_tag="CSC3H3_RS09205"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09200"
                     /product="protein-L-isoaspartate(D-aspartate)
                     O-methyltransferase"
                     /protein_id="WP_172963462.1"
                     /sec_met_domain="PCMT (E-value: 7.5e-61, bitscore: 203.4,
                     seeds: 9, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MARAPEIDALLETLRQDGIHDEAVLKALADIPRELFVAEPFASRA
                     WENVALPISKGQTISQPYIVALMTQALQLNDRMKVLEVGTGSGYQAAILARLARRVYTL
                     ERHKPLLREAEDRFRKLGLHTISTLHADGGLGWKAQAPFDRIIVTASAQDVPPVLVDQL
                     AIGGIMVVPVGEVSHTQILLRVMRTPTGIDVTELMPVRFVPMLAGTEEF"
     gene            2514..3665
                     /locus_tag="CSC3H3_RS09210"
                     /old_locus_tag="CSC3H3_09205"
     CDS             2514..3665
                     /codon_start=1
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008890994.1"
                     /locus_tag="CSC3H3_RS09210"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09205"
                     /product="peptidoglycan DD-metalloendopeptidase family
                     protein"
                     /protein_id="WP_245881352.1"
                     /transl_table=11
                     /translation="MMKNYVKHVVHNVRTLLFVCGTTAALSGCLLNSKPVPVVYDEGTN
                     YSLSAMPELRNGERVITVKRGDSVSLLAERYHADFRKFAARNSLSSPYVIYPGQRLVLP
                     PWQAEYDSRPPETAVAQSSGNSRATKNVVIAGRAQPVEITSAPLDAPSSAASQPASAAP
                     ASDLSPVTDITSSKSTVIAGANLPVPGIKPQDIAAQAERQFASGNSWRNTTGTKVASSS
                     PTSTGGSAAVVPSSAASHPSRVKVDDIVPAGRQGFIWPVQGKVVLGYGAGPGGLFNDGI
                     NIAAERGTAILATDNGVVTYVGNELRGFGNLILIKHADGYVSAYAHTENPLVARGDVVS
                     RGQRISSVGATGAVNRPQLHFEIRQGRKSRDPVKYLPQAVASR"
     gene            complement(3749..4612)
                     /locus_tag="CSC3H3_RS09215"
                     /old_locus_tag="CSC3H3_09210"
     CDS             complement(3749..4612)
                     /codon_start=1
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009539876.1"
                     /locus_tag="CSC3H3_RS09215"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09210"
                     /product="ATP-binding protein"
                     /protein_id="WP_101286174.1"
                     /transl_table=11
                     /translation="MTDTLADLLPTLSRIADALERMAPPARQQNDLSVADAFVWHAETG
                     YLQPVKQVNRVDIGLLQGIEQQKRILYNNTKRFADGLPANNALLWGARGTGKSSIVKAM
                     HALINMETPSSLALVEIHREDIPTLPVLLKILQDSGRKTVLFCDDLSFDLQDESYKSLK
                     AVLDGGIEGRPANVIFYATSNRRHLMARDMIENERSTAINPSEAVEEKVSLSDRFGLWL
                     GFHNISQDVYFEIIAGYAKEYDLDIAEDELFARAKEWTVTRGSRSGRVAWQFIQEIAGE
                     QGKHIA"
     misc_feature    complement(4728..4743)
                     /note="TFBS match to CRP, Cyclic AMP receptor protein,
                     confidence: strong, score: 19.26"
                     /tool="antismash"
     misc_feature    4896..4914
                     /note="TFBS match to DmdR1, Iron(II)-dependent repressor,
                     confidence: weak, score: 24.44"
                     /tool="antismash"
     gene            4914..5303
                     /gene="yajC"
                     /locus_tag="CSC3H3_RS09220"
                     /old_locus_tag="CSC3H3_09215"
     CDS             4914..5303
                     /GO_component="GO:0005886 - plasma membrane [Evidence IEA]"
                     /GO_function="GO:0015450 - protein-transporting ATPase
                     activity [Evidence IEA]"
                     /GO_process="GO:0009306 - protein secretion [Evidence IEA]"
                     /codon_start=1
                     /gene="yajC"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089625.1"
                     /locus_tag="CSC3H3_RS09220"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09215"
                     /product="preprotein translocase subunit YajC"
                     /protein_id="WP_101265798.1"
                     /transl_table=11
                     /translation="MLISPAFAQGAGGAAGGADMLTSFLPLILIFVVFYFLLIRPQQKK
                     QKEHKAMLAAVRRGDKIVTAGGIIGTVAKVVNDDEVSVEIADGVKVKVARGMISNVLSR
                     TEPAKGGSDAEDKADEKKTEEVKKD"
     gene            5451..7022
                     /gene="secD"
                     /locus_tag="CSC3H3_RS09225"
                     /old_locus_tag="CSC3H3_09220"
     CDS             5451..7022
                     /GO_component="GO:0031522 - cell envelope Sec protein
                     transport complex [Evidence IEA]"
                     /GO_function="GO:0015450 - protein-transporting ATPase
                     activity [Evidence IEA]"
                     /GO_process="GO:0043952 - protein transport by the Sec
                     complex [Evidence IEA]"
                     /codon_start=1
                     /gene="secD"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089624.1"
                     /locus_tag="CSC3H3_RS09225"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09220"
                     /product="protein translocase subunit SecD"
                     /protein_id="WP_101265796.1"
                     /transl_table=11
                     /translation="MLYFTKWKATLVLLVCALGVIYAAPNFLPKGTFPTESSWMPGKQI
                     SLGLDLRGGSHMLLEVGVDSVIRERYDSLTENMRDALREQRIRYRGLRGSADGASVTIV
                     SDDDVEKARSLMAKLDPLTVVGGEGNHIMLTYNEQTRREMVDRAISQSLEVVRRRVDEL
                     GTTEPSIQRQGEDRIVVQVPGLDDPSRLRDILGRTAKMNFHLINETKTPQQAKATGIPP
                     GAVIMPGADNASPGEPEEYLVDRRVVVSGDNLIDSQPTFNDGRPVVSFRFDAAGGARFG
                     DVTSKNTGRRLAIVLDGKVISAPRINEPIMGGSGIITGQFGVQEANDLSLLLRAGALPA
                     PMKILEERSVGPGLGQDSIDSGEIAAVLGMVFVVAFMAVSYGLFGIFANLALIINLVLI
                     LAIMSVLQATLTLPGIAGIVLTVGMAVDANVLIFERIREERKIGRSIISSIDAGYRSAM
                     STILDANITTLIAALVLFSFGSGPIKGFAVTLAIGIITSMFAAIWVTRLIVAFWVKHRR
                     PTELVL"
     gene            7042..7980
                     /gene="secF"
                     /locus_tag="CSC3H3_RS09230"
                     /old_locus_tag="CSC3H3_09225"
     CDS             7042..7980
                     /GO_component="GO:0031522 - cell envelope Sec protein
                     transport complex [Evidence IEA]"
                     /GO_function="GO:0015450 - protein-transporting ATPase
                     activity [Evidence IEA]"
                     /GO_process="GO:0043952 - protein transport by the Sec
                     complex [Evidence IEA]"
                     /codon_start=1
                     /gene="secF"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008890998.1"
                     /locus_tag="CSC3H3_RS09230"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09225"
                     /product="protein translocase subunit SecF"
                     /protein_id="WP_101265793.1"
                     /transl_table=11
                     /translation="MKPLHLVPDDVNIPFLKIRKLFYIFSLSLVVLSGVLFFTKGLNFG
                     IDFRGGILLEIKTDGPADIPGLRDNLGSLGLGDVAIQQFGEPDDVLIQLQRQDGDEQAQ
                     MAALETVTKALGDGVSIRRSELVGPKVGDELKEAGLYSVVISLALIMIYIWFRFEWQFA
                     VASVVALLHDVIITVGLFVISGIQFDLATLAAILTVAGYSINDTVVVFDRIREFMRKYR
                     KMDLIELLDLSINTTLSRTVMTSLTTLLALIALFLFGGEAIRGFTFALIFGIVIGTYSS
                     ICVASPLLVVLKLRRKIPEGEEANPEEANAL"
     gene            8121..8501
                     /locus_tag="CSC3H3_RS09235"
                     /old_locus_tag="CSC3H3_09230"
     CDS             8121..8501
                     /codon_start=1
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089621.1"
                     /locus_tag="CSC3H3_RS09235"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09230"
                     /product="Mth938-like domain-containing protein"
                     /protein_id="WP_245881353.1"
                     /transl_table=11
                     /translation="MSMDITPMVASGRKIVSSYGDGLFRFGAEKAEGSVLLFPNEFMRW
                     PHIDMTTVTEDSFADVIERAGQIDILLLGMGTRMQPVKTAWRNALRPHGIVIEPMDTGA
                     ACRTFNVLLSEERRVAAALIAV"
     gene            8747..9955
                     /locus_tag="CSC3H3_RS09240"
                     /old_locus_tag="CSC3H3_09235"
     CDS             8747..9955
                     /GO_function="GO:0008381 - mechanosensitive monoatomic ion
                     channel activity [Evidence IEA]"
                     /codon_start=1
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008891002.1"
                     /locus_tag="CSC3H3_RS09240"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09235"
                     /product="mechanosensitive ion channel family protein"
                     /protein_id="WP_101284649.1"
                     /transl_table=11
                     /translation="MEEVTSGFEDLNDKVVAKFTDWADMVGGGDIGRAIVAIVILLFFI
                     TIRKFLAHGVVGGIRRLADQHKHQTMAMILDALQTPIEVFVVLLGIYFSVEVMEFSDKT
                     DAAIFNVLRVFVIATIFWAFYRVVEPLALGFNTFAGRFGAGLADDLRQFFIRCVRTLVV
                     VLAGVALLEDWGIDVSAFLGGLGLAGMAVALAAKDSVANVFGGLTIFADKLFKRGDWIE
                     TPHFEGVVEFVGLRATKVRTFSKAMVVMPNAEIVDSPVINWSRMTHRRIKMVIGVEYRT
                     SADQIEKIIERLRNFLQQDEDVAQEDVPQMVHLADFGSSSINIDLYYFTNTTDWQEWRD
                     IRNRHIIAFKRIIEEEGASFAFPSQSLYVESMPDMKMSRASANQDDQNKHQNQRQQTPE
                     KMG"
     gene            10001..10846
                     /locus_tag="CSC3H3_RS09245"
                     /old_locus_tag="CSC3H3_09240"
     CDS             10001..10846
                     /EC_number="2.5.1.-"
                     /GO_function="GO:0004659 - prenyltransferase activity
                     [Evidence IEA]"
                     /GO_function="GO:0046872 - metal ion binding [Evidence
                     IEA]"
                     /GO_process="GO:0008299 - isoprenoid biosynthetic process
                     [Evidence IEA]"
                     /codon_start=1
                     /gene_functions="biosynthetic (rule-based-clusters)
                     terpene: PT_phytoene_like"
                     /gene_functions="biosynthetic (rule-based-clusters)
                     terpene: phytoene_synt"
                     /gene_kind="biosynthetic"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089618.1"
                     /locus_tag="CSC3H3_RS09245"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09240"
                     /product="phytoene/squalene synthase family protein"
                     /protein_id="WP_101284650.1"
                     /sec_met_domain="phytoene_synt (E-value: 1.8e-17, bitscore:
                     61.3, seeds: 8, tool: rule-based-clusters)"
                     /sec_met_domain="PT_phytoene_like (E-value: 3.3e-16,
                     bitscore: 57.2, seeds: 61, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MSDTPSLSYCADYVRRNDKDRFLCALFAPAEKREDLFTLYAFNQE
                     ISKTREMVSEAMLGQIRLQWWRDALADIAKGEVRKHEVVEPLAGLIGQGRVSPATLEAM
                     IDAREFDLFDNAPKDMAALENYIDATSGQLSEVAAGVLGAKSDDAARAARLAGNAYGLV
                     GILRAMVFHGRAKRQYIPENVMEKHGVATGDIFEFRKTDAVKAMTHDLASRAKEKIREA
                     RKMRQALPKLASPAALPIVLADNYLKKLKKAEYDPFSAEFGLARPASFRLTIRAVSGYW
                     "
     gene            10955..11515
                     /locus_tag="CSC3H3_RS09250"
                     /old_locus_tag="CSC3H3_09245"
     CDS             10955..11515
                     /codon_start=1
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008891004.1"
                     /locus_tag="CSC3H3_RS09250"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09245"
                     /product="hypothetical protein"
                     /protein_id="WP_215907570.1"
                     /transl_table=11
                     /translation="MKSIRNIALLVIIPVGVLITLWTFAKPAAATSIDPFLGVYHGTAI
                     SEETHELKARDLDVTIKKTGNAFSIDWSTVIYKSDGREKASAISIDFFSTDRADIFGSA
                     MRTGLFGKRVPNDPLKGQPFVWARIVGPTLTVHAMYILDDGGYEMQVYERTLDPHGNLD
                     LVFKRFRNGKSIREITGELTRVK"
     gene            complement(11628..12971)
                     /gene="trmFO"
                     /locus_tag="CSC3H3_RS09255"
                     /old_locus_tag="CSC3H3_09250"
     CDS             complement(11628..12971)
                     /GO_function="GO:0050660 - flavin adenine dinucleotide
                     binding [Evidence IEA]"
                     /GO_process="GO:0008033 - tRNA processing [Evidence IEA]"
                     /codon_start=1
                     /gene="trmFO"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089616.1"
                     /locus_tag="CSC3H3_RS09255"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09250"
                     /product="methylenetetrahydrofolate--tRNA-(uracil(54)-
                     C(5))-methyltransferase (FADH(2)-oxidizing) TrmFO"
                     /protein_id="WP_101284651.1"
                     /transl_table=11
                     /translation="METVTPVHIIGAGLAGSEAAWQLAEAGIPVIIHEMRPERPTDAHQ
                     TDKCAELVCSNSFRSDDKEYNAVGLIHDEMRRANSLILRCADEHKVPAGAALAVDREGF
                     SQAVTDALMNHPLISMERGEISGLPPAEWGNTIIATGPLTSGSLAAAILEQTGEQSLAF
                     FDAIAPILYKESIDFDKAWFQSRYDKGDGKDYINCPMNKEQYNNFIDALLAGDKTEFKD
                     WEKNTPYFDGCLPIEVMAERGRETLRHGPMKPVGLTNPHDPLVKSYAIVQLRQDNALGT
                     LYNMVGFQTKLKYGAQVEVFKTIPGLENADFARLGGIHRNTFLNSPRLLDPTLRLKSRR
                     DIRFAGQITGCEGYVESAAVGLMTGRFTAAEKQGRELPLPPAVTAMGALLGHITGGANE
                     DSFQPMNVNFGLFPELDLKKRVKGRDRKKLYTDRAIPAFDEWLAEIGN"
     gene            13231..14055
                     /locus_tag="CSC3H3_RS09260"
                     /old_locus_tag="CSC3H3_09255"
     CDS             13231..14055
                     /codon_start=1
                     /inference="COORDINATES: ab initio prediction:GeneMarkS-2+"
                     /locus_tag="CSC3H3_RS09260"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /old_locus_tag="CSC3H3_09255"
                     /product="hypothetical protein"
                     /protein_id="WP_101284652.1"
                     /transl_table=11
                     /translation="MTDPANTSSPNTPGTPNATGISPATSSAGPSNSISTPVAAVSGRA
                     ADHPQTSGISGDSSATRPQNGTDASANPATPAGVTPVSPLNSFVAVLTGGGDVAKLSAA
                     SLALVFSPLVLLAVLATFFTIDVLVSSWGGRLFGFSYDLVFFVGVFGALGATVSVMLYI
                     SDTQNWQNLTPLDIFFRFLFRPFLGFVFAILALLIVRAGIVPDSLSQLQAIRDLNDLGT
                     LKLSDESGQQVAIMLLIAFLAGFSERLVKAILQTVEGRIRALAIGSDSAKTT"
     gene            complement(14095..16923)
                     /gene="uvrA"
                     /locus_tag="CSC3H3_RS09265"
                     /old_locus_tag="CSC3H3_09260"
     CDS             complement(14095..16923)
                     /EC_number="3.1.25.-"
                     /GO_function="GO:0003677 - DNA binding [Evidence IEA]"
                     /GO_function="GO:0005524 - ATP binding [Evidence IEA]"
                     /GO_function="GO:0016887 - ATP hydrolysis activity
                     [Evidence IEA]"
                     /GO_process="GO:0006289 - nucleotide-excision repair
                     [Evidence IEA]"
                     /codon_start=1
                     /gene="uvrA"
                     /gene_functions="transport (smcogs) SMCOG1000: ABC
                     transporter ATP-binding protein"
                     /gene_kind="transport"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089614.1"
                     /locus_tag="CSC3H3_RS09265"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09260"
                     /product="excinuclease ABC subunit UvrA"
                     /protein_id="WP_101284653.1"
                     /transl_table=11
                     /translation="MLTKISVRGAKEHNLQNIDVELPRDSLVVITGLSGSGKSSLAFDT
                     IYAEGQRRYVESLSAYARQFLEMMQKPDVEYIDGLSPAISIEQKTTSRNPRSTVGTVTE
                     IYDYMRLLWARVGIPYSPATGLPIVSQTVSQMVDRVMEMEEGTRLYLLAPIVRGRKGEY
                     KKELKDLRARGFQRVKIDGEMYDIDEAPDLNKKLKHDISVVVDRLVVKDGIQTRLADSF
                     ETALELTDGIVFAEDARSGDSTLFSAKFACPVSGFTIEEIEPRLFSFNNPFGACPVCDG
                     LGTQMEFDPELVVPDTSKTLDDGAIAPWASSTSKYYLQTLRAIAKHFGFKTNVAWDKLP
                     EKAKHIILHGSGEEEVTVTYDDGSRSYKVTRPFEGVIPNMSRRWRETDSQWARDELAKY
                     QTVSDCDACHGHRLKPEALAVKVDKCTISEITALSIGKAVEWFASLEPKLDEKQTEIAR
                     RILREINDRLKFLSDVGLDYLSMSRTSGTLSGGESQRIRLASQIGSGLTGVLYVLDEPS
                     IGLHQRDNDRLLETLKRLRDLGNTVLVVEHDEDAIRAADYVVDMGPGAGIHGGRIVAQG
                     TPDEIQQNPESLTGKYLTGIEQIAVPAKRRDGNGKAITISGARANNLQNITVDFPLGRF
                     VCVTGVSGGGKSSLVIETLYKSLAKRMHNARTQPGAHDSISGIEHIDKIIDIDQSPIGR
                     TPRSNPVTYTGAFTPIREWFAQLPEAKTRGYKPGRFSFNVKGGRCEACQGDGVIKIEMH
                     FLPDVYVTCDQCKGKRYNRETLEISFKGKSISDVLDMTVEEGADFFKAVPSIRDKMETL
                     KQVGLSYVHLGQQATTLSGGEAQRVKLAKELSKRATGRTIYILDEPTTGLHFHDIRKLL
                     EVLHTLVDQGNTVVVIEHNLDVIKTADWIIDIGPEGGDGGGELVSAGTPEDIMKNDRSY
                     TGKYLKRHLRLS"
     gene            complement(17121..18200)
                     /gene="fbaA"
                     /locus_tag="CSC3H3_RS09270"
                     /old_locus_tag="CSC3H3_09265"
     CDS             complement(17121..18200)
                     /EC_number="4.1.2.13"
                     /GO_function="GO:0004332 - fructose-bisphosphate aldolase
                     activity [Evidence IEA]"
                     /GO_function="GO:0008270 - zinc ion binding [Evidence IEA]"
                     /GO_process="GO:0005975 - carbohydrate metabolic process
                     [Evidence IEA]"
                     /GO_process="GO:0006096 - glycolytic process [Evidence
                     IEA]"
                     /codon_start=1
                     /gene="fbaA"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089613.1"
                     /locus_tag="CSC3H3_RS09270"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09265"
                     /product="class II fructose-bisphosphate aldolase"
                     /protein_id="WP_101265781.1"
                     /transl_table=11
                     /translation="MPSVKPGVVTGAAYRELVAACKQDGYALPAVNITTSSTLNAALEA
                     AANAGSDIIIQLSNGGAQFFAGKGIKDANAARVIGAVSAARHTQLLAEYYGVAVVLHTD
                     HANRKFVPWVDGMLSWGEKAYKETGKPLFSSHMLDLSEEPLEENLATCEEFLKRLSAVD
                     MSLEIELGVTGGEEDGIGKELDTTQDDIDPRLYTRPEEVEEAYRRLKPLGHFSVAAAFG
                     NVHGVYKPGNVKLRPEILLESQKYVADKHGLDGFPLDFVFHGGSGSEKEKIEEAVGYGV
                     FKMNIDTDTQFAYATAVGKYVEENDRAFKYQVDPEDDKPYKKQYDPRVWVREAELSMAA
                     RLEEAFKDLGATGKSIAKG"
     gene            complement(18221..19294)
                     /locus_tag="CSC3H3_RS09275"
                     /old_locus_tag="CSC3H3_09270"
     CDS             complement(18221..19294)
                     /GO_function="GO:0016491 - oxidoreductase activity
                     [Evidence IEA]"
                     /GO_function="GO:0016651 - oxidoreductase activity, acting
                     on NAD(P)H [Evidence IEA]"
                     /codon_start=1
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) GFO_IDH_MocA"
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) GFO_IDH_MocA_C3"
                     /gene_functions="biosynthetic-additional (smcogs)
                     SMCOG1079: oxidoreductase"
                     /gene_kind="biosynthetic-additional"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_020593771.1"
                     /locus_tag="CSC3H3_RS09275"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09270"
                     /product="Gfo/Idh/MocA family oxidoreductase"
                     /protein_id="WP_172963409.1"
                     /sec_met_domain="GFO_IDH_MocA (E-value: 2.1e-25, bitscore:
                     87.6, seeds: 39, tool: rule-based-clusters)"
                     /sec_met_domain="GFO_IDH_MocA_C3 (E-value: 1e-10, bitscore:
                     39.5, seeds: 4226, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MTSKPVRILVVGLGNMGMSHAKAYHALEGFEIVGLCTRSPIAPGT
                     LPEGFESYPQFTDYEAALAQTGPDAVSINTYTETHANYAIAAFEAGAHVFIEKPLAATV
                     VDAERVIESAQKHGRKLVIGYILRHHPSWEKFVELAKTLGKPLVMRMNLNQQSSGEFWE
                     THKRLLNSVSPIVDCGVHYLDMMCLMTAAKPISVHAMGVRLSPDVAEDMYNYGQLQVRF
                     DDGSIGWYEAGWGPMISETAFFVKDVIGPKGSVSITDVEEKGAGSADIDTHTKTNALKL
                     HHAATDRHGKFSHSDEIIRTDDEPDHDGLCLREQQFFLKSITENLDLTQHMTDALGSLR
                     IALAADQSVRTGNIIML"
     gene            complement(19339..20472)
                     /gene="nagA"
                     /locus_tag="CSC3H3_RS09280"
                     /old_locus_tag="CSC3H3_09275"
     CDS             complement(19339..20472)
                     /EC_number="3.5.1.25"
                     /GO_function="GO:0008448 - N-acetylglucosamine-6-phosphate
                     deacetylase activity [Evidence IEA]"
                     /GO_process="GO:0006040 - amino sugar metabolic process
                     [Evidence IEA]"
                     /codon_start=1
                     /gene="nagA"
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) Amidohydro_1"
                     /gene_kind="biosynthetic-additional"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007089611.1"
                     /locus_tag="CSC3H3_RS09280"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="CSC3H3_09275"
                     /product="N-acetylglucosamine-6-phosphate deacetylase"
                     /protein_id="WP_215907571.1"
                     /sec_met_domain="Amidohydro_1 (E-value: 2e-30, bitscore:
                     104.0, seeds: 57, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MTVLAIKNARLFDGEQFHDDACVIVENGKVSAVSPARIPDGADIF
                     DARGNVLAPGFIDVQVNGGGGVLLNHTPSVDGIRQIMAGHRQFGTTAMLPTLITDTREK
                     MLAAIEAVKNAIDEGVPGIVGIHLEGPYLNTERKGVHDANIIRPMEDDAIEVLSSLPNG
                     RILVTMAPEKAKPGTIEKLCANGVLVCAGHTAGTYDDIQKAIGEGLRGFTHLFNAMSPL
                     AHREPGVAGAAIADKNTWCGLIADGYHVHPAAMQVAVRAKQTGKIMLVTDAMPTVGAKT
                     KRFVLGGEEIIAIDGRCALADGTLAGSDLDMISAVKNSVEMIGVDLGEALRMASLYPAA
                     FLKLDDRKGKIMPGFDADMVLFDPQTFQVKHTWISGI"
     misc_feature    complement(20686..20701)
                     /note="TFBS match to ColR, Phenol-responsive regulator,
                     confidence: weak, score: 21.14"
                     /tool="antismash"
ORIGIN
        1 ggcaggcagc cctgaatgat cagcttgacc tgctgctgac atcctttccg aacatccttg
       61 atgcatcggt gcccaagggc gcggacgaag atgacaatgt cgaaattcgt cgctggggca
      121 ccccgggcga atttgacttt accccgggcg aacattttga cctgggcgaa aagctcggca
      181 tgatggattt tgccacggcg gcaaagctgt cgggttcgcg ctttgttatt ctgcgcggtc
      241 agcttgcgcg tctggaacgt gccctggcac agttcatgat tgacatgcat gtcaacgaac
      301 atggctatga ggaaaccacc acgccgatgc tggtgaagga cgatgctatg tttggcactg
      361 gccagttgcc gaaatttgcc gaagattcgt ttaaaacaac caatgacatg tggttgatcc
      421 cgacatcgga agtcacactg accaaccagg ttgccggcga aatcgtcgat actgccagcc
      481 tgccgttgcg ctataccgca ctgacccagt gtttccgttc cgaggcggga tcggccgggc
      541 gtgatacgcg cggtatgatc cgccagcacc agttctcgaa ggtcgaaatg gtttccatcg
      601 tcgatccgga acagtcggat gccgaactgg aacgcaaaac acagtgcgct gaagaagtgc
      661 tgcagcgttt gggcctgcct taccgcacca ttttgctgtg ttcgggcgat accggttttt
      721 cggccatgaa aacctatgac atcgaagtct ggttgccggg gcagggccgt taccgcgaaa
      781 tttcgtcctg ctcgaacacg ggtgaatttc aggcacgccg catgaatgcc cgttgccgtg
      841 cccagggcga caagaaaacc cgtttcgtcc ataccctgaa tggatcgggc cttgcggtcg
      901 ggcgctgcct gattgcggtc atggaaaatt atcagcaggc agatggttcc attaaggttc
      961 cggaagcatt ggtgccttat atgggcggta tgactgaaat ttccgtgacg aagtaaggtt
     1021 tatccatgat cgatctgcat aatgcccgca ttctgatctg caatgatgac ggtatagatg
     1081 cccctggcat caaattgctt gaaagtcttg cgcggcagtt ttcagatgat gtgtgggtgg
     1141 ttgcaccgtc gatagaacaa agtggcgcag ggcattcgct taccctgcgc cgcccgttac
     1201 gtatccatca gcgtgacgaa cgccattttg ctgttgatgg cacgccgacg gactgcatcc
     1261 tgctggcctt gcaggttatc atgcgtgaaa caccacctga tattgtgttt tcaggcatca
     1321 atcgcggcgg caatctgggc gaggatgtaa cctattccgg cacggtggcg gccgccatgg
     1381 aatctgctct gctgaatgtg ccggccattg cattttcgca gtatttttcc cagaaaatga
     1441 tcagttggga tattgccagg gcgcatttga ccaatgtggt tacttcactt gtgaaggaag
     1501 aatggccgcg taacgttttg attaacgtta attttcctga aaaggaacgc gaaggtgggg
     1561 cgaaaattcg cattacacgg caggggcagc gtaaaattgg tgatcacgtt gccgaacgtc
     1621 aggacccgca cggcgaaccc tattactgga tcggtgccat tcggtcggaa ttgcccgaag
     1681 atgaaaacgc cgatttacat gtgatcgaaa agggcgatat ttctgttacg ccaatcggcc
     1741 ttgattttac cgataatgtg accctcgaaa aactgacaaa ggcatttcag tgagtacgcc
     1801 tgccattgcc cgtgcaccgg aaatcgatgc cctgctggaa acactgcgtc aggacggcat
     1861 tcacgacgaa gccgttctga aagcgctggc tgatattccg cgtgaattgt ttgttgccga
     1921 gccatttgcc agccgcgcct gggaaaatgt cgcattgcca atcagcaagg ggcaaaccat
     1981 ttcccagccc tatatcgtcg cgctgatgac acaggccctg cagcttaatg atcgcatgaa
     2041 ggtgcttgag gtcggtacgg gatctggcta tcaggctgca atccttgccc gtctggcgcg
     2101 ccgtgtttac acacttgagc ggcataaacc cctgctgcgc gaggcagaag accgttttcg
     2161 caaactgggc ctgcacacca tttcaaccct tcatgccgat ggcgggctgg ggtggaaggc
     2221 acaggcaccg tttgaccgta tcattgtgac ggcatcggca caggatgtgc caccggtact
     2281 ggttgatcag cttgccattg gcgggatcat ggtggtgccg gtaggcgaag tttcccatac
     2341 ccagattttg ctgcgtgtga tgcgcacccc gaccggcata gacgttaccg aactgatgcc
     2401 ggtgcggttt gtgcccatgc tggctggcac cgaagaattc tgaaatcttg ccatctgtga
     2461 aaattttcgc ctcgttaccg attattaacc ggtgatgcgg gaaaatcttc gacatgatga
     2521 aaaattatgt caaacacgtc gttcataatg tccggacgtt gctgtttgtc tgcgggacaa
     2581 cagcggcgct aagtgggtgt ttgcttaatt caaagccggt tccggtcgtt tatgacgaag
     2641 gcacaaatta ttcgctttcg gccatgccag agctgcgcaa tggcgaacgc gtgatcaccg
     2701 ttaaacgtgg tgacagtgtt tcgctgctgg ccgaacgcta tcacgccgat ttccgcaaat
     2761 tcgccgccag aaacagcctt agctcgccgt atgtcatcta tcccggacaa cggctggttt
     2821 taccgccctg gcaggccgaa tatgacagcc gcccgccaga aacggctgtg gcgcaatcat
     2881 caggcaattc gcgagcaacc aaaaacgttg tcattgctgg ccgcgcgcaa ccggttgaaa
     2941 taacctcggc accgcttgat gcgccttcat cggcagcatc acagcctgca tcggctgcac
     3001 cggcatccga tcttagcccg gttactgaca tcaccagttc caaaagcacg gttattgccg
     3061 gcgccaattt acctgttccg gggataaagc cacaggatat tgccgcacag gccgaacgcc
     3121 agtttgcgtc gggaaatagc tggcgcaata ccaccggcac caaggttgcc agcagcagcc
     3181 cgaccagcac gggtggtagc gctgctgttg ttccttccag tgcggcatcg cacccatcgc
     3241 gggtcaaggt tgatgatatc gttcctgcgg gccggcaggg ctttatctgg ccggtacagg
     3301 gcaaggttgt tctgggatat ggtgccgggc ccggtggcct gtttaatgat ggcatcaata
     3361 ttgcggctga acgcggcact gccattctgg cgaccgacaa tggggttgtt acctatgtcg
     3421 gcaatgaact gcgtggtttt ggcaacctga tcctgatcaa acatgccgat ggttatgtgt
     3481 cggcctatgc gcacactgaa aacccgctgg tggcgcgcgg cgatgtggtt agccgagggc
     3541 agcgcatttc ttcggtcggc gcgacgggtg cggtcaaccg accacagctt cattttgaaa
     3601 tccgtcaggg tcgcaaatca cgcgatccgg taaaatactt gcctcaggca gttgcatccc
     3661 gctgacccat taaaaaacgg cctgattgcg caatcaggcc gtttctgttt gggctgtcta
     3721 tgagcattcg atgggcttgt atgcctgttc aagcgatatg tttgccctgt tcccctgcaa
     3781 tttcctgaat gaactgccag gcgactcgtc cggaccggct gccacgcgtt acggtccatt
     3841 ccttggcgcg cgcaaatagt tcgtcctcgg caatatccag gtcatattcc ttggcatatc
     3901 cggcgatgat ctcgaaataa acatcctgcg aaatattgtg gaagcccagc cacagcccga
     3961 accggtcgga aagcgatact ttttcttcaa ctgcttctga cgggttgatt gcagttgagc
     4021 gttcgttttc gatcatgtcg cgcgccatca ggtggcgacg gttggatgtg gcatagaaaa
     4081 tcacgttggc cgggcggcct tcaatcccgc catccagcac tgctttcaat gatttatagg
     4141 attcgtcctg caggtcaaaa gacaggtcat cacagaacag aaccgtcttg cgcccgctat
     4201 cttgcaggat tttcagcaaa acgggcaggg tgggaatatc ctcgcggtga atttccacca
     4261 gcgccaggga agacggggtt tccatattga ttagcgcatg catcgccttg acgatcgagc
     4321 ttttgcccgt accgcgcgcg ccccacagca gcgcattatt ggctggcagg ccatcggcaa
     4381 agcgtttggt gttgttataa agaatacgct tttgctgttc gatcccttgc aaaaggccaa
     4441 tatccacccg gttaacctgt ttgacgggct gcaaataccc ggtttcggca tgccagacaa
     4501 aggcatcagc cacagaaagg tcgttctgct ggcgggcagg cggtgccatg cgttcaaggg
     4561 catcggcaat gcgggacaat gttggcagca aatcggcaag ggtatcggtc atgggggcct
     4621 cttggatgtt ttcatttttg gtggcagcag aattcactgc attttatggg cgaaccctag
     4681 cacagcatgc ttcgcacgga atagtccgta taaaccggca ttgggactgt gaatctggtc
     4741 acaaagtcgc gcaacacaag gtaattcaac cgccaatgcg catggatgtg tgatttataa
     4801 ggcacgcaat ggcctgtttg aggcaaattt aagcgatctt tcttgcaacg gtgaacgaca
     4861 ccgttatatt ccggcagttg attggttttt ctgttttagg ttaggagttc ctaatgttga
     4921 tttcaccggc ttttgctcag ggtgctggtg gcgctgctgg cggagcagat atgctgacta
     4981 gctttctgcc gctgatcctg atttttgtgg ttttctactt tttgctcatt cgtccgcagc
     5041 agaagaaaca gaaagaacac aaagcaatgc tggctgcggt tcgccgcggc gacaaaatcg
     5101 ttacggctgg tgggatcatc ggaaccgttg ccaaggtcgt caatgacgat gaagtttccg
     5161 tcgagattgc tgatggcgtg aaagtcaaag tcgcacgggg catgatctcg aatgtcctgt
     5221 cgcgcaccga accggccaag ggtggttcgg atgccgaaga caaggcagac gagaagaaaa
     5281 ctgaagaagt caaaaaagac tgatttctgt caaagctaca ggcggaacct cccggttccg
     5341 cctttctgca tattactggc ggtcggccgg gcaagtcctt ccttattatc ctgattgagc
     5401 cctgagcccg atgtcgcaat gttccccaga catctctggt taaaggacac atgctttatt
     5461 ttaccaaatg gaaagcaaca ctggttttgc tggtgtgtgc tttgggcgtg atatatgccg
     5521 cgccaaactt ccttcccaag ggtacgttcc cgacagaatc aagctggatg cccggtaagc
     5581 agatcagcct tggtcttgac ctgcgtggtg gctcgcacat gcttttggaa gtgggggtcg
     5641 atagcgttat tcgagaacgt tacgactcac tgaccgaaaa catgcgagac gcccttcgcg
     5701 agcagcgtat tcgctatcgc ggcctgcgcg gttcggccga tggtgcatct gtaaccatcg
     5761 ttagcgatga tgatgtcgag aaagcccgtt ccctgatggc aaagctggac ccgctgacgg
     5821 ttgtcggtgg tgaaggcaac cacatcatgc tgacttacaa cgaacagacg cgccgcgaaa
     5881 tggtcgatcg ggccatcagc cagtcgctgg aagttgtccg tcgccgtgtg gacgaactgg
     5941 gtacgacaga gccgtcgatc cagcgtcagg gggaagaccg catcgttgtt caggttcctg
     6001 gtcttgatga tccgtcgcgc ctgcgtgaca ttcttggccg tacggcaaag atgaactttc
     6061 atctgatcaa cgaaaccaaa accccgcagc aggccaaggc caccggcatt ccgcccggtg
     6121 ccgtgatcat gcccggtgct gataacgcat cacccggcga accggaagaa tatctggttg
     6181 atcgccgtgt cgtggtttcc ggtgacaacc tgatcgacag ccagccaacc ttcaatgatg
     6241 gccgcccggt ggtttccttc cgctttgatg cggcaggcgg ggcacgtttt ggcgatgtca
     6301 ccagcaaaaa taccggccgc cgtcttgcca tcgttcttga tggcaaggtg atttcggcac
     6361 cgcgcatcaa tgaaccgatc atgggtggca gcggcatcat taccggccag tttggcgttc
     6421 aggaagcaaa tgacctgtcg ctgctgctgc gtgccggtgc gttgcccgcg ccgatgaaaa
     6481 ttcttgaaga acgttcggtt ggtccgggcc ttggtcagga ttccattgat tcgggcgaaa
     6541 tcgctgctgt tcttggtatg gtctttgttg tggccttcat ggccgtcagc tatggcctgt
     6601 ttggcatttt tgccaacctt gccctgatca tcaacctggt gctgatcctg gcgatcatgt
     6661 cggttctaca ggcaaccctg acattgccgg gtattgccgg tatcgttctg acagtcggta
     6721 tggcggttga cgccaacgtt cttatttttg aacgtatccg cgaagaacga aaaatcggcc
     6781 gatcgatcat cagttcgatt gatgccggtt accgcagcgc catgtcgacc attcttgatg
     6841 ccaacatcac gaccctgatt gcagcgctgg tgctgttcag ctttggctcc ggcccgatca
     6901 agggctttgc cgtgacgctg gcaatcggga tcatcacgtc gatgttcgca gccatctggg
     6961 tgacacgtct gatcgtggca ttctgggtca agcatcgtcg tccgactgaa cttgtcttgt
     7021 gaggaaggga gcaattcgca catgaaaccg ttgcatcttg ttccggatga cgtgaacatc
     7081 ccgttcctca aaatccggaa gctgttttac atcttctcgc tctcgctcgt ggttctgtcg
     7141 ggcgtgctgt tcttcaccaa gggccttaat ttcggtatcg atttccgcgg cggtattctg
     7201 ctcgaaatca aaaccgatgg tccggccgat attccgggat tgcgtgacaa tcttggcagc
     7261 cttggcctgg gtgacgttgc cattcagcag ttcggtgaac ccgatgacgt gctgatccag
     7321 ttgcagcgcc aggacggcga tgaacaggcc cagatggcgg cacttgaaac cgtgaccaag
     7381 gcccttggtg atggtgttag tatccgccgc tccgagcttg tcggcccgaa ggttggtgac
     7441 gaacttaagg aagcaggcct ttattcggtt gtgatctcgc ttgcgctgat catgatctat
     7501 atctggttcc gctttgaatg gcagttcgct gtcgcctcgg tcgtggcact gttgcatgac
     7561 gtgatcatta cggttggcct gtttgtcatt tccggcatcc agttcgacct ggcaacgctg
     7621 gctgcgatcc tgacagtggc aggttattcc attaacgata ctgtggtcgt gtttgaccgt
     7681 attcgcgaat tcatgcgcaa atatcgtaaa atggacctga tcgagctttt ggatctgtcg
     7741 atcaacacca cgctgtcacg taccgtgatg acgtcgttga cgacgcttct tgcgctgatc
     7801 gccctgttcc tgtttggcgg cgaagccatt cgtggtttca ccttcgccct tattttcggc
     7861 atcgtgattg gtacatattc ctcgatctgt gtggcgtcgc cgctgctggt cgtgctgaag
     7921 ctgcgccgca agatcccgga aggcgaagaa gccaaccccg aagaggccaa tgccctgtaa
     7981 tagcggcagg cagtaatttc tgccggtact agacatttac aagaaggccg ggttcgtgcg
     8041 aaaacgaccc ggcctttttt tatgcccgaa gcaggcgata gaacgtttct ggcatccagt
     8101 caccaccaac gaggtttccc atgtcgatgg acatcacacc gatggtagca tcgggccgca
     8161 aaattgtttc cagctatggg gatggcctgt ttcggttcgg ggcggaaaag gccgaaggct
     8221 cggttttgct gtttcccaac gaattcatgc gctggcccca tatcgacatg acgaccgtga
     8281 ccgaagactc atttgccgat gtcatcgaac gggcagggca gatcgatatt ttgttactcg
     8341 gcatgggaac ccggatgcag ccggttaaaa ccgcatggcg taacgccctg cggccgcatg
     8401 gtatcgtgat cgagcccatg gataccggtg ccgcttgccg taccttcaat gtgcttctgt
     8461 ccgaagaacg ccgggttgct gctgccctga ttgcggtctg acggcccctt tttctatttt
     8521 catacctgcc ggtatagcgg ctgttccgat ctgtttcgga acaatttgcc tcgccttgcg
     8581 tgttgtatcc taagggatta acgcaaacag gcaggggata cctgcggccc aatcctttaa
     8641 cccgtgcaga cgcgcgtgtt ttgatgcaat cgctagcagg ttgtgtctgt tcgtgtggtc
     8701 ggtccattga ccacgataaa gggatttcag gccgtaaatc tggcatatgg aagaagttac
     8761 ttcgggcttt gaagacttaa acgacaaggt tgtcgcaaaa ttcaccgatt gggccgatat
     8821 ggttggcggc ggagatattg gccgcgccat tgttgccatc gttattttgc tgtttttcat
     8881 caccattcgt aaatttctgg cgcatggtgt ggtcggtggt attcgccgcc tggccgatca
     8941 gcataaacat cagacaatgg ccatgatcct cgatgcatta caaacaccga ttgaagtgtt
     9001 tgttgtattg ttggggatct atttttctgt cgaagtcatg gagttttcgg acaagacgga
     9061 tgccgcgatt ttcaatgttt tgcgggtttt tgttatcgcc accattttct gggcatttta
     9121 ccgggtggtc gaaccactcg cccttggctt taataccttt gccgggcgtt ttggcgcagg
     9181 cctggccgat gatttacgcc agttttttat ccgctgtgtg cgcacccttg ttgttgtgtt
     9241 ggctggtgtt gccctgctcg aagattgggg cattgatgtc agtgcgtttt tgggtggctt
     9301 gggcctggcc ggtatggcgg ttgcgctggc tgcgaaggat tcagttgcca atgtttttgg
     9361 cggattgacg atttttgccg acaagctgtt taaacgcggt gactggatcg aaacaccgca
     9421 ttttgaaggt gttgtcgaat ttgttggcct gcgggcaacc aaggtacgta ccttttccaa
     9481 ggcaatggtc gtgatgccca atgccgaaat tgttgattca ccagttatca actggagccg
     9541 catgacgcac aggcgcatca agatggtaat cggcgttgaa tatcgtacca gcgccgatca
     9601 gattgaaaag atcatcgagc gattgcgtaa ttttctgcag caggatgaag acgtggccca
     9661 ggaagatgtg ccgcaaatgg tacatttggc tgattttggt agcagttcga tcaatatcga
     9721 tctgtattat ttcaccaaca ccaccgattg gcaggaatgg cgcgacattc gcaatcgaca
     9781 tatcattgca tttaaacgga taatcgagga agaaggggca tcatttgcat tcccgtccca
     9841 gtctttgtat gtggaaagca tgccggacat gaaaatgtcc cgtgcatcgg caaaccagga
     9901 cgatcagaat aaacaccaaa accagcgaca acagacgcct gaaaaaatgg gatagtttcc
     9961 catcgggcgc tgggcgtgac cggcggataa cttaatttcg atgagtgata caccaagcct
    10021 ttcctactgt gccgattatg tacggcgaaa tgacaaggac cgttttttgt gcgcgctgtt
    10081 tgcgccggcc gaaaagcgcg aagacctttt cacgctttac gcctttaatc aggaaatatc
    10141 gaaaacccgc gaaatggtca gcgaagccat gctgggccag atccgcctgc aatggtggcg
    10201 tgatgcgctg gctgatattg ccaagggcga ggttcgcaaa catgaggtcg tcgaaccgct
    10261 ggcaggcttg attgggcagg ggcgtgtcag cccggcaacc ctggaagcca tgatagatgc
    10321 ccgcgaattc gacctgtttg ataatgcgcc caaggatatg gctgcacttg aaaactatat
    10381 tgatgcaaca tccggtcagt taagcgaggt tgcagctggc gttcttggtg caaaaagcga
    10441 cgatgccgcc cgcgctgccc gacttgcggg taatgcctat ggtctggtgg ggatattgcg
    10501 cgccatggtg tttcatggcc gggcaaagcg gcaatatatc cccgaaaatg tcatggaaaa
    10561 gcatggtgtg gcgacaggag atatttttga atttcgcaaa accgatgctg taaaagccat
    10621 gacacatgac cttgccagcc gggcaaagga aaaaatacgc gaagcacgca aaatgcgcca
    10681 ggcattgccc aaattggcgt ctcctgcggc attgccgatc gttcttgccg ataactattt
    10741 gaaaaaatta aaaaaggccg aatatgaccc gttttctgcg gagttcggcc tggcgcgacc
    10801 tgccagtttt cgcctgacga tacgggcagt ttcgggatat tggtgatgca gtgtctttac
    10861 aacgggcaaa aaggttataa atctatatac atcaaccggt tcctgacctg atttggctgg
    10921 ttgtccggtg ttagaagccc gcggggtcca aaaaatgaaa agcatcagaa atatcgcact
    10981 cttggtcatc atcccggtgg gtgttctgat cacgctatgg acctttgcaa agcccgcagc
    11041 ggcaacctct attgatccgt tccttggtgt ttatcacggc acggccatat ccgaagaaac
    11101 ccacgagcta aaggcccgtg accttgatgt aaccatcaag aaaaccggca atgccttttc
    11161 gattgactgg tccaccgtga tttacaaatc cgatggccgg gaaaaggcat cggcgatcag
    11221 cattgatttt ttcagcaccg accgggccga catttttggt agcgccatgc gtaccggcct
    11281 gtttggtaaa cgcgttccca atgacccgct aaaggggcag ccctttgtct gggcgcgcat
    11341 tgttgggcca acgctgacgg ttcacgccat gtatattctt gatgatggcg gctatgaaat
    11401 gcaggtctat gaacgtacgc tggacccgca cggcaatctt gatctggtgt tcaaacgatt
    11461 tcgcaacggc aaaagcattc gtgaaatcac cggcgagctg acccgggtta aataacgcaa
    11521 ttccatcgcc aaattgccct cacgaaaaac gcccgcgcaa gctgcacggg cgttttgtct
    11581 tgttgcatat tcagcactgg cttgggcaaa gacgatgcct tgatgcctta attgccaatc
    11641 tcggccagcc actcatcaaa ggcgggaatg gcacggtcgg tatacagctt tttgcgatca
    11701 cggcccttga cccgtttttt cagatcaagt tccgggaaca ggccgaaatt gacattcatc
    11761 ggctggaagc tgtcttcgtt ggcaccgccg gtaatatggc ccagaagcgc acccattgcc
    11821 gttactgccg ggggcagggg taattcgcgg ccctgttttt ccgctgcggt aaaacgaccg
    11881 gtcatcaggc cgacagcagc actttcgaca tagccttcac acccggtgat ctgcccggca
    11941 aaacggatgt cgcggcgtga tttaagccgc agcgtcgggt ccagcaagcg cggggaattg
    12001 aggaaggtgt tgcggtgaat accacccagg cgggcgaaat cggcattttc aaggcccggg
    12061 atggttttga aaacctcgac ctgggcgccg tatttcagtt tcgtctgaaa accgaccata
    12121 ttgtaaaggg tgcccagtgc attgtcctgg cgcaactgca caatggcata ggatttcacc
    12181 agcggatcat gcgggttggt caggccaacc ggtttcatcg ggccatggcg cagggtttca
    12241 cgcccgcgtt cagccattac ttcaatgggc aggcagccgt caaaatacgg ggtgtttttt
    12301 tcccaatcct tgaattcggt tttgtcaccg gccagcagcg catcaatgaa attgttgtac
    12361 tgctctttgt tcatcgggca gttgatgtaa tccttgccat cgcccttgtc atagcgtgac
    12421 tggaaccacg ccttatcaaa atcgatgctt tctttataca ggatcggcgc aatggcatcg
    12481 aaaaatgcca gcgactgttc tccggtttgt tccagaatag ctgccgcaag cgaacccgat
    12541 gtcagcgggc cggtggcgat aatggtattg ccccattccg cgggcggaag gccgctaatc
    12601 tcgccgcgct ccatggaaat cagcgggtgg ttcatcagcg catctgtaac ggcctgcgaa
    12661 aagccttcac ggtcaacggc caaggcggca ccagccggca ccttgtgttc atcagcacag
    12721 cgcagaataa gcgaatttgc ccggcgcatt tcatcatgga tcaggccaac agcgttatat
    12781 tccttgtcat cggaacggaa cgaattggaa cagaccagtt cggcgcattt gtcggtctgg
    12841 tgggcatcgg tcgggcgttc cgggcgcatt tcatgaatga tcacgggaat gccggcttcg
    12901 gccagctgcc aggcggcttc actgccagcc aagccagcac cgatgatatg aacaggtgtt
    12961 accgtttcca agctgaacct ccgtcagaac aggttttggc atcaggccaa gtttggcccc
    13021 acacatagcg gccccggcac ccaaatatca agtcaattgc agcaatttgc cggtatcgcc
    13081 gtgcgatttc ggttgggttt gggcttcttc gtcacatttg gcagcaccat ccgtcaaacg
    13141 ggtattgaaa agtggtccgg tgcctgtgtt ctcaaatggc ttgtttgctg cacttaaccg
    13201 gcttcaatgg ctttaaccag gtaaaaacaa atgaccgatc cggcaaacac cagttccccc
    13261 aatacccccg gtacccccaa tgcaacaggt atcagcccgg caacatcatc cgccgggccg
    13321 tcaaacagta tttcgacgcc cgtcgccgcc gtctcagggc gggcggcaga tcacccccaa
    13381 acatctggca tatccggcga tagcagtgcg acgcgcccgc aaaatggcac cgacgcatct
    13441 gccaatcccg caacgccagc aggggttaca cccgtttcgc ctctcaacag ctttgtcgcg
    13501 gttttgaccg ggggtggtga tgtggccaag ctttcggcgg caagcctggc tttggtcttt
    13561 tcgccgctgg ttctgcttgc ggtgctggca acgttcttta ccatcgatgt gctggtcagt
    13621 tcttggggcg ggcggttgtt tggtttcagc tatgatctgg tattttttgt cggcgtcttt
    13681 ggcgcattgg gcgcaacggt cagcgtgatg ctgtatatca gcgataccca gaactggcag
    13741 aacctgacac cgcttgatat tttcttccgc tttttatttc ggccgttttt ggggtttgtt
    13801 tttgcgattc tggcattgct gattgtgcgt gccgggatcg ttcccgacag tctgagccaa
    13861 cttcaggcca ttcgggatct caatgatctg gggacgctca agcttagtga tgaaagcggc
    13921 cagcaagtcg cgatcatgct tttgattgca tttttggcag ggtttagcga acggctggtc
    13981 aaggccattt tgcaaacagt cgaaggccgc atccgtgccc tggcgatcgg ttccgatagc
    14041 gccaaaacaa cctgatgcgg cccaaatcca cttaccaagt ccgtttacac gggtttatga
    14101 aaggcgcaaa tggcgcttca gatatttgcc ggtatagctg cggtcatttt tcatgatgtc
    14161 ttcgggtgta ccggcactaa ccagttcacc gccgccatca ccaccttcag ggccgatatc
    14221 aataatccag tcggcggttt tgatcacatc caggttatgt tcaatcacaa caaccgtatt
    14281 tccctgatcc accagggtat gcagcacctc aagcaattta cgaatgtcgt ggaaatgaag
    14341 gccggttgtc ggctcatcca gaatgtaaat tgttctgcct gtcgcgcgtt ttgacagttc
    14401 cttggcaagt ttgacacgct gtgcctcacc gccagaaagg gtggtcgcct gttggcccag
    14461 gtgaacgtag ctcagaccca cctgttttag tgtttccatc ttgtcacgga tgctgggaac
    14521 ggctttgaaa aagtccgcac cttcctcaac cgtcatgtca agaacgtcag atatcgattt
    14581 gcctttaaac gatatttcga gagtttctcg gttgtatcgt ttacctttgc actgatcaca
    14641 tgtaacatag acatcgggca aaaagtgcat ttcaattttg ataacgccat caccctggca
    14701 ggcttcgcac cggccaccct tgacgttgaa tgaaaaccgc cccggtttgt agccacgggt
    14761 tttggcctct ggcaactggg cgaaccattc acgaatgggg gtaaaggccc cggtataggt
    14821 aacaggattg gaacgcgggg tacggccaat cgggctttgg tcaatatcga taattttatc
    14881 gatatgttcg atgccggaaa tgctgtcatg ggcgccgggc tgggtgcggg cattatgcat
    14941 acgtttggcc agcgacttat atagcgtttc aatcaccagg ctggatttac cgccacctga
    15001 aacaccggtt acgcaaacaa acctgcccag cggaaaatca accgtgatgt tttgcaggtt
    15061 attggcccgc gcaccgctga tggtaatggc cttgccattg ccatcgcgcc gtttggccgg
    15121 aaccgcaatc tgttcaatcc cggtcagata tttgcccgtc aggctttccg gattttgctg
    15181 aatttcatct ggcgtgccct gggccacaat acggcccccg tgaataccgg cacccggtcc
    15241 catatcaaca acgtaatcgg cagcacggat ggcgtcttca tcatgttcga caaccagcac
    15301 cgtattaccc aaatcgcgca gacgtttcag ggtttccagc aggcggtcat tatcacgctg
    15361 atgcaggccg atggagggtt cgtcaagcac atataaaacg cccgtcaggc ccgagccaat
    15421 ttgcgatgcc aggcgaatac gctggctttc cccacccgaa agcgtgcccg atgtacgcga
    15481 catggaaaga taatccagtc ccacatcgct caggaatttc aggcgatcat taatctcgcg
    15541 cagaatacga cgggcaattt cggtctgttt ttcatccagt ttcggctcaa gggatgcaaa
    15601 ccattcaacc gccttgccaa ttgaaagggc ggtgatttcg gatatggtgc atttatcaac
    15661 cttgaccgcc agggcttccg gcttcaggcg atgcccatgg caggcgtcac aatcggaaac
    15721 ggtctgatat tttgccagtt catcgcgggc ccactggctg tcggtttcgc gccagcgacg
    15781 cgacatgttg ggaataaccc cttcaaaggg gcgtgttacc ttatagctgc gcgaaccgtc
    15841 atcataggtg acagtcactt cttcctcgcc cgaaccatgc aaaataatgt gcttggcttt
    15901 ttccggcagt ttgtcccacg ctacgttggt tttgaagcca aaatgcttgg caatggcgcg
    15961 cagcgtttgc aggtaatatt tcgacgtgct gcttgcccat ggggcaattg caccgtcatc
    16021 cagcgttttg ctggtatcgg gcaccaccag ttcggggtcg aattccattt gtgtgccaag
    16081 gccatcacaa accgggcagg cgccaaacgg gttgttgaac gaaaacaggc gcggttcaat
    16141 ttcttcgatg gtgaaacccg aaaccgggca ggcaaatttt gctgaaaaaa gcgtgctgtc
    16201 accggaacgg gcatcttcgg caaatacgat accgtcggtc aattccagtg cagtttcaaa
    16261 gctgtcggca agacgcgtct ggatgccgtc cttgactaca aggcggtcga cgacgacgga
    16321 aatatcgtgt ttcagctttt tgtttaggtc gggggcttcg tcgatgtcat acatttcgcc
    16381 atcgattttc acacgctgaa aaccacgtgc gcgaaggtcc ttcaattcct ttttatattc
    16441 gcccttgcgg ccacgtacga tgggcgccag caggtaaagg cgtgtgcctt cttccatttc
    16501 catcacgcgg tcaaccattt gcgatacagt ctggctgaca atgggaaggc cggttgccgg
    16561 ggaatagggg atgcccacac gcgcccaaag aaggcgcata taatcgtaaa tctcggtaac
    16621 agtaccaacg gtcgagcgcg gattgcgcga tgtggttttc tgttcaatcg aaatggcggg
    16681 cgacagaccg tcgatatatt cgacatccgg cttttgcatc atttcaagga actggcgcgc
    16741 ataggccgac aggctttcga cataacggcg ctgaccttcg gcatagatgg tatcaaatgc
    16801 cagcgaactt ttgcccgaac ccgacaggcc ggtaatcacc acaagggaat cgcggggcag
    16861 ttcgacatca atgttctgca gattgtgttc tttcgcaccg cgaaccgaaa tttttgtcag
    16921 catgtccggc cttttgttcc tgaattgttc gggcaatgta tagaggagcc agtgattata
    16981 cgcaagtgcg cccctgacag ttcctgtcag gatttgtcag caggaaaaat gacaaagcct
    17041 ggaaaggtag tggaggggaa ttggcgcaaa agaaaacccg ccagaatgac ctggcgggtt
    17101 ttgcaaaatc aggcagggga ttaacccttg gcaatcgact tgccggtggc acccagatcc
    17161 ttgaaagctt cttcaaggcg ggctgccatg gaaagttcgg cttcacgaac ccagacgcgc
    17221 gggtcgtact gctttttgta cggtttgtcg tcttcgggat cgacctggta tttgaaggca
    17281 cggtcgtttt cttcgacata tttgccaacg gctgtcgcat aggcaaattg ggtgtcagta
    17341 tcgatattca ttttgaaaac gccgtagcca acagcttcct cgattttctc tttttccgag
    17401 ccggaaccac cgtggaaaac gaaatcaagc gggaagccat caaggccatg tttatcagca
    17461 acgtatttct gcgattccag aaggatttcc gggcgcagtt taacattgcc cggcttgtaa
    17521 acgccgtgca cgttgccaaa tgccgccgca accgagaaat gccccagcgg tttcaggcgg
    17581 cgataggctt cttcgacttc ttccggacgg gtataaaggc gcgggtcgat atcgtcctga
    17641 gtggtatcga gttccttgcc aataccgtct tcttcgccac cggtaacgcc cagttcgatt
    17701 tccaggctca tatcgacggc ggaaagacgt ttcagaaatt cctcgcaggt tgccagattt
    17761 tcttcgagcg gttcttccga aagatcaagc atatgcgagc tgaacaaagg cttgccggtt
    17821 tccttgtaag ctttttcacc ccagctcagc ataccatcga cccacggaac aaatttccgg
    17881 ttggcatgat cggtatgcag aactacggcg acgccataat attcagccag aagctgggta
    17941 tggcgggcgg cggaaacagc gccaataacg cgagcagcat tggcatcctt gatgcccttg
    18001 ccggcaaaga actgtgcacc accattcgaa agctgaatga tgatgtcaga accggcattg
    18061 gcagcagctt caagggcagc gttcagcgtg ctcgacgtgg taatattaac tgcgggtaga
    18121 gcgtaaccgt cctgcttgca ggcagcaacg agttcacgat aggcagcacc ggttacaaca
    18181 ccgggtttaa cggaaggcat gtgggtactc ctcctggggg tcagagcatt ataatattcc
    18241 ccgtgcgaac cgattgatcc gcagcaaggg cgatccgtaa ggagcccaat gcatccgtca
    18301 tatgctgcgt aaggtcaaga ttttccgtaa tagactttag gaaaaactgc tgttcacgca
    18361 gacacaggcc atcatgatcg ggctcgtcat cggttctgat gatttcgtcc gaatgtgaaa
    18421 attttccgtg cctatctgtg gcagcatgat gtaattttaa ggcattcgtc ttggtatgtg
    18481 tgtcgatatc ggccgatccg gcgccttttt cttctacatc tgtgattgat accgaccctt
    18541 tgggtccgat gacatcctta acgaaaaatg ccgtttcgct gatcatgggc ccccagcccg
    18601 cttcgtacca gccaatcgac ccatcatcaa accggacctg caattgcccg taattataca
    18661 tgtcctctgc aacatcgggc gacagacgca cccccattgc atgaaccgaa atcggctttg
    18721 ccgctgtcat caggcacatc atatcaaggt aatgcacacc acaatcaaca atcggcgaca
    18781 cgctgtttaa cagccgttta tgcgtttccc aaaattcgcc gctgctttgc tggttcaagt
    18841 tcattcgcat caccagcggc ttgccaagtg tctttgccag ttcgacaaac ttttcccacg
    18901 acgggtgatg gcgcaggata tagccgatca ccagtttacg gccatgcttc tgcgcacttt
    18961 caatgacgcg ttcggcatcg accacggtgg cggcaagcgg cttttcgata aagacatgcg
    19021 cacctgcttc aaaggcggca atggcgtaat ttgcatgggt ttctgtatag gtgttgatcg
    19081 aaaccgcatc aggcccggtt tgggcaaggg cggcttcgta atcggtaaat tgcgggtagg
    19141 attcaaatcc ttctggcagg gtgcccggtg cgatgggtga acgggtacat agcccaacaa
    19201 tctcaaaccc ttccagggca tggtatgcct tggcatggga catccccata ttgcccagtc
    19261 cgacgacaag gatgcgtacg ggtttcgacg tcattgggat gtctcctttg atgtaaggtt
    19321 tcagataagg tgtttatctt aaatacctga aatccaggtg tgttttacct gaaatgtttg
    19381 gggatcaaac aggaccatat cggcatcaaa acccggcatg attttgcctt ttctgtcatc
    19441 aagtttcaaa aatgccgccg gataaagcga ggccatgcgc agcgcttcgc ccagatcaac
    19501 gccaatcatc tcaacgctgt ttttcactgc cgaaatcata tcaaggtccg acccggcaag
    19561 ggtgccgtcg gcaagtgcac aacggccatc aatggcaata atttcttcgc cgcccagaac
    19621 aaaacgtttg gttttagcac ccaccgttgg catggcgtcg gtcaccagca tgatcttgcc
    19681 tgtctgtttt gcgcgaacgg caacctgcat ggcggccgga tgcacatggt aaccatcggc
    19741 aatcaggccg caccatgtgt ttttgtcagc tatggcagca cctgcaacgc cgggttcccg
    19801 gtgggcaagg gggctcatgg cgttaaacaa gtgggtaaag ccacgcaaac cttcgccaat
    19861 tgccttttgg atatcgtcat aggtcccggc agtatgcccg gcacaaacca gtacgccatt
    19921 ggcgcacagt ttttcaattg tgcccggttt tgctttttcc ggggccatgg ttaccaaaat
    19981 ccgcccattg ggcaggctgg acagaacctc aatcgcgtca tcttccatcg ggcggatgat
    20041 gttggcgtca tgcacgccct tgcgttccgt gttcagatac ggaccttcaa ggtggatacc
    20101 aacaatgccc ggcacgcctt cgtcgatggc gtttttaacc gcctcgatgg cggccagcat
    20161 tttttcgcgt gtgtcggtga taagggtggg cagcatggcc gttgtgccaa actggcgatg
    20221 ccccgccata atctggcgaa tgccatcaac gctgggcgtg tgatttagta aaacaccgcc
    20281 gccgccatta acctgcacgt cgataaaccc cggtgccagc acatttcccc gggcgtcaaa
    20341 aatgtcagcg ccatccggga tgcgggccgg tgacaccgca ctgactttac cgttttcaac
    20401 gatcacgcag gcatcatcgt gaaactgctc gccatcgaac agccgggcat ttttaatggc
    20461 aagaacagtc atcaatgcgt ctccgttacc ttggaaagat gcggcggtgc atcggggtta
    20521 cagccgcgca gcaacgcaac atgttcagcc agggtataaa atgtctggat catcgcaatt
    20581 ggcgctagca gcggatggat gtcttcgaca accggcaaat ggcccttggt cgcggcatca
    20641 ccggtttcgg cggcaaacag gttgtcggtc tttgcgcgca tttcagtcag aaaaccctca
    20701 acgctatcgc gggtggcgtc atcctgtgaa aagaccagca agggaaattc ccgtgttacc
    20761 agggccatgg ggccgtgttt gacttcggcg gcactaaagg cttcggcatg caggctgctg
    20821 gtttccttga atttaagggc agcttc
//
