LOCUS NC_015953 11311 bp DNA linear CON 08-MAY-2024 DEFINITION Streptomyces sp. SirexAA-E, complete sequence. ACCESSION NC_015953 VERSION NC_015953.1 KEYWORDS GSC:MIGS:2.1; RefSeq. SOURCE Streptomyces sp. SirexAA-E ORGANISM Streptomyces sp. SirexAA-E Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales; Streptomycetaceae; Streptomyces. REFERENCE 1 (bases 1 to 7414440) AUTHORS Takasuka,T.E., Acheson,J.F., Bianchetti,C.M., Prom,B.M., Bergeman,L.F., Book,A.J., Currie,C.R. and Fox,B.G. TITLE Biochemical properties and atomic resolution structure of a proteolytically processed beta-mannanase from cellulolytic Streptomyces sp. SirexAA-E JOURNAL PLoS One 9 (4), e94166 (2014) PUBMED 24710170 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 7414440) AUTHORS Lucas,S., Han,J., Lapidus,A., Cheng,J.-F., Goodwin,L., Pitluck,S., Peters,L., Ovchinnikova,G., Davenport,K., Detter,J.C., Han,C., Tapia,R., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Pagani,I., Adams,A., Raffa,K., Adams,S., Book,A., Currie,C. and Woyke,T. CONSRTM US DOE Joint Genome Institute TITLE Complete sequence of Streptomyces sp. SirexAA-E JOURNAL Unpublished REFERENCE 3 (bases 1 to 7414440) AUTHORS Lucas,S., Han,J., Lapidus,A., Cheng,J.-F., Goodwin,L., Pitluck,S., Peters,L., Ovchinnikova,G., Davenport,K., Detter,J.C., Han,C., Tapia,R., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Pagani,I., Adams,A., Raffa,K., Adams,S., Book,A., Currie,C. and Woyke,T. CONSRTM US DOE Joint Genome Institute TITLE Direct Submission JOURNAL Submitted (05-AUG-2011) US DOE Joint Genome Institute, 2800 Mitchell Drive B310, Walnut Creek, CA 94598-1698, USA COMMENT ##MIGS-Data-START## investigation_type :: bacteria_archaea project_name :: Streptomyces sp. SirexAA-E collection_date :: Missing lat_lon :: Missing depth :: Missing alt_elev :: Missing country :: Missing environment :: Soil, Terrestrial num_replicons :: 1 ref_biomaterial :: Missing biotic_relationship :: Free living trophic_level :: Missing rel_to_oxygen :: Aerobe isol_growth_condt :: Missing sequencing_meth :: WGS assembly :: Newbler v. 2.3 (pre-release) finishing_strategy :: Finished GOLD Stamp ID :: Gi05895 Funding Program :: DOE-GLBRC 2008 Source of Isolate :: Cameron Currie (currie@bact.wisc.edu) Cell Shape :: Filament-shaped Motility :: Nonmotile Sporulation :: Sporulating Temperature Range :: Mesophile Gram Staining :: Gram+ Diseases :: None ##MIGS-Data-END## ##Genome-Assembly-Data-START## Finishing Goal :: Finished Current Finishing Status :: Finished Assembly Method :: Newbler v. 2.3 Genome Coverage :: 30x Sequencing Technology :: 454/Illumina ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Name :: GCF_000177195.2-RS_2024_05_08 Annotation Date :: 05/08/2024 02:12:31 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 6.7 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA Genes (total) :: 6,619 CDSs (total) :: 6,532 Genes (coding) :: 6,385 CDSs (with protein) :: 6,385 Genes (RNA) :: 87 rRNAs :: 7, 6, 6 (5S, 16S, 23S) complete rRNAs :: 7, 6, 6 (5S, 16S, 23S) tRNAs :: 65 ncRNAs :: 3 Pseudo Genes (total) :: 147 CDSs (without protein) :: 147 Pseudo Genes (ambiguous residues) :: 0 of 147 Pseudo Genes (frameshifted) :: 61 of 147 Pseudo Genes (incomplete) :: 120 of 147 Pseudo Genes (internal stop) :: 8 of 147 Pseudo Genes (multiple problems) :: 39 of 147 ##Genome-Annotation-Data-END## ##antiSMASH-Data-START## Version :: 8.dev-cf2fc5ee(changed) Run date :: 2025-09-13 00:50:45 NOTE :: This is a single region extracted from a larger record! Orig. start :: 5953055 Orig. end :: 5964366 ##antiSMASH-Data-END## REFSEQ INFORMATION: The reference sequence is identical to CP002993.1. URL -- http://www.jgi.doe.gov JGI Project ID: 4086644 Source DNA and organism available from Cameron Currie (currie@bact.wisc.edu) Contacts: Cameron Currie (currie@bact.wisc.edu) Tanja Woyke (microbe@cuba.jgi-psf.org) Annotation done by JGI-ORNL and JGI-PGF Finishing done by JGI-LANL The JGI and collaborators endorse the principles for the distribution and use of large scale sequencing data adopted by the larger genome sequencing community and urge users of this data to follow them. it is our intention to publish the work of this project in a timely fashion and we welcome collaborative interaction on the project and analysis. (http://www.genome.gov/page.cfm?pageID=10506376). The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ COMPLETENESS: full length. FEATURES Location/Qualifiers region 1..11311 /candidate_cluster_numbers="1" /contig_edge="False" /product="RiPP-like" /region_number="20" /rules="(strepbact or Antimicrobial14 or Bacteriocin_IId or BacteriocIIc_cy or Bacteriocin_II or Bacteriocin_IIi or Lactococcin or Antimicrobial17 or Lactococcin_972 or Bacteriocin_IIc or LcnG-beta or Cloacin or Linocin_M18 or TIGR03651 or TIGR03693 or TIGR03601 or TIGR03795 or TIGR03975 or DUF692 or TIGR01193 or (YcaO or TIGR03604))" /tool="antismash" cand_cluster 1..11311 /candidate_cluster_number="1" /contig_edge="False" /detection_rules="(strepbact or Antimicrobial14 or Bacteriocin_IId or BacteriocIIc_cy or Bacteriocin_II or Bacteriocin_IIi or Lactococcin or Antimicrobial17 or Lactococcin_972 or Bacteriocin_IIc or LcnG-beta or Cloacin or Linocin_M18 or TIGR03651 or TIGR03693 or TIGR03601 or TIGR03795 or TIGR03975 or DUF692 or TIGR01193 or (YcaO or TIGR03604))" /kind="single" /product="RiPP-like" /protoclusters="1" /tool="antismash" protocluster 1..11311 /aStool="rule-based-clusters" /category="RiPP" /contig_edge="False" /core_location="[5000:6311](-)" /cutoff="20000" /detection_rule="(strepbact or Antimicrobial14 or Bacteriocin_IId or BacteriocIIc_cy or Bacteriocin_II or Bacteriocin_IIi or Lactococcin or Antimicrobial17 or Lactococcin_972 or Bacteriocin_IIc or LcnG-beta or Cloacin or Linocin_M18 or TIGR03651 or TIGR03693 or TIGR03601 or TIGR03795 or TIGR03975 or DUF692 or TIGR01193 or (YcaO or TIGR03604))" /neighbourhood="5000" /product="RiPP-like" /protocluster_number="1" /tool="antismash" proto_core complement(5001..6311) /aStool="rule-based-clusters" /tool="antismash" /cutoff="20000" /detection_rule="(strepbact or Antimicrobial14 or Bacteriocin_IId or BacteriocIIc_cy or Bacteriocin_II or Bacteriocin_IIi or Lactococcin or Antimicrobial17 or Lactococcin_972 or Bacteriocin_IIc or LcnG-beta or Cloacin or Linocin_M18 or TIGR03651 or TIGR03693 or TIGR03601 or TIGR03795 or TIGR03975 or DUF692 or TIGR01193 or (YcaO or TIGR03604))" /neighbourhood="5000" /product="RiPP-like" /protocluster_number="1" gene 663..2126 /gene="hemG" /locus_tag="SACTE_RS26160" /old_locus_tag="SACTE_5262" CDS 663..2126 /EC_number="1.3.3.4" /GO_function="GO:0004729 - oxygen-dependent protoporphyrinogen oxidase activity [Evidence IEA]" /GO_process="GO:0006779 - porphyrin-containing compound biosynthetic process [Evidence IEA]" /codon_start=1 /gene="hemG" /gene_functions="biosynthetic-additional (rule-based-clusters) DAO" /gene_functions="biosynthetic-additional (smcogs) SMCOG1222: dehydrogenase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015579252.1" /locus_tag="SACTE_RS26160" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5262" /product="protoporphyrinogen oxidase" /protein_id="WP_014049028.1" /sec_met_domain="DAO (E-value: 8e-07, bitscore: 27.5, seeds: 521, tool: rule-based-clusters)" /transl_table=11 /translation="MQRSLQHDPARTGHVVVIGGGIAGLAAAHRLLATGLRVTVLEATE RLGGKLMTGEVAGVRVDLGAESMLARRPEAVELATAAGLGDRLQPPATASASVWTGNAL RPMPKGHVMGVPGDPAALSGLLSSEGLARIAEERDLTPTAVGEDVGVGAYVADRLGREV VDRLVEPLLGGVYAGDAYRISMRAAVPQLFEVAREGGSLLDGVRRIQERAAVRQPTGPV FQGISGGLGTLPGAVADAVRAGGTEIRTATPVLGLTRTGQGWEVRTDSGVIAADGVVMA APAWSASTLLAAECPAASVELAGVEYASMALVTLAFRRSDVEGNEALAGRSGFLVPPVD GRTIKASTFSSNKWRWVADAAPDLFVLRTSVGRYGEEDHLHREDSELVAVSLKDLADAT GLTARPVDTVVTRWIGGLPQYPVGHLGRVARIREEVAKLPGLRVCGAVYDGVGIPACVA GAHRAADEIAEEIIATSTRVQGTPSEAGQ" gene 2131..2844 /gene="hemQ" /locus_tag="SACTE_RS26165" /old_locus_tag="SACTE_5263" CDS 2131..2844 /EC_number="1.3.98.5" /codon_start=1 /gene="hemQ" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018888541.1" /locus_tag="SACTE_RS26165" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5263" /product="hydrogen peroxide-dependent heme synthase" /protein_id="WP_014049029.1" /transl_table=11 /translation="MSAPEKLPNAGKKAKDLNEVIRYTLWSVFKLRDVLPADRAGYADE VQELFDQLAAKDVTVRGTYDVSGLRADADLMIWWHAETSDALQEAYNLFRRTTLGRALD PVWSNMALHRPAEFNKSHVPAFLADEEPRAYVSVYPFVRSYDWYLLPDEDRRRMLADHG KMARGFPDVRANTVPSFSLGDYEWILAFEADELYRIVDLMRHLRGSEARMHVREEVPFY TGRRKAVADLVAGLA" gene complement(3089..3895) /locus_tag="SACTE_RS26170" /old_locus_tag="SACTE_5264" CDS complement(3089..3895) /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014049030.1" /locus_tag="SACTE_RS26170" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5264" /product="TIGR04222 domain-containing membrane protein" /protein_id="WP_014049030.1" /transl_table=11 /translation="MLWVFFLLAAWGVAAVSCIRLCLLTAGAALLPADAPGKPLSRELS LYETAFLAGGPHRVADLALVSMHLRRRLLLAHTGWATVVDPEGRDEVERTVIRAIGPEG QSRIAPIRASAAAADTVRALADRLVAAGLAVPHGTRTALESAVRGVRGAAVLVVVLAAV TALMPGQDTGDAGPVLAWFGLPLVLTLGCLAIARVENQPYSPWASPSGQRWLDSLPAPV HGPDRDVLAAVAVRGVAAVGDPRLRAALLDGGRTVRPLGRASGAEI" misc_feature 3885..3905 /note="TFBS match to MntR, Manganese transport regulator, confidence: weak, score: 20.26" /tool="antismash" misc_feature complement(3886..3906) /note="TFBS match to MntR, Manganese transport regulator, confidence: weak, score: 20.42" /tool="antismash" misc_feature 3928..3943 /note="TFBS match to ColR, Phenol-responsive regulator, confidence: weak, score: 17.2" /tool="antismash" gene complement(3997..5004) /locus_tag="SACTE_RS26175" /old_locus_tag="SACTE_5265" CDS complement(3997..5004) /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014153333.1" /locus_tag="SACTE_RS26175" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5265" /product="TIGR04222 domain-containing membrane protein" /protein_id="WP_014049031.1" /transl_table=11 /translation="MNALALLLTLAVAVSSVLLLTVPAAGRRRAPGEPEPSVHDLSEAA FLGGGPARVVDTALTALYEDGRLVVGGPGIVAVRRDEARDPVERAVLQELAAAPSGALH VLRKAVMRHPAVQEIGDGLAARGLLVPPAENRTRRRWALVQGIGCVVAAPFAVVLTVLQ YALREGYADVPIPFVVKVLPALLAGAVVGLVTAKSAKARLTEAGRRAATRYRWTRTHVP GAAHLVATQGLGALPHTELRDQLLAAARHPSPGRSVPAGAGATADLYAGDAAWCAGAGP GGGGCGGSGDSGTGSSCGSGGGGSSCGGGSSCSSGSSCGGGSSCGGGSSCGSSS" gene complement(5001..6311) /locus_tag="SACTE_RS26180" /old_locus_tag="SACTE_5266" CDS complement(5001..6311) /codon_start=1 /gene_functions="biosynthetic (rule-based-clusters) RiPP-like: DUF692" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014049032.1" /locus_tag="SACTE_RS26180" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5266" /product="DUF692 domain-containing protein" /protein_id="WP_014049032.1" /sec_met_domain="DUF692 (E-value: 3.7e-90, bitscore: 300.3, seeds: 7, tool: rule-based-clusters)" /transl_table=11 /translation="MKLGIGIGWRPEIADALEGLPGIDWVEAVAENICEDHLPGPLARL RERGVTVVPHGVSLGLGGADRPDPDRLAGLAARATLLGAPLVTEHVAFVRAGGPLTASP GLEAGHLLPVPRTWDALDVLCENVRIAQDSLPVPLALENIAALIAWPDEEMTEGQFLAE LVERTGVRLLIDVANLHTNHVNRGEDPAAALDELPVEAIAYVHVAGGIEKDGVWHDTHA HPVTRPVLDVLTELRSRVDPPGVLLERDDAFPPAGELAGELTAIRATLAAAAAAPAPDR TPAGATARPAPAPGARERVAMAQTTLLSALVAGSPAPEGFDHARLAVQSRALAAKRADV VAEVAPELPRILGDGFRKAFLAYARTRPLSGGYRRDALDFAEHLLIAGRPADEAARRRL TRWWQDRAAPRPPRRGARVARAARTVLAGARPGRGSR" misc_feature 6573..6586 /note="TFBS match to ANR, Anaerobic transcriptional regulator, confidence: strong, score: 24.25" /tool="antismash" gene 6913..8265 /locus_tag="SACTE_RS26185" /old_locus_tag="SACTE_5267" CDS 6913..8265 /GO_function="GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds [Evidence IEA]" /GO_process="GO:0005975 - carbohydrate metabolic process [Evidence IEA]" /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014049033.1" /locus_tag="SACTE_RS26185" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5267" /product="family 43 glycosylhydrolase" /protein_id="WP_014049033.1" /transl_table=11 /translation="MTGTENRSHLDRRSLLTRAAGVGAAAALPTAAAGATPASASPASP ASPARRHPSNWPDPEPYGIADIRPDLWPREDNSFVLPLELRPRDKERGLVWMRDTYVNR FVVDGRPLYVATGTTRVPGLEAAGPWNDGIFVWLSRSLKGPWKLADTTRIRPGAEKGKV WSPEFVGENRPGRTVVAPWQEYWYDEQFGKRGQAWAPELHYFRGKWYMVACMGDHSKKV GSFMLVSEGGVEGPYRLVEGNVDKPFGDSFIGGPAFIEPGAYHHIDGSLYSEGDRAWLV LHNNLYARFRDDMEDIVTTTDLPLFKQTPYAPEPYLEGAYVFKHGGKYYLLHAAWDRTS INADGSTRQAYDTAGTGRVQYQYDAVVAVSDRFEGPYSRRWTAGVGAGHNNFFTDSDGT VWATFFRNPAFGHWSDPSRVADAAVPGVVRVEWTGPQGNRLYVRRRDGGRG" gene complement(8270..9043) /locus_tag="SACTE_RS26190" /old_locus_tag="SACTE_5268" CDS complement(8270..9043) /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018514518.1" /locus_tag="SACTE_RS26190" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5268" /product="peptidyl-tRNA hydrolase" /protein_id="WP_019993147.1" /transl_table=11 /translation="MSSDDVSRPLSAPPADSPFRGEPTARDEAPQYVLPLVVHLEKTDP PGRTDAVRTAARAVLTMLSDERSLGDGEWAQAVRDWEDARIRKVVRRARGAEWRKASAL PGITVTGESAEVRVFPPIPLDGWPKELAKLQVSGTELDDPVPPPAPDGTGPVLWLNPGP DMSAGKTMAQAGHGAQLAWWELSDAEREEWREAGFPLSVAVATPESWRELTASGLPVVV DAGFTEIAPGATVVTEGGSRFCPLPRGRRPRGASA" gene 9115..9792 /locus_tag="SACTE_RS26195" /old_locus_tag="SACTE_5269" CDS 9115..9792 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014153329.1" /locus_tag="SACTE_RS26195" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5269" /product="AIM24 family protein" /protein_id="WP_014049035.1" /transl_table=11 /translation="MKSDLFSSEHMAQQATAPGMTLQNAKSIKYAVDGEMHARQGSMIA FRGNLQFERKGQGIGGMLKRAVTGEGLPLMAVRGQGEAWFAHEAANCFIVDMEQGDVLT INGRNVLCFDATLAYEIKTVKGAGMTGGGLFNSVFTGQGKLGLMCDGHPIVIPVSARQP VYVDTDAVVGWSAQLSTSLHRSQSFGSMVRGGSGEAVQLMLQGEGFVIVRPSEVKQEKA SAN" gene 10013..10192 /locus_tag="SACTE_RS26200" /old_locus_tag="SACTE_5270" CDS 10013..10192 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014049036.1" /locus_tag="SACTE_RS26200" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5270" /product="hypothetical protein" /protein_id="WP_014049036.1" /transl_table=11 /translation="MDDAYCETPAPAPVPEDTGGPYAECVLCREPTEYPESTKGATLCP VCAWQEAGRTACSG" gene complement(10208..11074) /locus_tag="SACTE_RS26205" /old_locus_tag="SACTE_5271" CDS complement(10208..11074) /GO_function="GO:0016810 - hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds [Evidence IEA]" /GO_process="GO:0005975 - carbohydrate metabolic process [Evidence IEA]" /codon_start=1 /gene_functions="biosynthetic-additional (smcogs) SMCOG1235: polysaccharide deacetylase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014153326.1" /locus_tag="SACTE_RS26205" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="SACTE_5271" /product="polysaccharide deacetylase family protein" /protein_id="WP_014049037.1" /transl_table=11 /translation="MTALVILGGMALTGCGGPGAARTETLVPPASPSPVAKAAPERVEK PPTMAPGPAGLTPVFERRAPGGSGSATGSPGRDEKVVALTFDADMTADQGPRAEAGEHF DNPELIALLRRLKVPATVFMTGRWAEEYPSQARSIGTDPLFEIANHSYSHYAFSSPCYG LPTVEEGAMRGEVEQAFDAIRKTGARQVVPYFRFPGGCYDDASRKALAPTGVTAVQWDV VSGDAFATDADAVAQQVLDGVGPGSLVVMHCTRSAAPVTDEAVRQVVPALRERGYRFVK VSELMRG" ORIGIN 1 ctctcgacgt cgacgacctg ccccgtgacg tccttggcgt ccgacgaacg cgacagaagc 61 ctgcccgacc ccgccagctc ctccagcacg ccctcgtagc ggtcctgcgg cacgcgcagc 121 accaggtgcg agacctcgtg ggtgtcgtcc agccgctccg tggtctcctc ggcgaccagt 181 ccgccggcgc cttccgcggc cctgcgggcg gcggcgaccg ccttgggcac actcgccacc 241 tcgacgtcga gcgtggccgt acggatgacg tgggcggcga ccgcgggggc ctgcttcgcc 301 ttcgacgcgg ccccgtcggc ggaccgctca cccgccccct ccgcgcccgg tgccgccgcc 361 ttcccgtccg cggacgtggc ccccttgtcg gcggcggagc cgcctccgcc cccggccgag 421 cacccgccgg cgccgagcag cgccacgagc agggcggcgg ccggcacggc gcgtgccagg 481 cgcctgccgg accgtgcggg tggatgggtg tccgtgcggt gtccgtcgtg catgtcgcgt 541 cccccccaag ggccgttcgc cgatgatgcc ggttcgacgt acgggccggt gcgcgggttg 601 ccggacggcg gtgtcgaagc ggtcacgttc gggactcgtt cggggtttga gagagtggac 661 ccatgcagcg ttctcttcag cacgaccccg cgcgtacggg gcatgtcgtc gtcatcggcg 721 gcggcatcgc cggactcgcc gccgcccacc ggctgctcgc caccggactc cgggtcaccg 781 tcctggaggc gaccgagcgg ctcggcggca agctgatgac cggcgaggtc gccggcgtac 841 gggtggacct gggggccgag tcgatgctcg cgcggcgccc cgaggcggtc gagctggcca 901 ccgccgccgg tctcggcgac cggctccagc cgcccgccac cgcctccgcc tcggtctgga 961 ccgggaacgc cctgcgcccc atgcccaagg gccacgtcat gggcgtcccg ggggatccgg 1021 cggcgctcag cgggctgctc tcctccgagg ggctcgccag gatcgccgag gaacgcgacc 1081 tcacccccac cgccgtcggt gaggacgtcg gggtcggtgc ctatgtcgcc gaccggctcg 1141 gccgcgaggt cgtcgaccgg ctcgtggagc cgttgctcgg cggggtgtac gcgggcgacg 1201 cctaccggat ctcgatgcgg gccgccgttc cgcagctctt cgaggtcgcc cgggagggag 1261 gctcgctcct cgacggcgtc cggcggatcc aggagcgggc cgccgtcagg cagccgaccg 1321 gacccgtctt ccagggcatc tccggggggc tcggtacgct gccgggcgcc gtcgccgacg 1381 ccgtgcgggc gggcgggacc gagatccgta cggccacccc cgtcctcggc ctgacccgga 1441 ccgggcaggg ctgggaggtg cgcaccgaca gcggggtgat cgccgccgac ggtgtcgtga 1501 tggccgcgcc ggcctggtcc gcgtccaccc tgctcgccgc cgagtgcccg gcggcctccg 1561 tcgaactggc cggcgtggag tacgcctcga tggccctggt caccctcgcc ttccggcgct 1621 cggacgtcga gggcaacgag gcgctggcgg ggcgctccgg ctttctcgta ccgcccgtcg 1681 acggacgcac gatcaaggcg tccaccttct ccagcaacaa gtggcggtgg gtcgccgacg 1741 cggcccccga cctcttcgtg ctgcggacct cggtcgggcg ctacggcgag gaggaccacc 1801 tgcaccgaga ggactccgag ctcgtcgcgg tctcgctgaa ggacctggcc gacgcgaccg 1861 ggctgacggc gcggcccgtg gacaccgtcg tcacccggtg gatcggcggg ctgccgcagt 1921 accccgtcgg tcatctgggc cgggtcgccc ggatccgcga ggaggtggcg aagctgccgg 1981 gactgcgggt gtgcggcgcg gtctacgacg gggtgggcat ccccgcctgc gtggccggcg 2041 cccaccgggc cgcggacgag atcgcggaag agatcatcgc cacgtcgacc cgggttcagg 2101 gcactccgag cgaggcggga caatagccgt atgagtgcgc cagaaaagct tccgaacgca 2161 ggcaagaagg ccaaggacct caacgaggtc atccgctaca cgctgtggtc cgtcttcaag 2221 ctgcgagatg tgcttcccgc cgaccgggcc ggctacgccg acgaggtcca ggagctgttc 2281 gaccagctcg cggccaagga cgtcaccgtc cgcggcacct acgacgtctc cggtctgcgg 2341 gccgacgccg atctgatgat ctggtggcac gccgagacct ccgacgcgct ccaggaggcg 2401 tacaacctct tccggcgtac cacgctcggc cgcgcgctgg atccggtctg gtcgaacatg 2461 gccctgcacc ggcccgccga gttcaacaag tcgcacgtgc cggcgttcct cgccgacgag 2521 gagccgcgcg cctacgtcag cgtgtacccc ttcgtgcgca gctacgactg gtacctgctg 2581 ccggacgagg accgtcgccg tatgctcgcg gaccacggca agatggcccg tggcttcccg 2641 gacgtgcgcg ccaacacggt cccctcgttc tcgctcggcg actacgagtg gatcctggcc 2701 ttcgaggcgg acgagctgta ccgcatcgtc gacctcatgc gtcatctgcg aggctccgag 2761 gcgcggatgc acgtgcgcga agaggtcccg ttctacacgg gccgcaggaa ggccgtcgca 2821 gacttggtgg cggggctcgc atagccagcg tttcccggcc accccgaccc ggaagatcag 2881 ttggcgggca acggcgcgtg tgacgccgcc cgtcgttcca gcgacaccgg gtccggttcc 2941 gggagcacat cggccctgac cggcgaagtg cggtggcccg cgggggagtc agccattcct 3001 ccccgcgggt cgccgcctcg tgccggacgt ggccgatgtg tcccgggccg cacaaccgac 3061 gggtacgtcc cggggcgcct cgcagatctc agatctcggc gccggacgcc cggccgagcg 3121 ggcggaccgt acggccgccg tcgagcagcg ccgcccgcag ccgcgggtca cccaccgccg 3181 ccaccccccg cacggccacc gccgccagca cgtcgcggtc cgggccgtgg accggggccg 3241 gcagcgagtc cagccacctc tgcccggacg gcgaggccca cggactgtac ggctggttct 3301 ccacccgcgc gatggccagg cagcccagcg tgagcaccag cggcagcccg aaccacgcca 3361 ggaccgggcc cgcgtcgccc gtgtcctgac cgggcatcag cgcggtcacc gccgccagca 3421 ccaccaccag caccgccgcg ccgcgcaccc cccgcaccgc cgactccagg gcggtccgcg 3481 tgccgtgcgg tacggcgagc cccgccgcca ccaggcggtc ggcgagagcg cgtacggtgt 3541 cggcggcggc agcggacgcc cgtatcggcg ctatccggga ctgcccctcc gggccgatgg 3601 cccggatcac ggtgcgctcg acctcgtcgc ggccctccgg gtcgacgacc gtggcccagc 3661 ctgtgtgggc cagcagcaga cggcgccgca gatgcatcga gaccagggcc agatcggcga 3721 cccggtgcgg accgcctgcc aggaacgccg tctcgtacag gctgagttcg cgcgacagcg 3781 gcttaccagg ggcgtcggcc gggagcagcg cggcgcccgc ggtgaggagg cagagccgga 3841 tacaggagac ggccgccacg ccccacgcgg ccaggaggaa gaagacccag agcatgggcg 3901 atgtctatgc gaggcgggca cgcgagcgcc acggtctgtt cacgattcgg acagaagtcg 3961 gtcacagctg gtcacggctg gctgaaatcg tttggttcag gagctgctgc cacagctgga 4021 cccgccgccg cagctcgacc cgccgccgca gctggaccca ctgctgcagc tggagccgcc 4081 gccgcagctc gaaccgccac ctcccgaacc gcacgaggac cccgttcccg agtcgccgct 4141 cccgccgcag ccacccccgc cgggccccgc cccggcgcac cacgcggcgt cgcccgcgta 4201 caggtccgcg gtcgcgcccg cgcccgccgg gacggagcgc ccgggggaag ggtgccgggc 4261 ggcggcgagc agctggtccc gcagctccgt gtgcggcagt gcgcccagcc cctgggtcgc 4321 caccaggtgg gcggcaccgg gcacatgggt gcgggtccac cggtagcggg tggcggcccg 4381 gcgtcccgcc tcggtgagcc gggccttcgc ggatttggcg gtcaccagac cgacgacggc 4441 accggcgagg agcgccggca gcaccttgac gacgaacggg atcgggacgt ccgcgtaccc 4501 ctcccgaagg gcgtactgga gcacggtcag gacgacggcg aacggcgcgg ccacgacgca 4561 cccgatgccc tggacgagcg cccaccggcg gcgtgtgcgg ttctcggccg gtggcaccag 4621 cagtccccgg gcggccagcc cgtcaccgat ctcctgcacc gcagggtgcc gcatcacggc 4681 cttccgcagg acgtgcaggg cgccgctcgg cgcggcggcc agctcctgga gcacggcgcg 4741 ctccaccggg tcacgggcct cgtcccgtcg tacggcgacg atgccggggc cgccgaccac 4801 caggcggccg tcctcgtaca gggcggtgag cgcggtgtcg accaccctgg ccgggccgcc 4861 gccgaggaag gccgcctccg acaggtcgtg gaccgacggt tcgggttcgc cgggggcacg 4921 gcgccttccg gccgccggca cggtgagcag cagcacggag gagaccgcca cggcgagggt 4981 cagcagcagg gcgagggcgt tcaccggctt ccccttcccg ggcgggctcc ggcgaggacg 5041 gtgcgcgccg cccgggcgac ccgtgcgccg cggcgcggtg ggcgcggtgc cgcccggtcc 5101 tgccaccacc gggtcagcct ccgccgggcc gcctcgtcgg ccgggcgccc ggcgatgagc 5161 aggtgctccg cgaagtccag ggcgtcgcgg cggtaacccc cggacagggg gcgggtcctg 5221 gcgtacgcga ggaacgcctt ccggaagccg tcaccgagga tccggggaag ctccggggcc 5281 acctcggcga cgacgtcggc ccgcttcgcg gccagggcgc gactctgcac cgccagccgc 5341 gcgtggtcga agccctccgg cgcgggcgag cccgccacca gggcagacag cagcgtggtc 5401 tgcgccatcg ccacccgctc ccgggcgccc ggcgcgggtg ccggacgggc ggtggccccg 5461 gccggcgtcc ggtcaggggc cggggcggcc gccgccgcgg ccaaggtggc ccgtatcgcg 5521 gtcagttcgc ccgccagctc cccggccggc gggaaggcgt cgtcgcgttc cagcagcacc 5581 cccggcgggt ccacccgcga acgcagttcc gtcagcacgt ccagcacggg gcgcgtcacc 5641 gggtgggcgt gggtgtcgtg ccagacgccg tccttctcga tgccgcccgc cacatggacg 5701 tacgcgatgg cctccaccgg cagctcgtcc agtgcggccg cggggtcctc gccccggttg 5761 acgtggttgg tgtgcaggtt ggccacgtcg atcaggaggc gtacgccggt gcgctcgacc 5821 agctccgcca ggaactgccc ctcggtcatc tcctcgtccg gccaggcgat cagcgcggcg 5881 atgttctcca gggcgagcgg cacgggcagc gagtcctggg cgatgcggac gttctcgcac 5941 agcacgtcca gcgcgtccca ggtccgcggg acgggcagca gatggcccgc ctccaggccc 6001 ggcgacgcgg tgagcggccc cccggcccgg acgaaggcga cgtgctcggt caccagcggt 6061 gcgcccagca gtgtggcacg cgcggccagg ccggcgaggc ggtccgggtc gggccggtcc 6121 gccccgccca ggcccagcga gacgccgtgc gggacgacgg tgaccccgcg ttcccggagc 6181 cgtgcgaggg ggcccgggag gtggtcctcg cagatgttct ccgcgacggc ctcgacccag 6241 tcgatccccg gcagcccctc gagggcgtcc gcgatctccg gccgccagcc gatgccgatt 6301 cccagcttca tggtgtcccc tccccgcctc gtgcaggggt catggccccg cggcgcggcg 6361 ccgaatccga gaagggggac gttcagagct ggatctgagg ttcccggtgc cggggccttc 6421 cgcgcgcagg accgtccgcg acctggacga aaacctcggc gttatctgat cgtgatgttc 6481 cggccggtcg gtcatcacct cgcccacctg caaaaacacc gcgcacacgg cctcccggag 6541 ggccgggagc gccggattaa atcggcggtc tgttgacgga gatcaacagg cgttcgcatg 6601 ctgatcgaca tgtcaaccag ggacggctcc acggcacatt ccccgcgggg cgagacgtgc 6661 gcgcccgccg accgccgctc gcggaaggcg tgtccgggcg gggccgaggc ggccccggcg 6721 gccgtccctg tcccccgctg acaggcggac cgacgccccg gcccggccgc gacgcggtcg 6781 cgggcagacg tacggccccg ccgggcgcgg ggcatcccgt ggagagccgg cccgactcga 6841 tccgacccgg gcccgggcgg gcggtcctgc ccacgcaccg gcaacccgga cgaacgaagg 6901 agaaactgac tgatgaccgg caccgagaac aggtcgcacc tcgaccggcg gtccctgctg 6961 acccgagccg ccggtgtggg agcggcggcg gcactgccca cggccgccgc cggggcgacc 7021 ccggcgtcgg cctcgcccgc gtccccggcc tcgcccgccc gccgtcaccc ctccaactgg 7081 cccgaccccg agccctacgg catcgcggac atccggccgg acctgtggcc acgcgaggac 7141 aactccttcg tcctgccgct ggagttgcgc ccccgagaca aggaacgcgg cctggtgtgg 7201 atgcgggaca cctacgtcaa ccgcttcgtc gtcgacggcc gtccgctcta cgtcgcgacc 7261 ggaaccaccc gcgtacccgg gctggaggcg gccggaccgt ggaacgacgg catcttcgtc 7321 tggctgtccc gctccctcaa gggcccctgg aagctggccg acacgacccg catccggccc 7381 ggcgcggaga agggcaaggt gtggtcgccc gagttcgtcg gcgagaaccg gcccggccgc 7441 acggtcgtcg ccccctggca ggagtactgg tacgacgagc agttcggcaa gcgcggccag 7501 gcgtgggccc cggagctgca ctacttccgg ggcaagtggt acatggtcgc gtgcatgggt 7561 gaccactcga agaaggtcgg ctccttcatg ctcgtgagcg agggcggggt cgagggcccc 7621 taccggctcg tcgaggggaa cgtcgacaag cccttcggcg actcgttcat cgggggtccg 7681 gccttcatcg agcccggcgc ctaccaccac atcgacggga gcctctactc cgagggcgac 7741 cgcgcctggc tcgtgctcca caacaacctg tacgcccggt tccgcgacga catggaggac 7801 atcgtcacga cgacggacct cccgctgttc aagcagacgc cctacgcacc cgagccgtac 7861 ctcgaaggcg cctacgtctt caagcacggc ggcaagtact acctcctgca cgccgcgtgg 7921 gaccgtacgt cgatcaacgc cgacggcagc acccgccagg cctacgacac ggccggaacc 7981 ggccgcgtgc agtaccagta cgacgcggtg gtcgccgtct cggaccgctt cgagggcccg 8041 tactcacggc gatggaccgc cggggtcggc gccggccaca acaacttctt caccgactcc 8101 gacggcaccg tgtgggccac cttcttccgc aacccggcgt tcggccactg gtccgacccg 8161 tcgcgcgtcg ccgacgccgc cgtgcccggt gtcgtacggg tggagtggac cggcccgcag 8221 ggcaaccgcc tgtacgtccg gcggcgggac ggcgggcgcg gctgacgggt caggcggagg 8281 cgccccgggg ccgccgtccg cgcgggagcg ggcagaagcg actgcccccc tcggtcacga 8341 cggtcgcccc gggggcgatc tcggtgaaac cggcgtcgac caccaccggc aacccgctcg 8401 cggtgagttc ccgccagctc tcgggcgtcg cgacggcgac cgagagcggg aagccggcct 8461 cgcgccactc ctcgcgctcc gcgtcggaca gctcccacca ggcgagctgc gcgccgtgtc 8521 cggcctgcgc catcgtcttg cccgccgaca tgtccggccc ggggttgagc cagagcaccg 8581 ggccggtccc gtcgggcgcg ggcggcggga cggggtcgtc cagctccgta ccggacacct 8641 ggagcttggc cagctccttg ggccagccgt ccagagggat cggcgggaag acccgtacct 8701 cggcgctctc gcccgtgacc gtgatcccgg gcagtgccga ggccttgcgc cactccgcgc 8761 cgcgcgcccg gcgcaccacc ttgcggatcc gggcgtcctc ccagtcccgt acggcctggg 8821 cccactcccc gtcgcccagc gaacgctcgt ccgagagcat cgtgagcacc gcgcgggcgg 8881 cggtccgcac cgcgtcggtg cgccccgggg gatcggtctt ctccaggtgc accaccagcg 8941 gaagcacgta ctgcggtgcc tcgtcgcggg cggtcggctc accgcggaac gggctgtccg 9001 cgggcggggc ggaaaggggc cgggaaacgt cgtcactgct cacggaccca gtctgccagg 9061 cctccggaca gcacttcttg gcggaacaca cggctccggg tgaggatgca cgccatgaag 9121 agcgacctct tttccagcga gcacatggcc cagcaggcca ccgcccccgg tatgaccctg 9181 cagaacgcca aatccatcaa gtacgccgtc gacggcgaga tgcacgcgcg ccagggatcg 9241 atgatcgcct tccgcgggaa cctccagttc gagcgcaagg gccagggcat cggcggcatg 9301 ctcaagcgcg cggtcaccgg cgagggcctg ccgctcatgg cggtgcgcgg ccagggcgag 9361 gcctggttcg cccacgaggc ggccaactgc ttcatcgtgg acatggagca gggtgacgtc 9421 ctgaccatca acggccgcaa cgtgctgtgc ttcgacgcca ccctcgccta cgagatcaag 9481 accgtgaagg gcgcagggat gaccggcggc ggcctcttca acagcgtctt caccggccag 9541 ggcaagctcg gcctcatgtg cgacggccac cccatcgtga tccccgtcag cgcccggcag 9601 ccggtctacg tcgacacgga cgcggtcgtc ggctggagcg cccagctctc cacctcgctg 9661 caccggtccc agagcttcgg ctcgatggtg cgaggcgggt ccggcgaggc ggtccagctg 9721 atgctccagg gcgaggggtt cgtgatcgta cggcccagtg aggtcaagca ggagaaggcg 9781 tcggcgaact gaacccgcac ggggccggcc gccgtcacgg ccttcgcggg gagcgggtgc 9841 tgcgcggggt cggcctcgtg ctgcccccgg cacgccggtg cgcgtcgagg gcgccaacgg 9901 cacgggcagg tcggccgctc ccggcgtccg ccagggccgt gctcgccgcc gccgcggtgg 9961 ccgccggcac cgcccggctc gcctcgcggc gggagtgagt acgctcgacg gcatggacga 10021 cgcgtactgc gagacgcccg ctccggcgcc cgtacccgag gacacgggcg ggccgtatgc 10081 cgagtgcgtg ctgtgccggg agcccaccga gtatccggag tcgacgaagg gcgccaccct 10141 ctgcccggtc tgcgcctggc aggaggcggg ccgcacggcc tgctccggct gacggaggcc 10201 cacttcctca gccgcgcatc agctcggaga ccttcacgaa gcggtacccc cgctcgcgca 10261 gcgccgggac cacctggcgt acggcctcgt cggtgacggg cgccgcgctg cgcgtgcagt 10321 gcatgaccac cagcgaaccc ggcccgacgc cgtccagcac ctgctgggcc accgcgtccg 10381 cgtccgtggc gaaggcgtcg ccgctgacga cgtcccactg caccgccgtc acacccgtcg 10441 gcgccagggc cttgcgcgag gcgtcgtcgt agcagccgcc cgggaagcgg aagtacggca 10501 ccacctggcg tgcccccgtc ttccggatcg cgtcgaacgc ctgctccacc tcgccccgca 10561 tggcgccctc ctcgacggtc ggcagcccgt agcaggggga ggagaaggcg tagtggctgt 10621 aggagtggtt ggcgatctcg aacagcggat cggtgccgat cgagcgggcc tgggacgggt 10681 actcctcggc ccaccggccg gtcatgaaga ccgtcgccgg caccttcagc cggcgcagca 10741 gcgcgatcag ctccgggttg tcgaagtgct cgcccgcctc ggcgcgcggc ccctgatccg 10801 ccgtcatgtc ggcgtcgaag gtgagcgcca cgaccttctc gtcccgcccc ggggaaccgg 10861 tggccgagcc ggagccgccc ggcgcccggc gttcgaacac cggggtgaga ccggccggcc 10921 cgggggccat cgtgggcggc ttctccaccc gctccggggc cgccttcgcc accggactcg 10981 gactcgccgg gggcaccagg gtctcggtac gggcggcccc tgggccgccg cagcccgtga 11041 gtgccatccc gcccagaatt accagtgctg tcatcctact aaccgatttg atcactggag 11101 gacgttagct gacggcccgt cggatctggt cgcggcgctc cgggggagcg gtgctccggg 11161 ggagcggtga acgccgctca gtgccagggc cccgtcaccg cgaacgtggt ccccggggtg 11221 tagcagttca cgaacatcgt ggcgccgtcc ggcgcgaacg tgaccccggc gaactcgccc 11281 cactcgggct catcggcggt cccgatgtcc t //