LOCUS       NZ_GG657754            10405 bp    DNA     linear   CON 01-MAY-2024
DEFINITION  Streptomyces himastatinicus ATCC 53653 supercont1.1, whole genome
            shotgun sequence.
ACCESSION   NZ_GG657754
VERSION     NZ_GG657754.1
KEYWORDS    WGS; RefSeq.
SOURCE      Streptomyces himastatinicus ATCC 53653
  ORGANISM  Streptomyces himastatinicus ATCC 53653
            Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
            Streptomycetaceae; Streptomyces; Streptomyces violaceusniger group.
REFERENCE   1  (bases 1 to 10901646)
  AUTHORS   Fischbach,M., Godfrey,P., Ward,D., Young,S., Zeng,Q., Koehrsen,M.,
            Alvarado,L., Berlin,A.M., Bochicchio,J., Borenstein,D.,
            Chapman,S.B., Chen,Z., Engels,R., Freedman,E., Gellesch,M.,
            Goldberg,J., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D.I.,
            Hepburn,T.A., Howarth,C., Jen,D., Larson,L., Lewis,B., Mehta,T.,
            Park,D., Pearson,M., Richards,J., Roberts,A., Saif,S., Shea,T.D.,
            Shenoy,N., Sisk,P., Stolte,C., Sykes,S.N., Thomson,T., Walk,T.,
            White,J., Yandava,C., Straight,P., Clardy,J., Hung,D., Kolter,R.,
            Mekalanos,J., Walker,S., Walsh,C.T., Wieland-Brown,L.C., Haas,B.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform, Broad Institute
            Microbial Sequencing Center
  TITLE     Annotation of Streptomyces hygroscopicus strain ATCC 53653
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 10901646)
  AUTHORS   Fischbach,M., Godfrey,P., Ward,D., Young,S., Kodira,C.D., Zeng,Q.,
            Koehrsen,M., Alvarado,L., Berlin,A.M., Borenstein,D., Chen,Z.,
            Engels,R., Freedman,E., Gellesch,M., Goldberg,J., Griggs,A.,
            Gujja,S., Heiman,D.I., Hepburn,T.A., Howarth,C., Jen,D., Larson,L.,
            Lewis,B., Mehta,T., Park,D., Pearson,M., Roberts,A., Saif,S.,
            Shea,T.D., Shenoy,N., Sisk,P., Stolte,C., Sykes,S.N., Walk,T.,
            White,J., Yandava,C., Straight,P., Clardy,J., Hung,D., Kolter,R.,
            Mekalanos,J., Walker,S., Walsh,C.T., Wieland-Brown,L.C., Galagan,J.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform, Broad Institute
            Microbial Sequencing Center
  TITLE     Direct Submission
  JOURNAL   Submitted (09-FEB-2009) Broad Institute of MIT and Harvard, 7
            Cambridge Center, Cambridge, MA 02142, USA
REFERENCE   3  (bases 1 to 10901646)
  AUTHORS   Fischbach,M., Godfrey,P., Ward,D., Young,S., Kodira,C.D., Zeng,Q.,
            Koehrsen,M., Alvarado,L., Berlin,A.M., Borenstein,D., Chen,Z.,
            Engels,R., Freedman,E., Gellesch,M., Goldberg,J., Griggs,A.,
            Gujja,S., Heiman,D.I., Hepburn,T.A., Howarth,C., Jen,D., Larson,L.,
            Lewis,B., Mehta,T., Park,D., Pearson,M., Roberts,A., Saif,S.,
            Shea,T.D., Shenoy,N., Sisk,P., Stolte,C., Sykes,S.N., Walk,T.,
            White,J., Yandava,C., Straight,P., Clardy,J., Hung,D., Kolter,R.,
            Mekalanos,J., Walker,S., Walsh,C.T., Wieland-Brown,L.C., Galagan,J.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform, Broad Institute
            Microbial Sequencing Center
  TITLE     Direct Submission
  JOURNAL   Submitted (09-FEB-2009) Broad Institute of MIT and Harvard, 7
            Cambridge Center, Cambridge, MA 02142, USA
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method                   :: Arachne v. April 2007
            Genome Coverage                   :: 6x
            Sequencing Technology             :: ABI
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI RefSeq
            Annotation Name                   :: GCF_000158915.1-RS_2024_05_01
            Annotation Date                   :: 05/01/2024 03:02:38
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            (PGAP)
            Annotation Method :: Best-placed reference protein set; GeneMarkS-2+
            Annotation Software revision      :: 6.7
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA
            Genes (total)                     :: 9,680
            CDSs (total)                      :: 9,608
            Genes (coding)                    :: 8,720
            CDSs (with protein)               :: 8,720
            Genes (RNA)                       :: 72
            rRNAs                             :: 4, 1 (5S, 16S)
            complete rRNAs                    :: 4, 1 (5S, 16S)
            tRNAs                             :: 64
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 888
            CDSs (without protein)            :: 888
            Pseudo Genes (ambiguous residues) :: 0 of 888
            Pseudo Genes (frameshifted)       :: 423 of 888
            Pseudo Genes (incomplete)         :: 530 of 888
            Pseudo Genes (internal stop)      :: 36 of 888
            Pseudo Genes (multiple problems)  :: 97 of 888
            ##Genome-Annotation-Data-END##
            ##antiSMASH-Data-START##
            Version                           :: 8.dev-cf2fc5ee(changed)
            Run date                          :: 2025-09-12 20:01:42
            NOTE :: This is a single region extracted from a larger record!
            Orig. start                       :: 8099141
            Orig. end                         :: 8109546
            ##antiSMASH-Data-END##
            REFSEQ INFORMATION: The reference sequence is identical to
            GG657754.1.
            The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            Annotation was added to the scaffolds in August 2010.
FEATURES             Location/Qualifiers
     region          1..10405
                     /candidate_cluster_numbers="1"
                     /contig_edge="False"
                     /product="ectoine"
                     /region_number="28"
                     /rules="ectoine_synt"
                     /tool="antismash"
     cand_cluster    1..10405
                     /candidate_cluster_number="1"
                     /contig_edge="False"
                     /detection_rules="ectoine_synt"
                     /kind="single"
                     /product="ectoine"
                     /protoclusters="1"
                     /tool="antismash"
     protocluster    1..10405
                     /aStool="rule-based-clusters"
                     /category="other"
                     /contig_edge="False"
                     /core_location="[5000:5405](-)"
                     /cutoff="20000"
                     /detection_rule="ectoine_synt"
                     /neighbourhood="5000"
                     /product="ectoine"
                     /protocluster_number="1"
                     /tool="antismash"
     proto_core      complement(5001..5405)
                     /aStool="rule-based-clusters"
                     /tool="antismash"
                     /cutoff="20000"
                     /detection_rule="ectoine_synt"
                     /neighbourhood="5000"
                     /product="ectoine"
                     /protocluster_number="1"
     gene            complement(162..932)
                     /locus_tag="SSOG_RS34540"
                     /old_locus_tag="SSOG_06823"
     CDS             complement(162..932)
                     /GO_function="GO:0003677 - DNA binding [Evidence IEA]"
                     /GO_function="GO:0003700 - DNA-binding transcription factor
                     activity [Evidence IEA]"
                     /GO_process="GO:0006355 - regulation of DNA-templated
                     transcription [Evidence IEA]"
                     /codon_start=1
                     /gene_functions="regulatory (smcogs) SMCOG1195: IclR family
                     transcriptional regulator"
                     /gene_kind="regulatory"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007381154.1"
                     /locus_tag="SSOG_RS34540"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06823"
                     /product="IclR family transcriptional regulator"
                     /protein_id="WP_009718909.1"
                     /transl_table=11
                     /translation="MTAAETGGAQVKSAVRTVELLEFFAGRPGMHSLASVQEAVGYPKS
                     SLYMLLRTLVELGWVETDATGTRYGIGVRALLVGTSYIDGDEVVAAARPTLDRLSDDTT
                     ETIHLARLDGTNVVYLATRQSQHYLRPFTRVGRRLPAHSTSLGKALLATYTDEQVRKLL
                     PETLSPLTEHTITDREKLIEELHLIREQGYAVDREENTLGLRCFGIAIPYRTPSRDAIS
                     CSVPVARLTGAHEQMIKDALFDARDRLTLATRRL"
     gene            1019..2599
                     /locus_tag="SSOG_RS34545"
                     /old_locus_tag="SSOG_06824"
     CDS             1019..2599
                     /codon_start=1
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) Aldedh"
                     /gene_functions="biosynthetic-additional (smcogs)
                     SMCOG1017: aldehyde dehydrogenase"
                     /gene_kind="biosynthetic-additional"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014059810.1"
                     /locus_tag="SSOG_RS34545"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06824"
                     /product="aldehyde dehydrogenase (NADP(+))"
                     /protein_id="WP_078510655.1"
                     /sec_met_domain="Aldedh (E-value: 7.2e-42, bitscore: 142.6,
                     seeds: 89, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MCETQAAVTQQEESAVSAPVWSVDPRTGKQREQVAVEATAEEVDK
                     VVRAADAARGALTDRTVRAAFLRTAADLLDEARDTLVEAADAETALGPARLTGELARTS
                     YQLRSFATIVDEGAFLDVHVDHADATQTPPWPDLRRYKLPLGVVAVYAASNFPLAFSVP
                     GGDTASALAAGCPVVVKAHPDHPATSEVAASALRRAAEQVGLPADVISVVHGFDAGLEL
                     IKHPLIAAAGFTGSIRGGRALYDAAAARPVPIPFHGELGSLNPVVVTEAAAAERGEQIG
                     SGLSGSMTLGVGQFCTKPGFVLAPAGAAGDALVKSLTEAVSNTDAGVLLDHRMRDNFVA
                     GVKERAELADVEAPVTPGSGGEHSVSPGFLTVPARRLAEQGPHDVLLEECFGPVTVVAR
                     YESDAEISAVLGRLQGNLTATVQISAAEAEGTEGRAGELIAELTPLAGRVLVNGWPTGV
                     AVAPAQHHGGPYPATTSTSTSVGGTAVERWLRPVSYQDTPPALLPPELRDDNPLGLPRR
                     VNGIREQQG"
     misc_feature    2785..2798
                     /note="TFBS match to ArgR, Regulator of arginine
                     biosynthesis genes, confidence: weak, score: 16.85"
                     /tool="antismash"
     gene            2794..3846
                     /locus_tag="SSOG_RS34550"
                     /old_locus_tag="SSOG_06825"
     CDS             2794..3846
                     /NRPS_PKS="Domain: Aminotran_5 (13-342). E-value: 3.3e-23.
                     Score: 74.1. Matches aSDomain:
                     nrpspksdomains_SSOG_RS34550_Aminotran_5.1"
                     /codon_start=1
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) Aminotran_5"
                     /gene_functions="biosynthetic-additional (smcogs)
                     SMCOG1139: aminotransferase class V"
                     /gene_kind="biosynthetic-additional"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007448151.1"
                     /locus_tag="SSOG_RS34550"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06825"
                     /product="aminotransferase class V-fold PLP-dependent
                     enzyme"
                     /protein_id="WP_106516700.1"
                     /sec_met_domain="Aminotran_5 (E-value: 7.3e-17, bitscore:
                     60.4, seeds: 34, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MESLAGAEFAPQTTYLNTASSGLIPARSTAAMKAALDDAAVGVRY
                     TDAAFEAVEETRAAYARLVGVPARRVATGGSVVVYAGLIATSLPSGAEVLTAEGDFSSL
                     VNPFAVRRDLKLRTVPLEELAEAVRPETALIAVSAVQSADGRIADLDAIRDAARAHGAR
                     TLVDLSQAVGWLPVSAGEDDFTVAVGYKWLLCPHGTAFLTVPEDLGGLTPVFAGWRSGE
                     QPWDSCYGPVAEPARSARRYDESPALLAYVAARHSLALLHELGTEAVRAHDRALADRFR
                     AGITGLGRSCVPAPGSVIVSVPGLGEAAPRLAKADVQVSARAGNLRAAFHLYNTPADVD
                     RLLEVLGDGA"
     aSDomain        2833..3819
                     /aSDomain="Aminotran_5"
                     /aSTool="nrps_pks_domains"
                     /database="nrpspksdomains.hmm"
                     /detection="hmmscan"
                     /domain_id="nrpspksdomains_SSOG_RS34550_Aminotran_5.1"
                     /evalue="3.30E-23"
                     /label="SSOG_RS34550_Aminotran_5.1"
                     /locus_tag="SSOG_RS34550"
                     /protein_end="342"
                     /protein_start="13"
                     /score="74.1"
                     /tool="antismash"
                     /translation="TYLNTASSGLIPARSTAAMKAALDDAAVGVRYTDAAFEAVEETRA
                     AYARLVGVPARRVATGGSVVVYAGLIATSLPSGAEVLTAEGDFSSLVNPFAVRRDLKLR
                     TVPLEELAEAVRPETALIAVSAVQSADGRIADLDAIRDAARAHGARTLVDLSQAVGWLP
                     VSAGEDDFTVAVGYKWLLCPHGTAFLTVPEDLGGLTPVFAGWRSGEQPWDSCYGPVAEP
                     ARSARRYDESPALLAYVAARHSLALLHELGTEAVRAHDRALADRFRAGITGLGRSCVPA
                     PGSVIVSVPGLGEAAPRLAKADVQVSARAGNLRAAFHLYNTPADVDRL"
     gap             3871..3970
                     /estimated_length=unknown
     gene            complement(4109..4996)
                     /gene="thpD"
                     /locus_tag="SSOG_RS34555"
                     /old_locus_tag="SSOG_06826"
     CDS             complement(4109..4996)
                     /EC_number="1.14.11.55"
                     /codon_start=1
                     /gene="thpD"
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) PhyH"
                     /gene_kind="biosynthetic-additional"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009718912.1"
                     /locus_tag="SSOG_RS34555"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06826"
                     /product="ectoine hydroxylase"
                     /protein_id="WP_009718912.1"
                     /sec_met_domain="PhyH (E-value: 9.5e-38, bitscore: 129.7,
                     seeds: 39, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MTTVTDLYPTRGPLEVATPRVDPVVWSDPGAEGPMQPSELSDFDR
                     DGFLAIEQLITPDEVAVYRAELDRLIVDPDVRADERSIIEPKSQDVRSVFEVHRISEVF
                     ANLVRDPRVVGRARQILGSDVYVHQSRINVKPGFGASGFYWHSDFETWHAEDGLPNMRT
                     VSVSVALTENFDTNGGLMIMPGSHKTFLGCAGQTPKDNYKKSLQMQDAGIPSDEALSGF
                     ADKHGIKLFTGKAGSATWFDCNCMHGSGDNITPYSRSNVFIVFNSVENTAVEPFAAPVR
                     RPEHIGARDFTPVR"
     gene            complement(5001..5405)
                     /locus_tag="SSOG_RS34560"
                     /old_locus_tag="SSOG_06827"
     CDS             complement(5001..5405)
                     /codon_start=1
                     /gene_functions="biosynthetic (rule-based-clusters)
                     ectoine: ectoine_synt"
                     /gene_kind="biosynthetic"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009718913.1"
                     /locus_tag="SSOG_RS34560"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06827"
                     /product="ectoine synthase"
                     /protein_id="WP_009718913.1"
                     /sec_met_domain="ectoine_synt (E-value: 5.4e-53, bitscore:
                     177.3, seeds: 10, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MIVRSFKDIEGTERHVKAKSGTWESKRIVLAKEKVGFSLHETTLY
                     AGTETSMWYANHIEAVLCVEGEAELTNDENGEKHIITPGTMYLLDGHEKHTMRIKKDFR
                     CVCVFNPPITGREDHDENGVYPLITETEEA"
     misc_feature    complement(5423..5439)
                     /note="TFBS match to HexR, Glucose-responsive regulator,
                     confidence: weak, score: 16.6"
                     /tool="antismash"
     gene            complement(5447..6709)
                     /gene="ectB"
                     /locus_tag="SSOG_RS34565"
                     /old_locus_tag="SSOG_06828"
     CDS             complement(5447..6709)
                     /EC_number="2.6.1.76"
                     /GO_function="GO:0045303 - diaminobutyrate-2-oxoglutarate
                     transaminase activity [Evidence IEA]"
                     /NRPS_PKS="Domain: Aminotran_3 (28-363). E-value: 3e-73.
                     Score: 238.7. Matches aSDomain:
                     nrpspksdomains_SSOG_RS34565_Aminotran_3.1"
                     /codon_start=1
                     /gene="ectB"
                     /gene_functions="biosynthetic-additional
                     (rule-based-clusters) Aminotran_3"
                     /gene_functions="biosynthetic-additional (smcogs)
                     SMCOG1013: aminotransferase class-III"
                     /gene_kind="biosynthetic-additional"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007448159.1"
                     /locus_tag="SSOG_RS34565"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06828"
                     /product="diaminobutyrate--2-oxoglutarate transaminase"
                     /protein_id="WP_009718914.1"
                     /sec_met_domain="Aminotran_3 (E-value: 3.5e-77, bitscore:
                     258.9, seeds: 13, tool: rule-based-clusters)"
                     /transl_table=11
                     /translation="MTITPPALSVFEALESEVRSYCRGWPAVFDRAQGSHMYDEDGHTY
                     LDFFAGAGSLNYGHNNPVLKRALIDYIERDGVTHGLDMSTTAKRAFLESFQNNILRPRD
                     LPYKVMFPGPTGTNAVEAALKLARKVKGRESIVSFTNAFHGMSLGSLAVTGNAFKRAGA
                     GIPLVHGTPMPFDNYFDGAVEDFLWFERLLEDQGSGLNKPAAVIVETVQGEGGINVARA
                     EWLRALSELCERQDMLLIVDDIQMGCGRTGAFFSFEEAGITPDIVTVSKSISGYGMPLA
                     LTLFKPELDIWEPGEHNGTFRGNNPAFVTAAATLDTYWADGQMEKQTLARGEQVEQGLR
                     AICDENSGISHRGRGLVWGMEFEDKPRASAVCKRAFELGLLVETSGPESEVVKLLPALT
                     MTPEELDEGLRILARAVRETA"
     aSDomain        complement(5621..6625)
                     /aSDomain="Aminotran_3"
                     /aSTool="nrps_pks_domains"
                     /database="nrpspksdomains.hmm"
                     /detection="hmmscan"
                     /domain_id="nrpspksdomains_SSOG_RS34565_Aminotran_3.1"
                     /evalue="3.00E-73"
                     /label="SSOG_RS34565_Aminotran_3.1"
                     /locus_tag="SSOG_RS34565"
                     /protein_end="363"
                     /protein_start="28"
                     /score="238.7"
                     /tool="antismash"
                     /translation="FDRAQGSHMYDEDGHTYLDFFAGAGSLNYGHNNPVLKRALIDYIE
                     RDGVTHGLDMSTTAKRAFLESFQNNILRPRDLPYKVMFPGPTGTNAVEAALKLARKVKG
                     RESIVSFTNAFHGMSLGSLAVTGNAFKRAGAGIPLVHGTPMPFDNYFDGAVEDFLWFER
                     LLEDQGSGLNKPAAVIVETVQGEGGINVARAEWLRALSELCERQDMLLIVDDIQMGCGR
                     TGAFFSFEEAGITPDIVTVSKSISGYGMPLALTLFKPELDIWEPGEHNGTFRGNNPAFV
                     TAAATLDTYWADGQMEKQTLARGEQVEQGLRAICDENSGISHRGRGLVWGMEFE"
     gene            complement(6841..7389)
                     /gene="ectA"
                     /locus_tag="SSOG_RS34570"
                     /old_locus_tag="SSOG_06829"
     CDS             complement(6841..7389)
                     /EC_number="2.3.1.178"
                     /GO_function="GO:0033816 - diaminobutyrate
                     acetyltransferase activity [Evidence IEA]"
                     /GO_process="GO:0019491 - ectoine biosynthetic process
                     [Evidence IEA]"
                     /codon_start=1
                     /gene="ectA"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014059805.1"
                     /locus_tag="SSOG_RS34570"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06829"
                     /product="diaminobutyrate acetyltransferase"
                     /protein_id="WP_009718915.1"
                     /transl_table=11
                     /translation="MTAAQADRVGARSEIDLPEGLSLDTPRVEDGAAIWRIARDSKTLD
                     LNSSYSYLLWCRDFAATSVVARDAEGGPVGFITGYIRPDRPETLVVWQVAVDQAWRGRG
                     LAATLLDGLTARVAATGIRGVETTITPDNTASNRLFTSFAERHGAPVEHEVLFDGGLFP
                     DGGHEPEVLYRIGPLADRG"
     misc_feature    complement(7547..7562)
                     /note="TFBS match to CRP, Cyclic AMP receptor protein,
                     confidence: weak, score: 16.03"
                     /tool="antismash"
     misc_feature    7557..7572
                     /note="TFBS match to AfsQ1, Two-component system AfsQ1-Q2,
                     activator of antibiotic production, confidence: medium,
                     score: 17.62"
                     /tool="antismash"
     gene            complement(7760..8539)
                     /locus_tag="SSOG_RS34575"
                     /old_locus_tag="SSOG_06830"
     CDS             complement(7760..8539)
                     /codon_start=1
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014059804.1"
                     /locus_tag="SSOG_RS34575"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06830"
                     /product="hypothetical protein"
                     /protein_id="WP_039939356.1"
                     /transl_table=11
                     /translation="MIDHRDDRRDVAAELREAAEAHQPDRARMLARVTAGMAAGERRSG
                     RRVPARQWTRVVGPAAGLAAAVAAAGIAVVTTDTADGPQTVRTSSEPAGPPAGEGGGTA
                     RRPGGGAGDGAGRHTPSAYPTSVGPNHLADPAVRSRGAINPNSNPYWAQSDITVTTSKP
                     LTSLTVELRVAENGGVHTTGSWSTLPVDDLAVSVRSEGGVLVYRWTLRKGATVPAGRHV
                     FAGQYNHAEGSRDAGRDRYSASGNGPSGAFAVRGDFP"
     gene            complement(8539..9234)
                     /locus_tag="SSOG_RS34580"
                     /old_locus_tag="SSOG_06831"
     CDS             complement(8539..9234)
                     /GO_function="GO:0003677 - DNA binding [Evidence IEA]"
                     /GO_function="GO:0003700 - DNA-binding transcription factor
                     activity [Evidence IEA]"
                     /GO_function="GO:0016987 - sigma factor activity [Evidence
                     IEA]"
                     /GO_process="GO:0006352 - DNA-templated transcription
                     initiation [Evidence IEA]"
                     /GO_process="GO:0006355 - regulation of DNA-templated
                     transcription [Evidence IEA]"
                     /codon_start=1
                     /gene_functions="regulatory (smcogs) SMCOG1032: RNA
                     polymerase, sigma-24 subunit, ECF subfamily"
                     /gene_kind="regulatory"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_019433895.1"
                     /locus_tag="SSOG_RS34580"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /old_locus_tag="SSOG_06831"
                     /product="SigE family RNA polymerase sigma factor"
                     /protein_id="WP_078510465.1"
                     /transl_table=11
                     /translation="MSAPTDGPSAAATTPTSGLPADGLPAADAQAEFREFFERHHAELA
                     RLAHLLLGSSDGADDLAADALVAVWHRWDRVRVAEHPAAYARGVVANLARSRIRSLVRE
                     RRRVALFSSERLDRVDDPDVSAMVDVQAALVRLPFRKRACVVLRHAFDLSERDTARTLG
                     ISVGTVKSQTSRGIAELERLLGPPDDDTERPAAGTQAQAPPPAPSAQETVRGLRPGRVG
                     HAGHPGGNG"
     misc_feature    complement(9163..9165)
                     /note="tta leucine codon, possible target for bldA
                     regulation"
                     /tool="antismash"
     misc_feature    complement(9178..9180)
                     /note="tta leucine codon, possible target for bldA
                     regulation"
                     /tool="antismash"
     misc_feature    complement(9221..9235)
                     /note="TFBS match to CelR, Cellobiose uptake repressor,
                     confidence: weak, score: 22.04"
                     /tool="antismash"
     misc_feature    9222..9236
                     /note="TFBS match to CelR, Cellobiose uptake repressor,
                     confidence: weak, score: 22.04"
                     /tool="antismash"
ORIGIN
        1 cgacggcgcg cagcgcccgg cggggagcgg tggtggcgtg gggctgtgtc atgtacacca
       61 gccttgcgga gcgtccggct ccgcacctcg ccctgtccga cgatccgcct ccgccgtgcg
      121 gcggaggccg tcgtcctccc cgcgacccgc ggccttcgga ttcagagacg gcgcgtggcg
      181 agcgtgaggc ggtcgcgggc gtcgaacagc gcgtccttga tcatctgctc atgggccccg
      241 gtcagccggg ccaccggcac cgagcagctg atggcgtccc gggagggggt gcggtagggg
      301 atggcgatgc cgaagcagcg cagccccagg gtgttctcct cgcggtccac cgcgtacccc
      361 tgctcgcgga tcaggtgcag ttcctcgatc agcttctcgc ggtcggtgat cgtgtgctcg
      421 gtgagcgggc tcagcgtctc cggcagcagc ttgcgcacct gctcgtcggt gtacgtggcc
      481 agcagcgcct tgcccagcga ggtggagtgg gccgggaggc ggcggccgac gcgggtgaag
      541 gggcgcagat agtgctgcga ctggcgggtg gcgaggtaga ccacgttggt gccgtcgagc
      601 cgggcgaggt ggatggtctc ggtggtgtcg tccgagagcc ggtccagcgt gggccgggcc
      661 gccgccacga cctcgtcgcc gtcgatgtag gacgtgccga cgagcagcgc ccgcaccccg
      721 atgccgtacc gggtgccggt cgcgtccgtc tccacccagc ccagttcgac cagggtgcgc
      781 agcagcatgt acaggctcga cttggggtaa cctacggcct cctgcacgga ggccaggctg
      841 tgcatcccgg gccggccggc gaagaactcc agcagttcga ccgtgcggac cgccgacttg
      901 acctgggccc cgcccgtctc ggcagctgtc atcgccttga cccctctgtt cgaccagcca
      961 tagagtcccg atggaattca tcaaccggga cagcgttcag catatcgaac acccgtcgat
     1021 gtgcgagacg caagcagcgg taacacagca ggaggaatcc gcggtgtcag caccagtgtg
     1081 gagcgtcgac ccccgaaccg gaaagcagcg cgagcaggta gcggtcgaag ccacagcgga
     1141 ggaggtcgac aaggtggtgc gggcggccga cgccgcgcgc ggcgccctta ccgaccgtac
     1201 ggtgcgcgcc gccttcctgc gcaccgccgc cgatctgctc gacgaggccc gtgacacgct
     1261 cgtcgaggcc gccgacgccg agaccgcgct cggcccggcc cggctcaccg gcgagctggc
     1321 ccgcaccagc taccagctcc ggtccttcgc gacgatcgtg gacgagggtg ccttcctcga
     1381 cgtccacgtc gaccacgccg acgccaccca gaccccgccg tggcccgatc tgcgccgcta
     1441 caagctgccg ctcggcgtcg tcgccgtcta cgccgccagc aacttcccgc tggccttctc
     1501 cgtgcccggt ggcgacaccg ccagcgccct cgccgcgggc tgcccggtcg tcgtcaaggc
     1561 ccaccccgac caccccgcga cctccgaggt ggccgcctcc gcgctgcgcc gggccgccga
     1621 gcaggtcggg ctgcccgccg acgtgatctc cgtggtgcac ggcttcgacg cgggcctcga
     1681 actcatcaag cacccgctga tcgccgccgc gggcttcacc ggctcgatcc gcggcggccg
     1741 cgccctgtac gacgccgcgg ccgcccggcc ggtgccgatc cccttccacg gcgaactggg
     1801 cagcctcaac ccggtcgtgg tcaccgaggc cgccgccgcc gagcgcggcg agcagatcgg
     1861 ctccgggctc agcggctcga tgacgctggg cgtgggccag ttctgcacca agcccggctt
     1921 cgtgctggcc cccgcgggcg ccgctgggga cgcgctggtc aagtcgctca ccgaggccgt
     1981 cagcaacacc gacgccgggg tgctgctcga ccaccgcatg cgcgacaact tcgtggcggg
     2041 cgtcaaggag cgtgccgagc tggcggacgt cgaggcgccg gtgacccccg gttccggtgg
     2101 cgagcacagc gtcagccccg gcttcctcac cgtcccggcc cggcgcctcg ccgagcaggg
     2161 cccgcacgat gtgctgctgg aggagtgctt cggcccggtg accgtcgtcg cccggtacga
     2221 gagcgacgcg gagatctccg ccgtcctcgg ccgcctccag ggcaacctga ccgccaccgt
     2281 ccagatctcc gccgccgagg ccgagggcac cgagggccgg gccggtgagc tgatcgccga
     2341 gctgaccccg ctcgcgggcc gggtcctggt caacggctgg ccgaccggcg tcgccgtcgc
     2401 ccccgcccag caccacggcg gcccgtaccc ggcgacgacg tccacctcca cctcggtggg
     2461 cggcaccgcc gtcgagcgct ggctgcggcc cgtcagctac caggacaccc cgcccgcact
     2521 gctcccgccg gagctgcgcg acgacaaccc gctcgggctg ccccgccggg tgaacggcat
     2581 ccgggaacag cagggctgac tccccggtga cgacacggcc gtcgggcgcg cggacctgac
     2641 ccgcccgccc ggcggccgtt tcgcgttccg gcgttccgca ttccggtagc gggaatcagt
     2701 gcaggtcaga ggggatacgt aagcatccgt ggcgggcacg cttcagagat gtgcgatgga
     2761 cgccggggcg ggcccgtccc atagtggatg gacatggaga gcctggcggg cgccgagttc
     2821 gcgccacaga cgacgtatct caacaccgct tccagcggtc tgatccccgc acgctccacc
     2881 gccgccatga aggcggccct cgacgacgcg gccgtcggcg tccgctatac cgatgcggcc
     2941 ttcgaggcgg tcgaggagac ccgcgccgcc tacgcccggc tggtcggcgt accggcacgg
     3001 cgggtcgcga cgggcggctc ggtcgtcgtg tacgccggac tgatcgccac ctctctgccg
     3061 tccggcgccg aagtcctcac cgccgagggc gacttcagct ccctggtcaa ccccttcgcc
     3121 gtacgccgcg acctcaagct gcgcaccgta ccgctggagg agctggcgga ggcggtacgt
     3181 ccggaaaccg cgctgatcgc ggtgagcgcc gtccagtcgg cggacggacg gatcgccgac
     3241 ctcgacgcga tccgggacgc cgcccgggcg cacggggccc gcaccctggt ggatctcagc
     3301 caggccgtcg gctggctgcc ggtgagcgcg ggggaggacg acttcaccgt ggccgtcggc
     3361 tacaaatggc tgctgtgccc gcacggcacc gccttcctca ccgtgcccga ggacctgggc
     3421 ggcctcaccc cggtcttcgc gggctggcgc tcgggggagc agccgtggga cagctgctac
     3481 gggccggtgg ccgagccggc ccgttcggcg cggcgctacg acgagagccc ggccctgctc
     3541 gcgtacgtcg cggcccggca ctcgctcgcc ctgctccatg agctgggcac cgaggccgtc
     3601 cgcgcccacg accgcgcgct cgccgaccgc ttccgcgccg ggatcaccgg cctggggcgc
     3661 tcctgcgtgc ccgcccccgg ctcggtgatc gtctccgtac cgggcctcgg tgaggccgcg
     3721 ccccggctgg ccaaggcgga cgtacaggtc tcggcgcggg ccgggaacct gcgcgccgcc
     3781 ttccacctct acaacacccc cgccgacgtg gaccgtctgc tggaagtgct gggggacggt
     3841 gcgtagcggg aaaacacaga tggccagggg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3961 nnnnnnnnnn gccgtcatga tcgcctcgac actggccacc ggcgttctga caccacgctt
     4021 cggcccgaaa ccgatcgtcc cgggcgtgta ccgccctccc cctggccacc cacggtcacg
     4081 ggaccggtgg ttcacctcac cggggtgttc acctcaccgg ggtgaagtcc cgggccccga
     4141 tgtgctccgg ccgccgcacc ggcgccgcga acggctccac cgccgtgttc tccacgctgt
     4201 tgaacacgat gaagacgttg ctgcgcgagt agggcgtgat gttgtcaccc gagccgtgca
     4261 tgcagttgca gtcgaaccag gtcgccgaac cggccttgcc ggtgaacagc ttgatgccgt
     4321 gcttgtcggc gaagccggag agcgcctcgt ccgacggaat cccggcgtcc tgcatctgga
     4381 gcgacttctt gtagttgtcc ttcggggtct ggcccgcgca gccgaggaag gtcttgtgcg
     4441 aacccggcat gatcatcagg ccgccgttgg tgtcgaagtt ctcggtcagg gcgaccgaga
     4501 ccgacacggt ccgcatgttc ggcagaccgt cctcggcgtg ccaggtctcg aagtccgagt
     4561 gccagtagaa gcccgaggcc ccgaaacccg gcttgacgtt gatccggctc tggtggacgt
     4621 agacgtccga gccgaggatc tgccgggcgc gacccactac ccgcgggtcg cgcaccaggt
     4681 tcgcgaacac ctcgctgatc cggtgcacct cgaacaccga ccgtacgtcc tgcgacttcg
     4741 gctcgatgat cgagcgctcg tcggcgcgca cgtccgggtc gacgatcagc cggtccagct
     4801 cggcgcggta gaccgcgacc tcgtccggag tgatgagctg ctcgatcgcc agaaagccgt
     4861 cgcgatcgaa atcgctcagc tcggacggtt gcatcggacc ctcggccccg gggtcggacc
     4921 acacgacggg gtccacgcgg ggcgtggcca cctcgagggg accgcgggtc gggtacaggt
     4981 cagtgaccgt ggtcatcgga tcatgcctcc tcggtctcgg tgatcagggg gtataccccg
     5041 ttctcatcat ggtcctcccg tccggtgatc ggagggttga agacgcaaac gcatcgaaag
     5101 tcctttttga tgcgcatggt gtgcttctcg tgcccgtcca gcaggtacat ggtgccgggc
     5161 gtgatgatgt gcttctcacc gttctcgtcg ttggtgagct cggcctcgcc ctcgacgcag
     5221 aggaccgcct cgatgtggtt ggcgtaccac atggacgtct cggtgcccgc gtacagcgtg
     5281 gtctcgtgaa gcgagaagcc gaccttctcc ttcgcgagca cgatgcgctt gctctcccac
     5341 gtacccgact tggccttgac gtgcctttcg gttccctcga tgtccttgaa ggatcggacg
     5401 atcacggtga gtcgctacct ttctgcagtt acgttactga gggtgttcag gcggtctctc
     5461 ggaccgcccg ggcgaggatc cgcagaccct cgtccagctc ttcgggcgtc atcgtcagcg
     5521 ccggcagcag cttgacgacc tcgctctccg gaccggaggt ctcgaccagc aggccgagct
     5581 cgaaggcgcg cttgcagacg gccgaggcgc gcggcttgtc ctcgaactcc atgccccaca
     5641 ccaggccgcg gccgcggtgg ctgatgccgg agttctcgtc gcagatcgcg cgcagcccct
     5701 gctcgacctg ctcaccgcgg gccagggtct gcttctccat ctggccgtcg gcccagtagg
     5761 tgtccagggt cgcggcggcg gtcacgaacg cggggttgtt gccacggaac gtgccgttgt
     5821 gctcaccagg ctcccagatg tccagctccg gcttgaagag ggtgagcgcc agcggcatgc
     5881 cgtaaccgct gatcgacttg gagaccgtga cgatgtccgg ggtgatgccc gcctcctcga
     5941 aggagaagaa ggcaccggtg cggccgcagc ccatctggat gtcgtcgacg atcagcagca
     6001 tgtcctggcg ctcgcacagc tcggacaggg cgcgcagcca ctcggcacgg gcgacgttga
     6061 tgccgccctc gccctgcacc gtttcgacga tcacggcggc gggcttgttg aggccggagc
     6121 cctggtcctc gagcagccgc tcgaaccaca ggaagtcctc gaccgcgccg tcgaagtagt
     6181 tgtcgaacgg catcggggtg ccgtggacca gcgggatacc cgcaccggcc cgcttgaagg
     6241 cgttgccggt gacggcgagc gagccaagcg acatgccgtg gaaggcgttg gtgaacgaga
     6301 cgatcgactc gcgccccttg accttacggg cgagcttcag cgcggcctcg accgcgttgg
     6361 tgcccgtcgg gccggggaac atgaccttgt agggcaggtc acgcgggcgc aggatgttgt
     6421 tctggaacga ctccaggaac gcgcgcttgg ccgtggtgga catatcgaga ccgtgcgtga
     6481 cgccgtcacg ctcgatgtag tcgatcagcg cccgtttcag gaccgggttg ttgtggccgt
     6541 agttgagcga cccggctccg gcgaagaagt cgaggtacgt gtggccgtcc tcgtcgtaca
     6601 tgtggctgcc ctgcgcgcgg tcgaagacgg caggccaacc gcggcagtag ctgcgcacct
     6661 ccgactccag ggcctcgaag acgctcaggg cgggcggggt gatggtcaca gcattctcct
     6721 gggagatgag tgcgaggggt gagggggtga gggcgtgcgc cctcggggtg ttcaggggct
     6781 cgctgtacgg gggatcgggg gcagaaatcg gggatgacct cgggatgccg ttaccggggc
     6841 tcagccgcgg tccgcgagcg gaccgatgcg gtagagcacc tcgggctcgt gccctccgtc
     6901 ggggaacagg ccgccgtcga agagcacctc gtgctcgacg ggcgcgccgt gccgctcggc
     6961 gaaggacgtg aacagccggt tcgacgcggt gttgtccggg gtgatcgtgg tctcgacgcc
     7021 ccggatgccc gtcgcggcga cgcgcgcggt cagcccgtcc agcagggtgg cggccagtcc
     7081 gcgaccgcgc cacgcctgat cgacggcgac ctgccagacc accagggtct cgggccggtc
     7141 ggggcggatg tagccggtga tgaagccgac cggtccgccc tcggcgtcgc gcgccaccac
     7201 ggaggtggcg gcgaagtcgc ggcaccacaa caggtagctg taggaggagt tgaggtccag
     7261 agtcttggag tcgcgggcga ttcgccagat cgcggctccg tcctcgacgc gtggtgtgtc
     7321 gagggacagc ccctctggca aatcaatttc gctacgggcg cctactcggt ctgcttgtgc
     7381 ggcggtcatg cgaattcaat ttacccagca gaatctaaaa atgcatcgag gagaggggtt
     7441 acgtagacgg gggtcgatgt ggtaacacgc gagtgcgcgc gcacgcgcgg cagatcgttg
     7501 aaaagcgcta ttttgtgggg gagttagcgg gtcgaaacgg tcggcgtgtg gagtgtgtca
     7561 catgtgcgta acgggtgcgt gatgacaccg aattaccgtg tcctggacca ctaaaaaaag
     7621 tgcgtttaga gatcggaaga gcgggaagac gagataggaa gatgtgcatg aaaagcgccc
     7681 cgaatttgtc ttccggtctt ccggaattca ttccaccggg cttggcgcat cgcttctcgc
     7741 acggaacata cccccgaact cacggaaagt cgccacggac ggcgaatgcg ccggagggcc
     7801 cgttgccgga ggcggagtag cggtcccgcc cggcgtcccg gctgccctcg gcgtggttgt
     7861 actgccccgc gaagacatgg cgcccggccg gcaccgtggc ccccttccgg agcgtccagc
     7921 ggtagaccag caccccgccc tcggagcgca ccgagacggc caggtcgtcc acgggcaggg
     7981 tgctccagga ccccgtggtg tgcaccccgc cgttctcggc gacccgcagc tccaccgtca
     8041 gcgacgtcag cggcttgctc gtggtgaccg tgatgtcact ctgcgcccag tacgggttgg
     8101 aattggggtt gatcgccccc cgcgaacgca ccgcggggtc ggcgagatgg tttggaccga
     8161 ccgaggtggg ataggcgctc ggcgtgtgcc gaccagcccc gtcccccgcg ccgccgcccg
     8221 gccgtcgcgc cgtgccgccg ccctcgcccg cgggcggccc ggccggttcg ctcgacgtcc
     8281 gtaccgtctg cggtccgtcg gcggtgtcgg tggtcaccac cgcgatcccg gcggccgcga
     8341 ccgccgcggc cagccccgcg gccgggccca cgacccgggt ccactgccgg gcggggaccc
     8401 ggcggccgct ccgccgctcg cccgccgcca tgcccgcggt gacccgggcc agcatccggg
     8461 cccggtccgg ctggtgcgcc tcggcggcct cgcgcagctc ggcggcgaca tcgcgccggt
     8521 cgtcgcgatg gtcgatcatc aaccgttccc tcctgggtgc ccggcatgac ccacacggcc
     8581 ggggcggagc ccccgtaccg tctcctgcgc cgacggcgcc ggcggcgggg cctgcgcctg
     8641 ggtgcccgcc gcgggccgct cggtgtcgtc gtcgggcggg ccgagcagcc gctccagctc
     8701 ggcgatgccg cgggaggtct ggctcttgac cgtgcccacc gagatgccga gcgtacgggc
     8761 cgtgtcccgc tccgagaggt cgaacgcgtg ccgcagcacc acgcacgccc gcttccggaa
     8821 cggcagcctc accagcgcgg cctgcacatc caccatcgcg gacacatccg gatcgtccac
     8881 ccggtccaag cgctccgacg agaagagcgc cacccgccgc cgctcgcgca ccaggctgcg
     8941 gatcctggac cgggccaggt tggcgaccac gccccgggcg tacgcggccg gatgctcggc
     9001 gacacgcacc cggtcccacc ggtgccagac cgcgaccagc gcgtcggccg ccaggtcgtc
     9061 cgccccgtcg gaactgccca gcagcaagtg ggcgaggcgg gccagttcgg catggtgccg
     9121 ctcgaagaac tcccggaact ccgcctgtgc gtcggccgcg ggtaagccgt ccgccggtaa
     9181 gccggacgtc ggtgtggtgg ccgcggcgga cgggccatcg gtaggagcgc tcacacgggc
     9241 ctcctcactg tccctgctgt ccctcgtccg gtgggttgcg tgagcgtagc agcgctcccc
     9301 gtgttgtacg agtggagccc ttccgcttcc ccggaattgg ccggtcagcg gccgcccgcc
     9361 gtctcgtaca gtgcgtgaca tgagggagag cgcggtgctg cacatcaagg ggcggatcct
     9421 ggtcggcccg gaggacgggg acgccggtgg cgtccgggac gagttgtggg tggtcgacgg
     9481 gaagatcacc ttcgaccggc cgaccggggc ccgggatgtc cggcagctgg acggctgggt
     9541 gctgccgggg ctcgtcgacg cccattgcca cgtcggcctg gacgcccacg gcgcggtgga
     9601 cgacgcgacg agcgagaagc aggcactgac cgaacgcgac gccggggcgc tgctgctgcg
     9661 cgacgccgga tcaccgtccg acacccgctg ggtcgacgac cgcgaggacc tgccccgcat
     9721 catccgggcc ggtcgccaca tcgcccgcac caagcgctac atccgcaact acccgcacga
     9781 gatcgaaccg gaggacctgg tcgcctacgt ggcggccgag gcgcgccgcg gcgacggctg
     9841 ggtcaagctc gtcggcgact ggatcgaccg ggacagcggc gacctgtccg cctgctggcc
     9901 gcgcgggtcg tacgaggcgg ccatcgccga ggcgcaccgg ctcggggcgc gggtcacggc
     9961 gcactgcttc gccgaggaga ccctggctcc gctggtcgag gccgggatcg actgcatcga
    10021 gcacgcgacc gggctgaccg aggacaccat cccgctgttc gccgagcggg gcgtcgccat
    10081 cgtcccgacc ctggtcaaca tcgccacctt cccgcggctc gcggagggcg gcgaggccaa
    10141 gttcccgctc tgggccgacc acatgcgacg gctgcacgcg cggcgctacg acacggtgcg
    10201 cgccgcctgg gacgcgggca tccccgtcta caccggcacc gacgcgggcg gttcgctcgc
    10261 ccacgggctg gtcggccagg aggtcgccga gctggtcaag gcgggcatcc cggtgcggga
    10321 cgcgctgtcg gcggccacct ggggcgcccg gacctggctg ggacggccgg ggatcaccga
    10381 gggcgccccc gccgacctcg tggtg
//
