LOCUS NZ_KE386846 19040 bp DNA linear CON 22-APR-2024 DEFINITION Streptomyces flavidovirens DSM 40150 G412DRAFT_scaffold00009.9, whole genome shotgun sequence. ACCESSION NZ_KE386846 VERSION NZ_KE386846.1 KEYWORDS WGS; IMPROVED_HIGH_QUALITY_DRAFT; RefSeq. SOURCE Streptomyces flavidovirens DSM 40150 ORGANISM Streptomyces flavidovirens DSM 40150 Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales; Streptomycetaceae; Streptomyces. REFERENCE 1 (bases 1 to 265539) AUTHORS Kyrpides,N., Huntemann,M., Han,J., Chen,A., Mavromatis,K., Markowitz,V., Palaniappan,K., Ivanova,N., Schaumberg,A., Pati,A., Liolios,K., Nordberg,H.P., Cantor,M.N., Hua,S.X. and Woyke,T. CONSRTM DOE Joint Genome Institute TITLE Direct Submission JOURNAL Submitted (02-JUL-2013) DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598-1698, USA COMMENT ##MIGS-Data-START## environment :: Soil investigation_type :: bacteria_archaea project_name :: Streptomyces flavidovirens DSM 40150 sequencing_meth :: WGS ref_biomaterial :: DSM 40150, ATCC 19900 finishing_strategy :: Level 1: Standard Draft GOLD Stamp ID :: Gi11969 Type Strain :: Yes Funding Program :: DOE-CSP 2012 Isolation Site :: Soil Sporulation :: Sporulating Gram Staining :: Gram+ ##MIGS-Data-END## ##Genome-Assembly-Data-START## Finishing Goal :: Improved High-Quality Draft Current Finishing Status :: Improved High-Quality Draft Assembly Method :: Unknown program v. before 2013-03-07 Genome Coverage :: Unknown Sequencing Technology :: Illumina HiSeq 2000 ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Name :: GCF_000429085.1-RS_2024_04_22 Annotation Date :: 04/22/2024 01:37:00 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 6.7 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA Genes (total) :: 6,525 CDSs (total) :: 6,452 Genes (coding) :: 6,216 CDSs (with protein) :: 6,216 Genes (RNA) :: 73 rRNAs :: 6, 1, 5 (5S, 16S, 23S) complete rRNAs :: 6 (5S) partial rRNAs :: 1, 5 (16S, 23S) tRNAs :: 58 ncRNAs :: 3 Pseudo Genes (total) :: 236 CDSs (without protein) :: 236 Pseudo Genes (ambiguous residues) :: 0 of 236 Pseudo Genes (frameshifted) :: 106 of 236 Pseudo Genes (incomplete) :: 173 of 236 Pseudo Genes (internal stop) :: 25 of 236 Pseudo Genes (multiple problems) :: 59 of 236 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## ##antiSMASH-Data-START## Version :: 8.dev-cf2fc5ee(changed) Run date :: 2025-09-13 10:04:46 NOTE :: This is a single region extracted from a larger record! Orig. start :: 246499 Orig. end :: 265539 ##antiSMASH-Data-END## REFSEQ INFORMATION: The reference sequence is identical to KE386846.1. URL -- http://www.jgi.doe.gov JGI Project ID: 1011953 Source DNA and organism available from Hans-Peter Klenk at the German Collection of Microorganisms and Cell Cultures (DSMZ) (hans-peter.klenk@dsmz.de) Contacts: Nikos Kyrpides (nckyrpides@lbl.gov) Tanja Woyke (microbe@cuba.jgi-psf.org) Whole genome sequencing and draft assembly at JGI Annotation by JGI The JGI and collaborators endorse the principles for the distribution and use of large scale sequencing data adopted by the larger genome sequencing community and urge users of this data to follow them. It is our intention to publish the work of this project in a timely fashion and we welcome collaborative interaction on the project and analysis. (http://www.genome.gov/page.cfm?pageID=10506376) Full annotations are available from IMG. The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ FEATURES Location/Qualifiers region 1..19040 /candidate_cluster_numbers="1" /contig_edge="True" /product="terpene" /region_number="2" /rules="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /tool="antismash" cand_cluster 1..19040 /candidate_cluster_number="1" /contig_edge="True" /detection_rules="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /kind="single" /product="terpene" /protoclusters="1" /tool="antismash" protocluster 1..19040 /aStool="rule-based-clusters" /category="terpene" /contig_edge="True" /core_location="[10000:16675](+)" /cutoff="20000" /detection_rule="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /neighbourhood="10000" /product="terpene" /protocluster_number="1" /tool="antismash" proto_core 10001..16675 /aStool="rule-based-clusters" /tool="antismash" /cutoff="20000" /detection_rule="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /neighbourhood="10000" /product="terpene" /protocluster_number="1" gene complement(297..1238) /locus_tag="G412_RS0131835" CDS complement(297..1238) /GO_function="GO:0008324 - monoatomic cation transmembrane transporter activity [Evidence IEA]" /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014049646.1" /locus_tag="G412_RS0131835" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="cation diffusion facilitator family transporter" /protein_id="WP_028815571.1" /transl_table=11 /translation="MGAGHDHGHTHGGPPPTGTAAAAYKGRLRIALGITLTVMTVEIVG GLLANSLALIADAAHMATDALGLAMALLAIHFANRPATTKATFGYARAEILAALANCLL LLGVGGFLLFEAVDRFITPAETKSGTAIVFALVGLVANMISLSLLMKGQKDSLNVRGAF LEVLADTLGSVTVLVSAGIIMATGWQAADPIASLVIGLMIVPRTWKLLRETLSVLLEIA PKGVDMAQVREHILDLPGVDDVHDLHAWTITSGMPVLSAHVVVSQDFLDSIGHEKMLHA LQGCLGEHFDVEHCTFQLEPGGHAEHEAKLCH" misc_feature 1319..1333 /note="TFBS match to ZuR_variant_1, Zinc-responsive repressor, confidence: weak, score: 18.72" /tool="antismash" misc_feature 1349..1360 /note="TFBS match to AfsR, Pleiotropic regulatory for antibiotic production, confidence: medium, score: 18.76" /tool="antismash" gene complement(1381..2364) /gene="galE" /locus_tag="G412_RS0131840" CDS complement(1381..2364) /EC_number="5.1.3.2" /GO_function="GO:0003978 - UDP-glucose 4-epimerase activity [Evidence IEA]" /GO_process="GO:0006012 - galactose metabolic process [Evidence IEA]" /NRPS_PKS="Domain: NAD_binding_4 (58-207). E-value: 3e-11. Score: 34.8. Matches aSDomain: nrpspksdomains_G412_RS0131840_NAD_binding_4.1" /codon_start=1 /gene="galE" /gene_functions="biosynthetic-additional (rule-based-clusters) RmlD_sub_bind" /gene_functions="biosynthetic-additional (rule-based-clusters) Polysacc_synt_2" /gene_functions="biosynthetic-additional (smcogs) SMCOG1010: NAD-dependent epimerase/dehydratase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018888986.1" /locus_tag="G412_RS0131840" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="UDP-glucose 4-epimerase GalE" /protein_id="WP_028815572.1" /sec_met_domain="RmlD_sub_bind (E-value: 9e-19, bitscore: 61.5, seeds: 37, tool: rule-based-clusters)" /sec_met_domain="Polysacc_synt_2 (E-value: 1.5e-15, bitscore: 51.0, seeds: 77, tool: rule-based-clusters)" /transl_table=11 /translation="MTWLITGGAGYIGAHVARTMAAAGERVVVLDDVSSGIPQRLPEDI ALVRGSVLDRELLDRTLAEHGVTGVVHLAARKQVGESVEKPLLYYRENVYGLTVLLEAV AAAGVRRFLFSSSAAVYGIPEAELISEDAPCVPINPYGETKLAGEWLVRAAGRAHGFST ACLRYFNVAGAARPELADTGVFNIIPMFFDRITRGEAPRIFGDDYPTPDGTCVRDYIHV ADLADAHLSVAQGLTARDEPADMTFNIGRGEGVSVRELADLVSEITGSTLAPVVEPRRP GDAAKAVASSDLIAKELGWTAGRGVREMVASAWEGWLLRHPEAARN" aSDomain complement(1744..2190) /aSDomain="NAD_binding_4" /aSTool="nrps_pks_domains" /database="nrpspksdomains.hmm" /detection="hmmscan" /domain_id="nrpspksdomains_G412_RS0131840_NAD_binding_4.1" /evalue="3.00E-11" /label="G412_RS0131840_NAD_binding_4.1" /locus_tag="G412_RS0131840" /protein_end="207" /protein_start="58" /score="34.8" /tool="antismash" /translation="DRTLAEHGVTGVVHLAARKQVGESVEKPLLYYRENVYGLTVLLEA VAAAGVRRFLFSSSAAVYGIPEAELISEDAPCVPINPYGETKLAGEWLVRAAGRAHGFS TACLRYFNVAGAARPELADTGVFNIIPMFFDRITRGEAPRIFGDD" CDS_motif complement(2275..2361) /aSTool="nrps_pks_domains" /database="abmotifs" /detection="hmmscan" /domain_id="nrpspksmotif_G412_RS0131840_0001" /evalue="4.20E-07" /label="PKSI-KR_m1" /locus_tag="G412_RS0131840" /protein_end="30" /protein_start="1" /score="22.6" /tool="antismash" /translation="TWLITGGAGYIGAHVARTMAAAGERVVVL" misc_feature complement(2426..2440) /note="TFBS match to MexT, Global virulence regulator, confidence: weak, score: 17.07" /tool="antismash" gene complement(2483..2803) /locus_tag="G412_RS41310" CDS complement(2483..2803) /codon_start=1 /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /locus_tag="G412_RS41310" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /product="hypothetical protein" /protein_id="WP_244283346.1" /transl_table=11 /translation="MTDEARVDKGHPLTGGCLVEQLGGLLGGVAVGPGVPYDESQCPQI ALERRSGYRRAGEDGGRQTNSLLGAGQGLGLAWHAALRPGTGAKACRQRLSDERKQEFM AR" gene 2771..4399 /locus_tag="G412_RS0131845" CDS 2771..4399 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018552445.1" /locus_tag="G412_RS0131845" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="DUF5941 domain-containing protein" /protein_id="WP_244283347.1" /transl_table=11 /translation="MALVDPRFVGHVHSLRLALTDPRFPAAAAPGALTAQPAARAALAR AVRGGAAGPDAAAAALDANGTVVRRPELGSLVAVVATDEDERVQARAAVAAVDEEAVRL RTAVKSRDGFFTTFCVSPYSRYLARWCARRGFTPNQVTTASLITALIAAGCAATGTRGG YVAAGALLLFSFVLDCTDGQLARYSLQYSTMGAWLDATFDRAKEYAYYAGLALGAARGG DDVWALALGAMVLQTCRHVVDFSFNEANHDAESNTSPTAALSDKLDSVGWTVWVRRMII LPIGERWAMIAVLTAVTTPRIVFYALLAGCALAALYTTAGRVLRSLTRAARRTDRAARA LADLADSGPVARAVAARGPRLKGAWTAPVLAAAGAAALVATALTQPLGSRQMIIAAVCY AVLCGTAVARPLKGALDWLVPPVFRAAEYGTILILAARSEVNGALPAAFGLVSAVAYHH YDTVYRIRGGTGAPPQWLVRTIGGHDGRTLVVVVLATVLATRNTDLTLALAALAVAVAL VVLVESIRFWVSSGAPAVHDEGEPA" gene 4396..5133 /locus_tag="G412_RS0131850" CDS 4396..5133 /codon_start=1 /gene_functions="biosynthetic-additional (rule-based-clusters) NTP_transf_3" /gene_functions="biosynthetic-additional (smcogs) SMCOG1064: glucose-1-phosphate adenylyl/thymidylyltransferase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018955403.1" /locus_tag="G412_RS0131850" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="phosphocholine cytidylyltransferase family protein" /protein_id="WP_028815574.1" /sec_met_domain="NTP_transf_3 (E-value: 1.3e-13, bitscore: 45.4, seeds: 297, tool: rule-based-clusters)" /transl_table=11 /translation="MIGLVLAAGAGRRLRPYTDTLPKALVPVDGETTVLDLTLGNFAEV GLTEVAIVVGYRKEAVYARKEALEAKYGLKITLVDNDKAEEWNNAYSLWCAREVLKQGV ILANGDTVHPVSVEKTLLAARGDGQKIILALDTVKNLADEEMKVITQDGKGVRRITKLM DPATATGEYIGVTLIEPEAAEELADALKTTFERDPDLYYEDGYQELVNRGFTVDVAPIG EVTWVEIDNHDDLAKGREIACQY" gene 5121..6182 /locus_tag="G412_RS0131855" CDS 5121..6182 /GO_function="GO:0030554 - adenyl nucleotide binding [Evidence IEA]" /GO_function="GO:0046872 - metal ion binding [Evidence IEA]" /codon_start=1 /gene_functions="biosynthetic-additional (rule-based-clusters) Fe-ADH" /gene_functions="biosynthetic-additional (rule-based-clusters) DHQ_synthase" /gene_functions="biosynthetic-additional (smcogs) SMCOG1181: dehydrogenase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007452391.1" /locus_tag="G412_RS0131855" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="iron-containing alcohol dehydrogenase family protein" /protein_id="WP_028815575.1" /sec_met_domain="Fe-ADH (E-value: 8.9e-20, bitscore: 64.6, seeds: 218, tool: rule-based-clusters)" /sec_met_domain="DHQ_synthase (E-value: 4.9e-10, bitscore: 32.8, seeds: 76, tool: rule-based-clusters)" /transl_table=11 /translation="MPVLTRLIPSPVVVDISCGAMDDLAGLLADQRISASGKLAIATSG GSGLPLRHKLAPVLPGADWYSVADGTIDSAVKLADEIKGKRYDAVVGLGGGKIIDVTKY AAARVGLPMVAVATNLSHDGLCSPVSILDNDNGRGSYGVPTPIAMVIDLDVIRDAPVRF IRSGIGDAISNISAIADWELSHRINGEPVDGLAAAMARTAGEAVLRHPGGVGTDEFLTV LAESLVLSGIAMSISGDSRPSSGACHEISHAFDLLYPKRAASHGEQVGLGAAFAMHLRG ATEQAGLFAEVLHRHGLPVLPEEIGFSVDEFVKAVAYAPQTRPGRFTILEHLDLSADQI KDAYADYVKAISS" gene 6160..6939 /locus_tag="G412_RS0131860" CDS 6160..6939 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008413838.1" /locus_tag="G412_RS0131860" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="CDP-alcohol phosphatidyltransferase family protein" /protein_id="WP_078607584.1" /transl_table=11 /translation="MSRPSVAELRPVVHPEGVKDRRSGEHWAGRMYMREISLHIDPYLV NTKITPNQLTYLMVVVGVLGGAALLIPGLTGAILAVVLFQIYLLLDCVDGEVARWRKQT SITGVYLDRIGHYLCEAALLVGFGLRGADLFGGGRPEWLWAFLGTLAALGAILIKAETD LVDVARTRSGLPAVKDEASVPRSSGLALARRLAAALKFHRLVGGIEASLFILVVAIADF VQGDLFFSRLGIAVLAGIAIVQTFLHLVSILASSRLK" gene 6936..7835 /locus_tag="G412_RS0131865" CDS 6936..7835 /EC_number="2.4.-.-" /codon_start=1 /gene_functions="biosynthetic-additional (rule-based-clusters) Glycos_transf_2" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018384775.1" /locus_tag="G412_RS0131865" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="glycosyltransferase" /protein_id="WP_037650563.1" /sec_met_domain="Glycos_transf_2 (E-value: 3.1e-15, bitscore: 50.3, seeds: 256, tool: rule-based-clusters)" /transl_table=11 /translation="MTAAGAASGLKVGAVIITMGNRPDELRALLDSVAKQDGDRVEVVV VGNGSPVAGVPEGVRTVELPENLGIPGGRNAGIEAFGPSGAEMDILLFLDDDGLLAHHD TAELCRQAFAADPKLGIISFRIADPDTGETQRRHVPRLRAADPMRSSRVTTFLGGANAV RTKVFAEVGGLPDEFFYAHEETDLAWRALDAGWMIDYRSDMVLYHPTTAPSRHAVYHRM VARNRVWLARRNLPAPLVPVYLGVWLLLTLARRPSVPALRAWFGGFKEGWTTSAGPRRP MRWRTVWRLTRLGRPPVI" gene 7930..8865 /locus_tag="G412_RS0131870" CDS 7930..8865 /GO_component="GO:0043190 - ATP-binding cassette (ABC) transporter complex [Evidence IEA]" /GO_function="GO:0140359 - ABC-type transporter activity [Evidence IEA]" /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003972239.1" /locus_tag="G412_RS0131870" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="ABC transporter permease" /protein_id="WP_028815578.1" /transl_table=11 /translation="MSDTTRDGAVATSAPPSPDDGLTPAELAAKYGLSVSGARPGLIEY VRQLWGRRHFILAFSQAKLTAQYSQAKLGQLWQVATPLLNALVYFLIFGLILEADRGMD REVYVPFLVTGVFVFTFTQSSVMAGVRSISGNLGLVRALHFPRASLPISFALQQLQQLL FSMIVLVIIAVAFGSYPSLSWLLVIPALAVQFVFNIGLALIMARLGSKTPDLAQLMPFV MRTWMYASGVMFSIPAMLADKDLPGWVTDVLQWNPAAVYMDLIRFGLIDGYGSENLPPH VWGVALGWALLVGLVGFVYFWKAEERYGRG" gene 8858..9643 /locus_tag="G412_RS0131875" CDS 8858..9643 /GO_function="GO:0005524 - ATP binding [Evidence IEA]" /GO_function="GO:0016887 - ATP hydrolysis activity [Evidence IEA]" /GO_function="GO:0042626 - ATPase-coupled transmembrane transporter activity [Evidence IEA]" /GO_function="GO:0140359 - ABC-type transporter activity [Evidence IEA]" /codon_start=1 /gene_functions="transport (smcogs) SMCOG1000: ABC transporter ATP-binding protein" /gene_kind="transport" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003964853.1" /locus_tag="G412_RS0131875" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="ABC transporter ATP-binding protein" /protein_id="WP_028815579.1" /transl_table=11 /translation="MADDNKQGRVPTVIADDVHIVYRVNAGAGGRGSATAALSRIMKRG KGDSPGVRKVHAVRGVSFTSYRGEAIGLIGTNGSGKSTLLRAIAGLLPTEQGKVYTDGQ PSLLGVNAALMNDLTGERNVILGGLAMGMTREEIRERYQQIVDFSGINEKGDFITLPMR TYSSGMAARLRFSIAAAKDHDVLMIDEALATGDRKFQIRSEQRIRELRKEAGTVFLVSH SNKSIRDTCDRVLWLEKGELLMDGPTDEVLKAYEKETGR" gene 10001..10903 /gene="hpnC" /locus_tag="G412_RS0131880" CDS 10001..10903 /EC_number="2.5.1.21" /GO_function="GO:0004311 - farnesyltranstransferase activity [Evidence IEA]" /codon_start=1 /gene="hpnC" /gene_functions="biosynthetic (rule-based-clusters) terpene: PT_phytoene_like" /gene_functions="biosynthetic (rule-based-clusters) terpene: phytoene_synt" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_005308894.1" /locus_tag="G412_RS0131880" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="squalene synthase HpnC" /protein_id="WP_028815580.1" /sec_met_domain="phytoene_synt (E-value: 3.2e-38, bitscore: 125.4, seeds: 8, tool: rule-based-clusters)" /sec_met_domain="PT_phytoene_like (E-value: 3.2e-40, bitscore: 132.1, seeds: 61, tool: rule-based-clusters)" /transl_table=11 /translation="MTTTRHVRTDAHARTTLDKAADENFPVAPFFLPRAWRDDLMAVYG YARLVDDIGDGDLAPGGADARLLGLDPALADDRLLLLDAFEADLRRVFDASGDGPRHPL LRALVPTVRRCSLTPGPFLGLIEANRQDQLVRRYKTYDDLLAYCELSANPVGRLVLQIT GTASPERIRRSDAVCTALQIVEHLQDVAEDLGRDRIYLPADTMARFHVEEADLAAPSAN ASVRSLIAYEAERAGRLLDEGTPLVASVSGRLKLLLAGFAGGGRAALAAIAATGYDVLP GPPKPTKLSLLRAVGAVLR" gene 10948..11868 /gene="hpnD" /locus_tag="G412_RS0131885" CDS 10948..11868 /EC_number="2.5.1.103" /GO_function="GO:0016767 - geranylgeranyl-diphosphate geranylgeranyltransferase activity [Evidence IEA]" /GO_process="GO:0016117 - carotenoid biosynthetic process [Evidence IEA]" /codon_start=1 /gene="hpnD" /gene_functions="biosynthetic (rule-based-clusters) terpene: PT_phytoene_like" /gene_functions="biosynthetic (rule-based-clusters) terpene: phytoene_synt" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017944204.1" /locus_tag="G412_RS0131885" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="presqualene diphosphate synthase HpnD" /protein_id="WP_106964213.1" /sec_met_domain="phytoene_synt (E-value: 9.2e-59, bitscore: 192.7, seeds: 8, tool: rule-based-clusters)" /sec_met_domain="PT_phytoene_like (E-value: 2.7e-70, bitscore: 230.8, seeds: 61, tool: rule-based-clusters)" /transl_table=11 /translation="MSAPVLAAYSYCEAVTGAQARNFAYGIRLLPTDKRNAMSALYAFS RRVDDIGDGVLEPAVKQVRLEETRALLDRIRTGAVDDDDTDPVAVALSDTARRFPLPLE GLDELIDGVLMDVRGETYETWDDLKVYCRCVAGAIGRLSLGVFGTQTGARSTERASEYA DTLGLALQLTNILRDVREDAGNGRTYLPADDLAKFGCSAGFHGATPPEGSDFTGLIHFE VRRARALFAEGYRLLPLLDRRSGACVAAMAGIYRRLLDRIERDPLAVLRGRVSLPGHEK AYVAVRGLSGLDSRHISRSHIWRRV" gene 11996..13426 /gene="hpnE" /locus_tag="G412_RS0131890" CDS 11996..13426 /EC_number="1.17.8.1" /GO_function="GO:0016491 - oxidoreductase activity [Evidence IEA]" /codon_start=1 /gene="hpnE" /gene_functions="biosynthetic-additional (rule-based-clusters) DAO" /gene_functions="biosynthetic-additional (smcogs) SMCOG1222: dehydrogenase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018568730.1" /locus_tag="G412_RS0131890" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="hydroxysqualene dehydroxylase HpnE" /protein_id="WP_078607571.1" /sec_met_domain="DAO (E-value: 1.3e-11, bitscore: 38.5, seeds: 521, tool: rule-based-clusters)" /transl_table=11 /translation="MTRGTTGGRSVVPARSPGPAAGHAVVVGGGLAGVTAALQLADAGM RVTLLEGRPRLGGLAFSFRRGELTVDNGQHVYLRCCTAYRWFLDRVEGAGLAPLQSRLD VPVLDVGRPAGPRLGRLRRTALPVPLHLAASLAAYPHLSLAERASVGRAALALKKLDPA DPALDGVDFATWLGRHGQSPRTIEALWDLVGVATLNATAPDASLGLAAMVFKTGLLSDP GAADIGWAHVPLGDLHDTLARKALDSAGVRTALRTRVSSLSRSVSGGWSVGTAAGERIE ADAVVLAVPQRETHAVLPEGVIDDPGRLLDIGTAPILNVHVVYDRKVLRRPFFAALGSP VQWVFDRTESSGLRGGGQYLAVSQSAARGEIDAPVAELRARYLPELERLLPAARGAGVR DFFVTRERTATFAPTPGVGRLRPGARTHAPGLYLAGAWTATGWPATMEGAVRSGITAAG AALRELGRVHEHPLQEAV" gene 13423..14517 /locus_tag="G412_RS0131895" CDS 13423..14517 /codon_start=1 /gene_functions="biosynthetic (rule-based-clusters) terpene: PT_FPPS_like" /gene_functions="biosynthetic-additional (smcogs) SMCOG1182: Polyprenyl synthetase" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003964849.1" /locus_tag="G412_RS0131895" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="polyprenyl synthetase family protein" /protein_id="WP_051338738.1" /sec_met_domain="PT_FPPS_like (E-value: 1.1e-59, bitscore: 195.7, seeds: 145, tool: rule-based-clusters)" /transl_table=11 /translation="MSTGMETRGEPVTPVNPVDNTTVDIAALLERGRTLSTPVLKAAVG RLAPPMDTVAAYHFGWIDAQGRPADGDGGKAVRPALALLSAQAAGAAAEVGVPGAVAVE LVHNFSLLHDDLMDGDEQRRHRDTVWKVHGPAQAILVGDALFALANEVLLELGTVEAGR ATRRLTTATRKLIDGQAQDISFEHRERVTVEECLEMEGNKTGALLACAVSIGAVLGGAD DRTADTLEAYGYHLGLAFQAVDDLLGIWGDPDATGKQTWSDLRQRKKSLPVVAALAAGG PASERLGELLSADAKSNDFESFSEQEFATRAALIEEAGGRDWTAQEARRQHAVAIEALD GVDMPPQVRAQLTALADFVVVRKR" misc_feature 14528..14544 /note="TFBS match to NuR, Nickel-responsive regulator, confidence: weak, score: 20.45" /tool="antismash" gene 14639..16675 /gene="shc" /locus_tag="G412_RS36970" CDS 14639..16675 /EC_number="5.4.99.17" /GO_function="GO:0051007 - squalene-hopene cyclase activity [Evidence IEA]" /GO_process="GO:0019746 - hopanoid biosynthetic process [Evidence IEA]" /codon_start=1 /gene="shc" /gene_functions="biosynthetic (rule-based-clusters) terpene: T2TS" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020116600.1" /locus_tag="G412_RS36970" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="squalene--hopene cyclase" /protein_id="WP_051338744.1" /sec_met_domain="T2TS (E-value: 4.3e-151, bitscore: 498.0, seeds: 58, tool: rule-based-clusters)" /transl_table=11 /translation="MTATTDGSPGAMKPCAAAASEFTDNNTNTTTTEITRHGPVIAGVL DAARRATDRGVAHLLAQQDDQGWWKGDLETNVTMDAEDLLLRQFLGIQDEKTVRAAALF IRGEQRGDGTWATFYGGPGDLSTTIEAYVALRLAGDLPDAPHMERAAAWIRARGGIAAS RVFTRIWLALFGWWKWDDLPELPPELIFLPSWFPLNIYDFGCWARQTIVPLTVVSAKRP VRPAPFALDELHANPRVPNPRKRLSTPTSWEGAFQRLDKALHVYRKVAPARLRGAAMKS AARWIVERQENDGCWGGIQPPAVYSLIALHLLGYDLDHPVMRAGLESLDRFTVWREDGA RMIEACQSPVWDTCLATIALADAGVRPDDPALVKAADWMMSEQIVRPGDWAVRRPGVEP GGWAFEFHNDNYPDIDDTAEVVLALRRVAHPDRARTEATVDRAVRWNLGMQSKNGAWAA FDADNTSPFPNRLPFCDFGEVIDPPSADVTAHVVEMLAVEGKAHDPRTRRGIEWLLAEQ EPSGAWFGRWGVNYVYGTGAVVPALIAAGLPAAHPAVRRAVGWLESVQNDDGGWGEDLR SYREAGWAGRGASTASQTAWALLALLAAGERDGRSVERGVAWLAGTQREDGSWDEPYFT GTGFPWDFSINYHLYRQVFPLTALGRYVHGEPTYPDLATREGT" gene 16675..17355 /locus_tag="G412_RS0131905" CDS 16675..17355 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014152780.1" /locus_tag="G412_RS0131905" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="1-hydroxy-2-methyl-2-butenyl 4-diphosphate reductase" /protein_id="WP_244283348.1" /transl_table=11 /translation="MAPMDPADRTDPADEPPRPAPLLIACALGIEQFALRSGHRKEAPG PVTVLRTGMGPKAAATAVRQALAHGGPAPDAAVIASGFCAGLVPGMHPGDLIVADETRG PHGTSACTGVELLAKALDRVVPGRTVHTGPLLGSAHVVRGPERARLAATGAIAVDMESA ATLGSALRAGPRPVAAVRVVVDAPEHELVRIGTLRGGISAFRVLRAVLPAFFEWHRSSL LPRR" gene 17361..18383 /gene="hpnH" /locus_tag="G412_RS0131910" CDS 17361..18383 /GO_function="GO:0003824 - catalytic activity [Evidence IEA]" /GO_function="GO:0051536 - iron-sulfur cluster binding [Evidence IEA]" /codon_start=1 /gene="hpnH" /gene_functions="biosynthetic-additional (rule-based-clusters) PF04055" /gene_functions="biosynthetic-additional (rule-based-clusters) QueE" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015037634.1" /locus_tag="G412_RS0131910" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="adenosyl-hopene transferase HpnH" /protein_id="WP_028815585.1" /sec_met_domain="PF04055 (E-value: 2.4e-19, bitscore: 64.2, seeds: 518, tool: rule-based-clusters)" /sec_met_domain="QueE (E-value: 9.3e-10, bitscore: 32.2, seeds: 32, tool: rule-based-clusters)" /transl_table=11 /translation="MAMPLRQTIRVGTYLVEQKLRKREKFPLIVELEPLYACNLACEGC GKIQHPAGVLKQRMPVAQAVGAVLESGAPMVSIAGGEPLMHPQIDEIVRQLVAKRKYVF LCTNAMLLRKKIEKFTPSPYFAFAVHIDGLRERHDESVAKEGVFDEAVEAIKEAKKRGF RVTTNSTFFNTDTPQTIIEVLNYLNDDLEVDEMMISPAYAYEKAPDQEHFLGVEQTREL FKKAFAGGNRRRWRLNHSPLFLDFLEGKADFPCTAWAIPNYSLFGWQRPCYLMSDGYVP TYRELIEETDWSKYGRGKDPRCANCMAHCGYEPTAVLATMGSLKESLRAVRETVSGNHG " misc_feature 17408..17421 /note="TFBS match to LexA_variant_2, Repressor of DNA damage response, confidence: weak, score: 19.73" /tool="antismash" gene 18388..>19040 /locus_tag="G412_RS0131915" CDS 18388..>19040 /EC_number="1.17.7.3" /GO_function="GO:0046429 - 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase activity [Evidence IEA]" /GO_process="GO:0016114 - terpenoid biosynthetic process [Evidence IEA]" /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010356627.1" /locus_tag="G412_RS0131915" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="flavodoxin-dependent (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase" /protein_id="WP_028815586.1" /transl_table=11 /translation="MTLGEPVALGLPELPARPLAVRRASRRIQVGSVAVGGDAPVSVQS MTTTRTSDVGATLQQIAELTASGCQIVRVACPTQDDADALATIARKSQIPVIADIHFQP KYVFAAIDAGCAAVRVNPGNIKQFDDKVKEIAKAASETRTPIRIGVNAGSLDARLLQKY GKATPEALVESALWEASLFEEHGFGDIKISVKHNDPVVMVNAYRQLAAQCDYPL" ORIGIN 1 gcgcgcttcg gcggcgctcc gcagctcggc gaggaacggc tcgtacagat cgacggccgt 61 cacctcggcg ccgcactccc cggccagcag cagcgagacc ctgcccggac cgcagcccag 121 gtcgagcacg cgcggccggt gcggcagcgg tccggcgagg gagagcagat ggcgggcggt 181 ggcgtcggag ccggggccct ggcggggcag cccgcggtgc agggtcatga aagcttcgaa 241 catggcgttg tcgctcaacg taagaaccct ttggtgtgcc tgacagtacc ggcggatcag 301 tggcagagct tcgcctcgtg ctcggcgtga ccgccgggct ccagctggaa cgtgcagtgc 361 tcgacgtcga agtgctcccc gaggcagccc tggagggcat gcagcatctt ctcgtgcccg 421 atcgagtcga ggaagtcctg gctgacgacg acatgcgccg acaggaccgg catgcccgag 481 gtgatcgtcc aggcatgcag atcgtgtacg tcgtcgacgc ccggcagatc gagtatgtgc 541 tcccgcacct gcgccatgtc cacgcccttg ggcgcgatct ccaggagcac gctcagcgtc 601 tcccgcagca gcttccaggt gcgcggcacg atcatgaggc cgatcaccag cgacgcgatg 661 gggtcggcgg cctgccagcc ggtcgccatg atgatgccgg ccgagaccag cacggtgacc 721 gagcccaggg tgtccgcgag cacctccagg aaggcgccgc gcacattgag actgtccttc 781 tggcccttca tcagcaggga cagcgagatc atgttggcga cgagaccgac cagggcgaag 841 acgatcgccg taccgctctt ggtctcggcc ggggtgatga agcggtcgac ggcctcgaag 901 agcaggaaac cgccgacacc gagcagcagc agacagttgg cgagtgcggc gaggatctcg 961 gccctggcgt acccgaaggt ggccttcgtg gtggccgggc ggttggcgaa gtggatggcc 1021 aggagcgcca tcgcgaggcc gagggcgtcc gtcgccatat gggccgcgtc cgcgatcagc 1081 gccagcgaat tggcgagcag accgccgacg atctcgaccg tcatgacggt cagggtgatg 1141 ccgagcgcga tacgaagccg ccccttgtac gcggcggccg ctgtgccggt gggcggcggc 1201 cccccgtgcg tatgcccgtg atcgtgccca gcccccatga aggatgcctc cccagtggtg 1261 tgaatggtct gccgaggagc cagtgaacta cgggcggggg gtatgcgcaa cacgggactg 1321 aacaccgttg tcatatgctc tgacctgcgg aaatgaccgc aggtcagagc ggcgacctga 1381 tcagttccgg gcggcctcgg ggtgccggag cagccagccc tcccacgccg aggcgaccat 1441 ctcgcgcacc ccgcgcccgg ccgtccagcc cagctccttg gcgatcagat cggacgaggc 1501 gaccgccttc gccgcgtcgc cggggcggcg cggctcgacc accggggcga gcgtggaacc 1561 ggtgatctca ctcaccagat cggcgagctc gcgcaccgag acgccctcac cgcggccgat 1621 gttgaaggtc atgtccgccg gctcgtcccg cgcggtcagc ccctgtgcga ccgacaggtg 1681 ggcatcggcg agatcggcga cgtgaatgta gtcacggacg caggtgccgt cgggcgtcgg 1741 atagtcgtcc ccgaagatcc gcggggcctc gccgcgcgtg atccggtcga agaacatcgg 1801 aatgatgttg aacactccgg tgtcggcgag ctccggcctc gccgcgcccg ccacattgaa 1861 gtagcgcaga cacgcggtcg agaagccgtg cgcccggccc gcggcccgca cgagccactc 1921 gccggccagc ttcgtctcgc cgtacggatt gatcgggacg cagggggcgt cctccgagat 1981 gagctccgcc tcgggaatgc cgtacacggc ggcggaggag gagaagagga accgccgcac 2041 cccggccgcc gcgacggcct ccaggagcac ggtgagtccg tacacatttt cccggtagta 2101 gagcagcggc ttctcgaccg actccccgac ctgcttcctc gccgcgagat gcaccacacc 2161 tgtgacgccg tgctccgcga gcgtacggtc gagcagctcc cggtccagta cggagccgcg 2221 taccagcgcg atgtcctcgg gcagccgctg cgggatgccg gacgagacgt cgtccaggac 2281 gaccacccgc tcacccgccg ccgccatggt ccgcgccaca tgcgcgccga tgtaacccgc 2341 cccacccgtg atcagccatg tcataacggc caccctaggg cgtgtccgga cctcaggcgg 2401 tcgcggtttg tggggcgggg cgctgatcga gcaagatgat ctcggtgggg cgcccgcgtc 2461 ctggcccggt cttatccggg cgttaacggg ccatgaactc ctgcttccgt tcatccgata 2521 gcctctgccg acatgccttc gcgcccgtgc cgggccgcaa tgccgcatgc cacgcaaggc 2581 caagcccctg cccagcgcca aggagtgagt tcgtctgccg accgccatcc tcaccggccc 2641 gccggtaccc ggatcgccgc tcgagggcga tctgcgggca ctgggattcg tcgtacggta 2701 cgccgggccc gacggcgacg ccgccgagca gaccgccaag ctgctcgacg agacacccgc 2761 cggtgagcgg gtggcccttg tcgacccgcg cttcgtcggt cacgtccact ccctgcgcct 2821 cgcgctgacc gacccccgct tcccggccgc cgcggccccc ggcgcgctca ccgcgcagcc 2881 cgcggcccgc gccgcgctgg cccgcgccgt gcgcggcggc gccgccgggc ccgacgccgc 2941 cgccgccgcg ctcgacgcga acggcaccgt cgtgcggcgg ccggagctcg gctcgctggt 3001 cgccgtcgtc gccacggacg aggacgagcg cgtgcaggcc cgcgccgccg tcgccgccgt 3061 cgacgaggag gccgtacggc tgcgtacggc cgtgaagtcc cgcgacggtt tcttcaccac 3121 cttctgcgtc agcccgtact cgcgctacct cgcgcgctgg tgcgcccgca ggggcttcac 3181 cccgaaccag gtcaccacgg cctcgctgat caccgcgctg atcgcggccg gctgcgcggc 3241 gaccggtacc cgcggcggct acgtcgcggc cggcgcgctg ctgctcttct ccttcgtcct 3301 ggactgcacc gacgggcagc tcgcccgcta ctcgctccag tactcgacga tgggcgcctg 3361 gctcgacgcg accttcgacc gcgccaagga gtacgcgtac tacgcgggcc tcgcgctcgg 3421 cgccgcccgt ggcggggacg acgtatgggc gctggcgctc ggcgcgatgg tgctccagac 3481 ctgccgccat gtcgtggact tctccttcaa cgaggcgaac cacgacgcgg agtccaacac 3541 cagccccacc gccgcgctct ccgacaaact cgacagcgtc ggctggacgg tctgggtgcg 3601 ccggatgata atcctgccga tcggcgagcg gtgggcgatg atcgcggtac tgaccgccgt 3661 caccaccccg cggatcgtct tctacgccct gctcgccggc tgcgcgctcg ccgccctcta 3721 caccacggcc ggccgcgtcc tgcgctcgct gacccgcgcg gcacggcgta ccgaccgggc 3781 ggcccgggcc ctcgccgacc tcgcggacag cgggcccgtc gcccgggcgg tggcggcccg 3841 tgggccgcgc ctcaagggcg cctggaccgc ccccgtcctc gccgccgccg gcgccgccgc 3901 cctggtggcc acggcgctca cccagcccct cggcagccgc cagatgatca tcgcggcggt 3961 ctgctacgcc gtgctctgcg gcacagccgt cgcgcggccc ctcaagggcg ccctcgactg 4021 gctcgtcccg cccgtcttcc gggcagccga gtacggcacg atcctgatcc tggccgcccg 4081 ctccgaagtg aacggagcgt tgccggccgc attcgggctg gtgtcggcgg tcgcctacca 4141 tcactacgac accgtgtacc gcatccgcgg cggcacgggc gcgccgcccc agtggctggt 4201 gcggacgatc ggcgggcacg acggccggac gctggtggta gtggtactcg ccaccgtcct 4261 ggcgacgcgg aacacagacc tcaccctggc gctcgcggcc ctcgctgtgg ccgtggcact 4321 cgtggtgctg gtggagtcca tccgcttctg ggtgtcctcc ggagcacccg cggtacatga 4381 cgaaggagaa cccgcatgat cggcctcgta ctggctgccg gcgccggacg gcgtctgcgt 4441 ccctacacgg acaccctccc gaaggccctg gtgcccgtgg acggtgagac gaccgtcctc 4501 gacctgacgc tgggcaactt cgccgaggtc ggcctgaccg aggtcgccat cgtcgtcggc 4561 taccgcaagg aggccgtcta cgcgcgcaag gaggcccttg aggcgaagta cggcctcaag 4621 atcacgctcg tcgacaacga caaggccgag gagtggaaca acgcctactc cctgtggtgc 4681 gcgcgtgagg tcctcaagca gggcgtgata ctcgccaacg gcgacaccgt tcacccggtc 4741 tccgtcgaga agacgctgct cgccgcccgc ggcgacggcc agaagataat cctcgccctc 4801 gacacggtga agaacctcgc cgacgaggag atgaaggtca tcacgcagga cggcaagggt 4861 gtccggcgga tcaccaagct gatggacccg gccaccgcca ccggtgagta catcggtgtc 4921 acgctcatcg agccggaggc cgccgaggag ctggccgacg cgctgaagac gacgttcgag 4981 cgcgaccccg acctctacta cgaggacggc taccaggagc tggtcaaccg cggcttcacc 5041 gtcgacgtgg cccccatcgg cgaagtgacg tgggtcgaga tcgacaacca cgacgacctc 5101 gcgaagggcc gtgagatcgc gtgccagtac tgacccggct catcccgtca ccggtcgtcg 5161 tcgacatcag ctgcggcgcc atggacgacc tggccggcct gctggccgac cagcggatct 5221 ccgcctccgg caagctggcg atcgcgacca gcggcggctc gggcctgccc ctgcgccaca 5281 agctcgcgcc ggtcctgccg ggcgccgact ggtactccgt cgccgacgga acgatcgact 5341 ccgccgtgaa gctcgccgac gagatcaagg gcaagcggta cgacgccgtg gtcggcctcg 5401 gcggcggcaa gatcatcgac gtcacgaagt acgcggcggc gcgggtcggc ctgccgatgg 5461 tcgccgtcgc gacgaacctc tcgcacgacg gtctctgctc gccggtgtcc atcctggaca 5521 acgacaacgg gcgcggctcc tacggcgtac cgacccccat cgccatggtc atcgacctcg 5581 atgtgatccg cgacgccccg gtccgcttca tccggtccgg catcggtgac gcgatctcca 5641 acatctcggc aatcgcggac tgggagctgt cccaccgtat aaacggcgag ccggtcgacg 5701 gactggccgc cgccatggcc cgtacggccg gagaagccgt actccgccac cccggcggcg 5761 tcggcaccga cgagttcctc acggtcctcg ccgagtccct cgtcctgtcc ggtatcgcca 5821 tgtcgatcag cggcgacagc cgcccctcgt ccggtgcgtg ccacgagatc agccacgcct 5881 tcgacctgct gtaccccaag cgggccgcga gccacggcga gcaggtcggc ctcggcgcgg 5941 ccttcgccat gcacctgcgg ggcgccaccg agcaggccgg gctcttcgcc gaggtactgc 6001 accggcacgg cctgccggtg ctcccggagg agatcggctt cagcgtcgac gagttcgtca 6061 aggccgtcgc ctacgcacca cagacccgtc cgggacgctt cacgatcctg gaacacctcg 6121 acctgtccgc agaccagatc aaggacgctt acgccgacta tgtcaaggcc atcagtagct 6181 gaactccgtc cggtcgttca ccccgagggc gtgaaggacc ggcgcagcgg tgagcactgg 6241 gccgggcgca tgtacatgcg ggagatctcg ctgcacatcg acccgtacct ggtgaacacc 6301 aagatcacgc cgaaccagct cacgtacctc atggtcgtcg tgggtgtgct cggcggcgcg 6361 gccctgctga tccccggcct gaccggtgcg atcctcgcgg tggtcctgtt ccagatctat 6421 ctgctgctcg actgtgtcga cggcgaggtc gcccgctggc gcaagcagac ctcgatcacg 6481 ggcgtctacc tggaccgcat cggccactac ctgtgcgagg cggccctgct cgtgggcttc 6541 ggcctgcgcg gcgccgacct gttcggcggc ggacggcccg agtggctgtg ggcgttcctc 6601 ggtacgctcg ccgcgctggg cgcgatcctg atcaaggccg agaccgacct ggtcgacgtc 6661 gcccgtacgc gcagcggtct gcccgccgtg aaggacgagg cctccgtgcc gcgctcctcc 6721 ggtctggccc tggcgcgcag gcttgccgcg gcgctgaagt tccaccgcct ggtcggcggc 6781 atcgaggcga gcctgttcat cctggtcgtc gcgatcgccg acttcgtcca gggcgatctg 6841 ttcttctccc gtctcgggat cgccgtcctc gcgggcatcg cgatcgtgca gacgttcctg 6901 cacctcgtgt ccatcctcgc ttcgagcagg ctgaagtgac ggccgcgggc gcggcctccg 6961 gcctgaaggt cggcgccgtc atcatcacga tgggcaaccg ccccgacgag ctgcgcgccc 7021 tcctggactc ggtcgccaag caggacggcg accgggtcga ggtcgtcgtc gtcggcaacg 7081 gctccccggt cgccggtgtc cccgagggcg tacggaccgt cgagctgccc gagaacctcg 7141 gcattccggg cggccgcaac gccggcatcg aggcgttcgg ccccagcggc gccgagatgg 7201 acatcctgct cttcctcgac gacgacggcc tgctggccca ccacgacacg gccgagctgt 7261 gccgccaggc cttcgccgcc gacccgaagc tcggcatcat cagcttccgg atcgcggacc 7321 ccgacacggg cgagacccag cgccgccacg tcccgcggct gcgcgcggcc gacccgatgc 7381 gctcctcacg cgtgacgacc ttcctcggcg gcgccaacgc cgtacgcacc aaggtcttcg 7441 ccgaggtcgg cgggctgccg gacgagttct tctacgccca tgaggagacc gatctggcct 7501 ggcgggcgct cgacgcgggc tggatgatcg actaccgctc ggacatggtc ctgtaccacc 7561 cgacgaccgc tccttcccgg cacgccgtct accaccgcat ggtggccagg aaccgggtct 7621 ggctggcacg gcgcaacctg ccggcccccc tggtgcccgt ctacctcggc gtatggctgc 7681 tgctcaccct cgcgcggcgc ccctccgtac ccgctctgag agcatggttc ggcggcttca 7741 aggaggggtg gacgacctca gcgggtccgc gccgtccgat gagatggcgt accgtctggc 7801 gactgacccg gctcggccga cctcctgtca tctgacaagc tcgggtctga aagcatcggg 7861 ggcacgcggg acgtgtgccc ggtcctgcgc cttactgcct gcgcatcttg aacacgaaag 7921 tttcaacttg tgagtgacac aacccgtgac ggtgcggtcg cgacgagcgc cccgccatct 7981 cccgacgacg gacttacccc ggcggagctc gccgcgaagt acggcctgtc ggtgagcggt 8041 gcccggccgg ggctgatcga gtacgtccgg cagctgtggg gccggcgcca cttcatcctc 8101 gcgttctcgc aggcgaagct gaccgcccag tacagccagg ccaagctcgg ccagctgtgg 8161 caggtggcga ctccgctgct gaacgccctc gtgtacttcc tcatcttcgg cctgatcctg 8221 gaggccgaca ggggcatgga ccgggaggtg tacgtcccct tcctggtcac cggtgtgttc 8281 gtcttcacct tcacccagag ctcggtgatg gccggtgtcc ggtccatctc gggcaacctg 8341 ggcctggtca gagcgctgca cttcccgcgt gcctcgctgc cgatctcctt cgcgctccag 8401 cagctccagc agctgctgtt ctcgatgatc gtgctggtga tcatcgcggt ggcgttcggc 8461 agctacccct cgctctcctg gctgcttgtc atccccgcgc tggccgtgca gttcgtcttc 8521 aacatcggcc tcgcgctgat catggccagg ctcgggagca agacccctga cctcgcccag 8581 ctcatgccct tcgtgatgcg tacgtggatg tatgcctccg gagtgatgtt cagcatcccg 8641 gccatgctgg ccgacaagga cctgcccggc tgggtcacgg acgtgctcca gtggaacccg 8701 gcggccgtct acatggatct catccgtttc ggcctgatcg acggctacgg ctccgagaac 8761 ctgcccccgc acgtctgggg cgtcgccctg ggctgggcgc ttctcgtggg cctcgtgggc 8821 ttcgtgtact tctggaaggc tgaggagagg tacggccgtg gctgacgaca acaagcaggg 8881 acgcgtcccc accgtcatcg cggacgacgt gcacatcgtg taccgcgtca acgccggcgc 8941 gggcgggcgc ggcagcgcca ccgccgccct gagccgcatc atgaagcgcg gcaagggcga 9001 ctcgcccggc gtacgcaagg tgcacgcggt gcgcggtgtc tccttcacgt cctaccgggg 9061 cgaggcgatc ggcctgatcg gcaccaacgg gtcgggcaag tcgaccctgc tgcgcgccat 9121 cgcgggcctc ctgccgaccg agcagggcaa ggtctacacg gacggccagc cctcactcct 9181 cggtgtgaac gcggcgctga tgaacgacct gaccggcgag cgcaacgtca tcctcggcgg 9241 cctcgccatg ggtatgacac gtgaggagat ccgcgagcgc tatcagcaga tcgtcgactt 9301 ctcgggtatc aacgagaagg gcgacttcat caccctgccg atgcgcacgt actcctccgg 9361 catggccgcc cgtctgcgtt tctccatcgc cgccgccaag gatcacgacg tactgatgat 9421 cgacgaggcg ctggcgacgg gcgaccggaa gttccagatc cgctccgaac agcgcatccg 9481 cgagctccgc aaggaggccg gcacggtctt cctggtcagc cacagcaaca agtcgatcag 9541 ggacacctgt gaccgcgttc tgtggctgga gaagggcgag ctgctgatgg acggtcccac 9601 ggacgaggtc ctcaaggcct acgagaagga aaccggccgc tagagcgggt cctgcgggcg 9661 cggtgccccc gccgggccgc cccggcgggg gtttcgcgtg tccgggtgca ccctttccgc 9721 gtttccgtgc ggcgcgagcg gcccggcgcg ccgtaacatg ccggaatcgt caactccggc 9781 cacttacggg aaggttgagg catatcgcag cgttgttgtc atccgcccaa caccccggcg 9841 cggtgccggg cgttgtacaa cgtaagctgt accggtgggc ggcgtgtccg aaatgggatg 9901 tattaggtcg acggtgtaga acgggagatg tgacggcaat ggcgatggaa tctctccagc 9961 tcgaagacgc ttccgccgtc cccgcaccgg gcagcccgcg atgaccacaa cccgtcacgt 10021 gcgcaccgac gcacacgcgc gcacgaccct ggacaaggcc gccgacgaga acttcccggt 10081 ggctcccttc tttctgccgc gcgcctggcg ggacgacctg atggccgtct acggctacgc 10141 ccgcctcgtc gacgacatcg gcgacggcga cctcgccccc ggcggcgccg acgcccgcct 10201 gctcggcctc gaccccgccc tcgccgacga ccgcctgctc ctgctcgacg ccttcgaggc 10261 cgacctgcgc cgcgtcttcg acgcctcggg agacggcccg cgccaccccc tcctgcgcgc 10321 cctcgtgccg accgtgcgcc gctgctcgct cacccccggc ccgttcctcg ggctcatcga 10381 agccaaccgg caggaccagc tcgtacgccg ctacaagacg tacgacgatc tcctcgcgta 10441 ctgcgagctg tcggcgaacc ccgtcggccg gctcgtcctc cagatcaccg gcaccgcgag 10501 ccccgagcgg atccgccgct ccgacgcggt ctgcaccgcc ctgcagatcg tcgaacacct 10561 ccaggacgtc gccgaggacc tcggccgcga ccggatctac ctgcccgccg acaccatggc 10621 gcgcttccat gtcgaggagg ccgatctggc cgcgccgtcc gcgaacgcgt cggtgcgctc 10681 cctgatcgcg tacgaagcgg aacgcgccgg gcgcctcctg gacgagggca ctccgctcgt 10741 ggccagtgtc agcggcaggc tcaagctgct cctcgccggt ttcgccggcg gcggccgcgc 10801 ggccctcgcg gcgatcgcgg ccaccggcta cgacgtgctc cccggaccgc ccaagcccac 10861 caagctcagc ctgctgcgcg cagtgggagc tgtcttgcga tgagaccgaa gagaggggtg 10921 agccggcccg tggaaggact cacgcagatg tccgcaccgg tactcgccgc gtacagctac 10981 tgcgaggccg tgaccggagc gcaggcgcgc aatttcgcgt acggcatcag gctgctgccg 11041 accgacaagc gcaacgccat gtccgcgctg tacgcgttct cacggcgcgt ggacgacatc 11101 ggcgacggtg tgctggagcc cgccgtcaag caggtgcggc tcgaagagac ccgcgcactg 11161 ctcgaccgga tccggacggg cgccgtcgac gacgacgaca ccgacccggt cgccgtggcg 11221 ctctcggaca cggcgcgccg cttcccgctg ccgctcgaag ggctcgacga actcatcgac 11281 ggcgtcctca tggacgtacg cggcgagacc tacgagacct gggacgacct gaaggtctac 11341 tgccggtgcg tcgcgggcgc catcggacgg ctctcgctgg gcgtgttcgg cacccagacc 11401 ggcgcgcgct ccacggaacg cgcctcggag tacgccgata cactcggtct cgccctccag 11461 ctgaccaaca tcctccggga cgtccgcgag gacgccggga acgggcgtac ctacctgccc 11521 gccgacgacc tcgccaagtt cggctgctcg gccggtttcc acggcgccac cccgcccgaa 11581 gggtccgact tcacgggcct gatccacttc gaagtacgcc gcgcccgcgc cctgttcgcc 11641 gagggctacc ggctgcttcc cctgctcgac cggcgctccg gcgcctgcgt cgccgcgatg 11701 gccggcatct accggcggct ccttgaccgc atcgagcgcg accccctggc cgtactgcgc 11761 ggccgggtct ccctgccggg acacgagaag gcgtacgtcg ccgtgcgcgg cctgtcgggc 11821 ctcgactccc ggcacatctc ccgcagtcac atctggaggc gggtctgatg ggccccgttg 11881 cggccggcgg gacgaacgcg tacaagagag gcgcaacccc ccggcggccc ggtgcgtccc 11941 tgagtgcaac ggcccgtcgt ccgtacaccg tcggcgagcc ccgaggggag ggcgcatgac 12001 acgcggcacc acaggcggca ggtccgtcgt gccggcccgc tcgccggggc ccgcggcagg 12061 tcacgcggtc gtcgtcggag gcggactggc cggagtcacc gccgcgctcc agcttgccga 12121 cgccggcatg cgggtgacgc tgctggaagg gcgcccgcgc ctgggcggcc tcgcgttctc 12181 cttccgccgc ggcgagctca ccgtcgacaa cggccagcac gtctacctgc gctgctgcac 12241 cgcctaccgg tggttcctcg accgggtcga gggcgccgga cttgccccgc tgcagagccg 12301 cctcgacgta cccgtcctcg acgtcggccg gcccgccggg ccccggctcg ggcggctgcg 12361 ccgcacggcg ctgcccgtac cgctgcacct ggccgcgagc ctggcggcgt atccgcatct 12421 ctccctcgcc gagcgggcga gcgtggggcg tgccgctctg gcgctcaaga agctcgatcc 12481 ggccgatccc gccctggacg gtgtggactt cgccacctgg ctgggccgcc acggacagtc 12541 gccgcgcacc atcgaggcgc tgtgggacct cgtcggcgtc gcgacgctca acgcgaccgc 12601 gcccgacgcc tccctgggcc tcgccgcgat ggtcttcaag accgggctgc tctccgaccc 12661 cggcgccgcc gacatcggct gggcgcacgt accgctcggc gatctgcacg acacactcgc 12721 ccgcaaggcg ctcgactccg cgggtgtacg gaccgcactg cgcacccggg tcagctccct 12781 ctcccgttcg gtgagcgggg gctggtccgt cggcacggcg gccggggagc ggatcgaggc 12841 cgacgccgtc gtgctcgccg taccgcagcg cgagacgcac gccgtgctgc ccgagggtgt 12901 catcgacgac cccggccggc tgctggacat cggcaccgcg ccgatcctca acgtccatgt 12961 cgtgtacgac cgcaaggtgc tgcgccggcc cttcttcgcc gcgctcggca gccccgtcca 13021 gtgggtcttc gaccggacgg agtcctccgg gctgaggggc ggcggacagt acctcgccgt 13081 ctcgcagtcg gcggcccgag gcgagatcga cgcgcctgtg gccgagctgc gcgcacgcta 13141 tctgcccgag ctggaacggc tgttgcccgc cgcgcgcggc gcgggcgtac gcgacttctt 13201 cgtcacccgg gagcggacag cgacgttcgc ccccaccccc ggcgtcggac ggctgcgccc 13261 cggcgcccgt acgcacgcgc ccggcctgta cctggccggc gcgtggaccg ccaccggatg 13321 gcccgcgacc atggagggcg ctgtgcgcag cggcatcacc gccgccggtg cggccctccg 13381 cgaactcggc cgcgtacatg aacatccgtt gcaggaggcg gtatgagcac tggcatggaa 13441 acaagaggag agcctgtgac cccggtgaac ccggtggaca acaccacggt ggacatcgcc 13501 gcgctgctgg agcgcggccg caccctgtcc accccggtgc tgaaggccgc cgtgggccgg 13561 ctcgcgccgc ccatggacac cgtcgcggcg taccacttcg ggtggatcga cgcccagggg 13621 cggccggccg acggcgacgg cggcaaggcg gtgcgtccgg cgctcgcgct gctgtcggca 13681 caggcggcgg gcgcggcggc cgaggtcggt gttcccggcg ccgtcgccgt cgaactcgta 13741 cacaactttt cgctcctcca cgacgacctg atggacggcg acgagcagcg caggcaccgc 13801 gacaccgtgt ggaaggtgca cggacccgcg caggcgatcc tcgtcggcga cgcgctcttc 13861 gccctcgcca acgaggtgtt gctcgagctc ggcacggtcg aggcggggcg cgccacgcgc 13921 cggctcacca ccgcgacccg gaagctcatc gacggtcagg cgcaggacat ctccttcgag 13981 caccgcgagc gggtcaccgt cgaggagtgc ctggagatgg agggcaacaa gacgggcgcg 14041 ctgctggcct gtgccgtgtc catcggcgcg gtgctcggcg gggcggacga ccgtacggcg 14101 gacacgctgg aggcgtacgg ctaccacctc ggtctcgcct tccaggccgt cgacgacctg 14161 ctcggcatct ggggcgaccc ggacgccacc ggcaagcaga cctggagcga cctgcgccag 14221 cgcaagaagt ccctgcccgt cgtcgcggcg ctggcggccg gcggtccggc gtcggagcgg 14281 ctcggcgagc tgctctccgc cgacgccaag agcaacgact tcgagagctt ctccgagcag 14341 gagttcgcca ccagggcggc tctgatcgag gaggcgggcg gccgtgactg gacggcgcag 14401 gaagctcgca ggcagcatgc cgtcgccatc gaggcgctgg acggcgtcga catgcccccg 14461 caggtccgtg cgcagctcac cgcgctcgcg gacttcgtcg tcgtacgaaa gagatgatca 14521 gcatccgcct catataactc gcaagtcgcc gtccggtgcc gggattcgcc gtaaaccccg 14581 caccggacga cgtcggaccc cagctcagca gatgaccaac tgccacgaag gggaaaccat 14641 gacagcgacg accgacggaa gccctggggc catgaagccc tgcgcggccg cggccagcga 14701 atttaccgac aacaacacca ataccaccac cacagagatc acccgtcacg ggccggtcat 14761 cgccggcgtt ctggacgccg cgcggcgggc cacggaccgc ggggtcgcgc atctgctcgc 14821 acagcaggac gaccagggct ggtggaaagg tgacctcgaa accaatgtga ccatggacgc 14881 cgaggacctg ctgctgcgtc agttcctggg catccaggac gagaagaccg tccgggccgc 14941 cgcgctcttc atccgcggcg agcagcgggg cgacggcacc tgggccacct tctacggcgg 15001 tcccggcgac ctctccacca ccatcgaggc gtacgtcgcc ctgcggctcg ccggtgacct 15061 gccggacgct ccgcacatgg agcgcgccgc ggcctggatc agggcgcgcg gcggcatcgc 15121 cgcgagccgc gtcttcaccc ggatctggct ggcgctgttc gggtggtgga agtgggacga 15181 tctgcccgag ctgccacccg agctgatctt ccttccgtcg tggttcccgc tgaacatcta 15241 cgacttcggc tgctgggcca ggcagaccat cgtgcctctg accgtggtct ccgcgaagcg 15301 gcccgtgcgt cccgcgccgt tcgcgctcga cgagctgcac gccaacccac gcgtgccgaa 15361 cccacgcaag aggctttcca cgcctaccag ttgggagggt gccttccagc ggctcgacaa 15421 ggccctgcac gtctaccgca aggtcgcgcc ggccaggctg cgcggcgccg cgatgaagag 15481 cgcggcgcgc tggatcgtcg agcggcagga gaacgacggc tgctggggcg ggatccagcc 15541 acccgccgtg tactccctga tcgcgctcca cctgctcgga tacgacctgg accacccggt 15601 gatgcgcgcc ggcctggagt cgctcgaccg gttcacggtg tggcgcgagg acggcgcccg 15661 catgatcgag gcctgccagt cgccggtctg ggacacctgc ctcgccacca tcgcactcgc 15721 cgacgcgggg gtacggcccg acgatccggc cctcgtcaag gccgccgact ggatgatgag 15781 cgagcagatc gtacggcccg gggactgggc cgtacgccgc cccggagtcg agccgggcgg 15841 ctgggcgttc gagttccaca acgacaacta ccccgacatc gacgacaccg ccgaagtcgt 15901 cctggcactg cggcgcgtcg cgcacccgga ccgcgcgcgg acagaggcga ccgtcgaccg 15961 cgccgtccgc tggaacctcg gcatgcagtc gaagaacggc gcctgggcgg ccttcgacgc 16021 cgacaacacc agccctttcc ccaaccggct gcccttctgc gacttcggcg aggtcatcga 16081 cccgccgtcc gccgacgtca ccgcgcacgt ggtggagatg ctggccgtcg aaggcaaggc 16141 acacgacccg cgcacccggc gcggcatcga gtggctgctg gccgaacagg agccgagcgg 16201 tgcctggttc ggccgctggg gcgtcaacta cgtatacgga acaggggcgg tcgtgccggc 16261 actgatcgct gccgggctgc cggccgccca ccccgccgtc cgtcgcgccg tcggctggct 16321 ggagtccgtc cagaacgacg acggcggctg gggcgaggac ctgcgttcgt accgtgaggc 16381 cggctgggcc gggcgcggtg cttcgaccgc ttcgcagacc gcctgggcgc tgctcgccct 16441 gctcgcggcg ggggagcggg acggccggtc cgtggagcgg ggcgtcgcct ggctggccgg 16501 gacccagcgg gaggacggct cctgggacga gccgtacttc acgggcaccg gattcccgtg 16561 ggacttctcc atcaactacc acctctaccg gcaggtcttc ccgctcaccg ccctcgggcg 16621 gtacgtccac ggggagccca cctacccgga cctggccacc cgggaaggga cctgatggcc 16681 cccatggacc cggccgaccg gaccgatccg gctgacgagc cgccgcggcc cgcgccgctg 16741 ctgatcgcct gcgcgctcgg catcgagcag ttcgcgctgc gcagtggaca ccgcaaggag 16801 gcgccggggc ccgtgaccgt actgcgtacg gggatgggcc ccaaggcggc ggcgacggcc 16861 gtcaggcagg cgctcgccca cggcggtccg gccccggacg ccgccgtcat cgcatccggc 16921 ttctgcgcgg ggctggtccc cggaatgcac cccggagacc tgatcgtcgc cgacgagacg 16981 cgcggacccc acggcacttc ggcctgtacg ggcgtggagc tgctcgcgaa ggcgctggac 17041 cgggtggtgc ccggacgcac cgtgcacacc ggcccgctgc tcggctccgc ccatgtcgtg 17101 aggggacccg agcgggcccg gctggcggcc accggggcga tcgccgtgga catggagtcc 17161 gccgcgacgc tcggcagcgc gctgcgggcc ggaccgcgtc cggttgcggc cgttcgggtg 17221 gtcgtggacg ctccggagca cgagctcgta cgcattggaa cgcttcgcgg tggaatatcg 17281 gcatttcgtg ttcttcgcgc agtgctgcca gccttttttg aatggcaccg ttcttcgctg 17341 ctccccagga ggtgagctag atggccatgc cgctccgtca gaccatcagg gtcgggacgt 17401 atctcgtaga acaaaagctc cgcaagcgag agaagttccc gctcatcgtc gagctcgaac 17461 cgctctacgc ctgcaacctc gcgtgcgagg gctgcggcaa gatccagcac ccggcggggg 17521 ttctgaagca gcgcatgccc gtggcgcagg ccgtgggagc ggtcctggaa tccggtgctc 17581 cgatggtctc catcgcgggc ggcgagccgc tgatgcatcc tcagatcgac gagatcgtgc 17641 ggcagctggt ggccaagagg aagtacgtat tcctctgcac caacgcgatg ctgctgcgca 17701 agaagatcga gaagttcacg ccgtcgccgt acttcgcctt cgccgtgcac atcgacgggc 17761 tgcgggagcg gcacgacgaa tcggtcgcga aggaaggcgt cttcgacgaa gcggtggagg 17821 cgatcaagga ggccaagaag cgcggtttcc gggtcaccac caattccacc ttcttcaaca 17881 ccgacacccc gcagacgatc atcgaggtgc tcaattacct caatgacgac ctggaggtgg 17941 acgagatgat gatctcgccc gcctacgcgt acgagaaggc tcccgaccag gagcacttcc 18001 tcggggtcga gcagacgcgg gagctgttca agaaggcgtt cgcgggtggc aaccggcggc 18061 gctggcggct gaaccactcg ccgctcttcc tggacttcct ggagggcaag gcggacttcc 18121 cgtgcacggc gtgggcgatc cccaactact ccctcttcgg gtggcagcgg ccctgctacc 18181 tgatgagcga cgggtacgtc ccgacgtacc gggagctcat cgaggagacc gactggagca 18241 agtacgggcg gggcaaggac ccgaggtgcg ccaactgcat ggcgcactgc ggctacgagc 18301 cgacggccgt cctcgcgacc atgggctcgc tcaaggagtc cctgcgagcg gtccgggaga 18361 cggtctcggg aaaccacggg tgacgtcatg acccttggtg agcctgtcgc actggggctc 18421 cccgagctgc cggcccggcc cctggcggtg cgccgcgctt cgcggcgtat ccaggtcggg 18481 tcggtggcgg tcggcgggga cgcgccggtc tcggtgcagt cgatgacgac gacgaggacg 18541 tccgacgtcg gggcgacgct ccagcagatc gcggagctga cggcttcggg ctgccagatc 18601 gtacgggtgg cgtgtcccac ccaggatgac gcggacgcgc tggcgaccat cgcgcgcaag 18661 tcgcagatcc cggtgatcgc ggacatccac ttccagccga agtacgtctt cgccgccatc 18721 gacgccgggt gcgcggcggt gcgggtgaat ccgggcaaca tcaagcagtt cgacgacaag 18781 gtgaaggaga tcgccaaggc ggcctccgag acccgtacgc cgatccgcat cggcgtgaac 18841 gccgggtcgc tggacgcgcg gctgctgcag aagtacggga aggcgacccc ggaggcgctg 18901 gtcgagtcgg cgctgtggga ggcgtctctc ttcgaggagc acggtttcgg tgacatcaag 18961 atctcggtca agcacaacga cccggtcgtg atggtcaacg cgtaccggca gcttgccgcc 19021 cagtgcgact atccgctgca //