LOCUS NZ_KZ195574 28278 bp DNA linear CON 06-SEP-2024 DEFINITION Streptomyces sp. CS113 scaffold00001, whole genome shotgun sequence. ACCESSION NZ_KZ195574 VERSION NZ_KZ195574.1 KEYWORDS WGS; RefSeq. SOURCE Streptomyces sp. CS113 ORGANISM Streptomyces sp. CS113 Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales; Streptomycetaceae; Streptomyces. REFERENCE 1 (bases 1 to 8695358) AUTHORS Malmierca,M.G., Gonzalez-Montes,L., Perez-Victoria,I., Sialer,C., Brana,A.F., Garcia Salcedo,R., Martin,J., Reyes,F., Mendez,C., Olano,C. and Salas,J.A. TITLE Searching for Glycosylated Natural Products in Actinomycetes and Identification of Novel Macrolactams and Angucyclines JOURNAL Front Microbiol 9, 39 (2018) PUBMED 29441046 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 8695358) AUTHORS Malmierca,M.G., Mendez,C., Olano,C. and Salas,J.A. TITLE Searching for glycosylated natural products in actinomycetes isolated from leaf-cutting ants: activation of two silent clusters and identification of a novel macrolactam JOURNAL Unpublished REFERENCE 3 (bases 1 to 8695358) AUTHORS Malmierca,M.G., Mendez,C., Olano,C. and Salas,J.A. TITLE Direct Submission JOURNAL Submitted (03-MAY-2017) Biologia Funcional, Area de Microbiologia, Universidad de Oviedo, Julian Claveria S/N, Oviedo, Asturias 33006, Spain COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Newbler v. 2.9 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 78.17x Sequencing Technology :: Illumina MiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Name :: GCF_002188365.1-RS_2024_09_06 Annotation Date :: 09/06/2024 02:31:13 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 6.8 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA Genes (total) :: 7,790 CDSs (total) :: 7,719 Genes (coding) :: 7,614 CDSs (with protein) :: 7,614 Genes (RNA) :: 71 rRNAs :: 2, 1 (5S, 16S) complete rRNAs :: 2 (5S) partial rRNAs :: 1 (16S) tRNAs :: 65 ncRNAs :: 3 Pseudo Genes (total) :: 105 CDSs (without protein) :: 105 Pseudo Genes (ambiguous residues) :: 0 of 105 Pseudo Genes (frameshifted) :: 35 of 105 Pseudo Genes (incomplete) :: 81 of 105 Pseudo Genes (internal stop) :: 4 of 105 Pseudo Genes (multiple problems) :: 15 of 105 ##Genome-Annotation-Data-END## ##antiSMASH-Data-START## Version :: 8.dev-cf2fc5ee(changed) Run date :: 2025-09-13 09:56:45 NOTE :: This is a single region extracted from a larger record! Orig. start :: 1669636 Orig. end :: 1697914 ##antiSMASH-Data-END## REFSEQ INFORMATION: The reference sequence is identical to KZ195574.1. The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ Bacteria and source DNA available from Prof. Salas's laboratory, A. Microbiology, University of Oviedo, Spain. FEATURES Location/Qualifiers region 1..28278 /candidate_cluster_numbers="1" /contig_edge="False" /product="terpene" /region_number="9" /rules="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /tool="antismash" cand_cluster 1..28278 /candidate_cluster_number="1" /contig_edge="False" /detection_rules="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /kind="single" /product="terpene" /protoclusters="1" /tool="antismash" protocluster 1..28278 /aStool="rule-based-clusters" /category="terpene" /contig_edge="False" /core_location="[10000:18278]" /cutoff="20000" /detection_rule="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /neighbourhood="10000" /product="terpene" /protocluster_number="1" /tool="antismash" proto_core 10001..18278 /aStool="rule-based-clusters" /tool="antismash" /cutoff="20000" /detection_rule="(PT_phytoene_like or phytoene_synt or Lycopene_cycl or Lycopene_cycl_fung or T1TS or T1TS_KS or T2TS or TS_UbiA or TS_Pyr4)" /neighbourhood="10000" /product="terpene" /protocluster_number="1" misc_feature 25..41 /note="TFBS match to ZuR_variant_2, Zinc-responsive repressor, confidence: weak, score: 15.4" /tool="antismash" gene 111..536 /locus_tag="B9W62_RS07150" /old_locus_tag="B9W62_07165" CDS 111..536 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014670594.1" /locus_tag="B9W62_RS07150" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07165" /product="DUF2267 domain-containing protein" /protein_id="WP_087806273.1" /transl_table=11 /translation="MQHDEMTGKVQALAQLPDRGSAERATRAVLETLAERLPSALANHM AAQLPPTLAASVRQRTDSAADGHGSTSGERFDLTVFAGRIAGRAATDEETAIREAAGVL EVLDAALTPELTEKMAGVLPADIRELLPASRAVDESG" gene 714..1844 /locus_tag="B9W62_RS07155" /old_locus_tag="B9W62_07170" CDS 714..1844 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007387049.1" /locus_tag="B9W62_RS07155" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07170" /product="acetate/propionate family kinase" /protein_id="WP_087806275.1" /transl_table=11 /translation="MFEEPFGAVLVVDAGSSSLHLTVFDDDLGVLAERDSSSAPGDHAV GLLRQLLHEAPAPVAVGHRVVHGGPALREHLLVDDHVRDALDGAADMAPLHVPPALTVL DAARDLLPDIPHAACLDTAFHAGLPAAAREYAVPGAWRERYGLRRYGFHGLSYSWALGR AAELLGRRPERLQVVIAHLGGGCSACAVRDGRSVDTTMGFTPLEGLVMAHRSGSIDPGA LTWLQTRHHLSAEEVDTALNQGGGLLALSGTSDDTRDLVRARAGGDERAGFALDVFTHH CRRGIAAMAASLDRLDALVFTGEIGEDQPEVREEVCARLTTLGLTSGLQVPEGATVDRP TVVSAPGAAVPVVVVPTGEGRQVDRETRTLLRQHQA" gene 2137..3375 /locus_tag="B9W62_RS07160" /old_locus_tag="B9W62_07175" CDS 2137..3375 /EC_number="3.1.3.16" /GO_function="GO:0004722 - protein serine/threonine phosphatase activity [Evidence IEA]" /GO_function="GO:0046872 - metal ion binding [Evidence IEA]" /GO_process="GO:0006470 - protein dephosphorylation [Evidence IEA]" /codon_start=1 /gene_functions="other (smcogs) SMCOG1054: hypothetical protein" /gene_kind="other" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_005473538.1" /locus_tag="B9W62_RS07160" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07175" /product="PP2C family protein-serine/threonine phosphatase" /protein_id="WP_256976218.1" /transl_table=11 /translation="MNTSERTSRMLTELLRDSHLAAFDELPSLVARYTEQAGMHDVRIY LADLRQEILREVTGKGLSAAGGGEVYAVEGTLPGRAYTSIRPQLFANGEQDRWWIPILD GTERLGLLCGSLTDDADLEQLQAVASLVGLLVVSKRPSSDAAARLARTQAMNVAAELQW NLMPPRSFANKDVVISAAMEPAYEIGGDAFDYAIADGRVHLGIFDAMGHNSHAGLAANL VVAACRNQRRQGTDLVALGERVEEILLEHFAHETFVTATLAELNTSTGLLTWINRGHHP PVLIRAGRWTTVLHCPPAHPLGAGLETTATLCREQLEPGDRILLYTDGITEARDKQGQE FGLTRFTDFIIRHHADGFPVPETLRRLMRAVLHHHDGRLNDDATVVCVEWHGPSRNTQL SAAPPPVPPPGRS" gene 3642..4016 /locus_tag="B9W62_RS07165" /old_locus_tag="B9W62_07180" CDS 3642..4016 /codon_start=1 /inference="COORDINATES: protein motif:HMM:NF016056.5" /locus_tag="B9W62_RS07165" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07180" /product="thiol-disulfide oxidoreductase DCC family protein" /protein_id="WP_373433667.1" /transl_table=11 /translation="MLLFDGDCGFCTTAVRWIERNVSPHCESVAWQRADLRGIGVTRQR AQREALWVTPAGTVYGGADAVSKLLLSAGGGWSLLGALLMVKPVRRVAHGVYRLVADNR SRLPGTTDACSRSKARTRHT" gene complement(4164..4379) /locus_tag="B9W62_RS07170" /old_locus_tag="B9W62_07185" CDS complement(4164..4379) /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019359421.1" /locus_tag="B9W62_RS07170" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07185" /product="hypothetical protein" /protein_id="WP_087788979.1" /transl_table=11 /translation="MLVKKLHESGIRAEHAYAAGFVSIGLSVVTWCTSLRVEKKGEERA DRWGIFVGEWAPTFFAIGLALSTYEK" gene 4609..4872 /locus_tag="B9W62_RS07175" /old_locus_tag="B9W62_07190" CDS 4609..4872 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016823975.1" /locus_tag="B9W62_RS07175" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07190" /product="hypothetical protein" /protein_id="WP_087806279.1" /transl_table=11 /translation="MDKKQLAEETINTAANGGKDAPLTSTDVERVFDLLFGTVEHPGSI AEALNRRENVSLGSFGSFRMDGGTATFRPGTALTEFLQNKTG" gene complement(4906..5433) /locus_tag="B9W62_RS41190" CDS complement(4906..5433) /codon_start=1 /inference="COORDINATES: protein motif:HMM:NF024937.5" /locus_tag="B9W62_RS41190" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /product="DUF4126 family protein" /protein_id="WP_107424195.1" /transl_table=11 /translation="MNPAAETLIRVGLIGAATGLRSQWGVAALSWSTPPAAGRPLPSAL LTGPWARALTCFTTAAEFVADKAPSTPSRLSAQGMGPRVVLGALAGAALAQRRGALPPF TAVATGVSTAVAGAFAGARWRGAASSPGWSVAVAAAAEDLLAAGLAWAACTSSVRRPLP AGDDEHGTKPSR" gene complement(5430..5747) /locus_tag="B9W62_RS07180" /old_locus_tag="B9W62_07195" CDS complement(5430..5747) /codon_start=1 /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /locus_tag="B9W62_RS07180" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /old_locus_tag="B9W62_07195" /product="hypothetical protein" /protein_id="WP_087806281.1" /transl_table=11 /translation="MIQSMVHGGAAGVASTTVRNAVTYADTVRRGRSAREVPAQVVARL AAEEAVHPGTVRDNRLTGPGALSAIAVGCGAGATVSMPRRTGARMPVWLSGLVTAVEHR S" gene 6338..>6843 /locus_tag="B9W62_RS07190" /old_locus_tag="B9W62_07205" /pseudo="" CDS 6338..>6843 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003978686.1" /locus_tag="B9W62_RS07190" /note="frameshifted; incomplete; partial in the middle of a contig; missing C-terminus; Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07205" /product="NAD(P)H-binding protein" /pseudo="" /transl_table=11 /translation="MAVTGRRYAEGLHCPVTGASGYVGAHLVPELLESGHRVRCLARSS ANLRDQPWAADAESVQGDVPDPRSVADAMRGADVAYCLVHALGSGAHPPHRSPRRVALP GRVRHHAGRRGPGLRRRRSGHAHLPRIDAPLRRRRRAPTAGHRAGTGPRPRGRPATGSA W" gene 6981..8354 /locus_tag="B9W62_RS07195" /old_locus_tag="B9W62_07210" CDS 6981..8354 /EC_number="4.1.99.3" /GO_function="GO:0003677 - DNA binding [Evidence IEA]" /GO_function="GO:0003904 - deoxyribodipyrimidine photo-lyase activity [Evidence IEA]" /GO_function="GO:0050660 - flavin adenine dinucleotide binding [Evidence IEA]" /GO_process="GO:0006281 - DNA repair [Evidence IEA]" /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003978685.1" /locus_tag="B9W62_RS07195" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07210" /product="deoxyribodipyrimidine photo-lyase" /protein_id="WP_087806285.1" /transl_table=11 /translation="MTVAVVLFTSDLRLHDHPPMHAALKAADEVVPLFVLDPGIRTAHF DAPNRRAFLADCLNDLDASLRRRGGRLVVRAGPVAREVRAVAAECGAGEVHMAAGVTAY ARHREERLRKELDDSDVALRVHEAVLTALAPGAVTPSGSDHFAVFTPYFRRWSQESLRQ PLRAPRTVRVPDGVRGEPLPERADVSGTSPGLPAGGETAGRDRFSRWSRSGLSRYADRH DDLAGDATSRLSPYLHFGVLSPAELVHRSRERGGPGAEAFVRQVCWRDFHHQVMAARPS ASGKDYRTRHDRWRGEDEAAEDIAAWREGRTGYPVVDAAMRQLRHEGWMHNRARLLAAS FLTKTLYVDWRIGARHFLDLLVDGDVVNNQLNWQWVAGTGTDTRPHRVLNPLVQARRFD PDGTYVRRWVPELTGVDGKRVHEPWRLPAGQRDALDYPEPVIDLADGLARFKHARGRD" misc_feature complement(7021..7034) /note="TFBS match to ArgR, Regulator of arginine biosynthesis genes, confidence: weak, score: 16.87" /tool="antismash" gene 8368..9864 /locus_tag="B9W62_RS07200" /old_locus_tag="B9W62_07215" CDS 8368..9864 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018843427.1" /locus_tag="B9W62_RS07200" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07215" /product="cryptochrome/photolyase family protein" /protein_id="WP_087806287.1" /transl_table=11 /translation="MHWLFGDQLGPHFLTPGDEGPGHDTPLLMIEARSVFRRRRFHRAK AHLVLSAMRHRAAELGDRVTYVRADTYRDGLDRAARGRRVGLHHPTSHAALRLVRALPR VAVGPARGFLVPMADFTAWADDHGGKRLRQEDFYHWVRRGHDLLMDGGQPAGGQWNLDH DNREPPPRDTTSLQVGRPYRPREDDIDDEVRHDLDRWERDGDVSFVGRDGPRLFPATRA EARRALRRFVEHRLATFGPYEDAMLAADPVMSHSLLSSSLNLGLLDPAECVETAERAWR EGRAPLNSVEGFVRQVAGWREYVWQLYWYFGEDYRRSNTLRHTTPLPDWWNDLDADAVR ANCLHTVLSQVRDTGWTHHIPRLMILGSHALQRGWDPAAVTDWFHRCFVDGYDWVMLPN VVGMSQYADGGRMTTKPYTSGGAYIKRMSDLCGPCAYRPGDRTGERACPYTAGYWAFLD RHRDRLAGNQRIAQPVRQLDRLSDLTEVREQERARGDTPP" misc_feature complement(9939..9949) /note="TFBS match to HypR, L-hydroxyproline utilization repressor, confidence: weak, score: 18.59" /tool="antismash" gene 10001..11143 /locus_tag="B9W62_RS07205" /old_locus_tag="B9W62_07220" CDS 10001..11143 /EC_number="2.5.1.-" /GO_function="GO:0004659 - prenyltransferase activity [Evidence IEA]" /GO_function="GO:0046872 - metal ion binding [Evidence IEA]" /GO_process="GO:0008299 - isoprenoid biosynthetic process [Evidence IEA]" /codon_start=1 /gene_functions="biosynthetic (rule-based-clusters) terpene: PT_FPPS_like" /gene_functions="biosynthetic-additional (smcogs) SMCOG1182: Polyprenyl synthetase" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018837939.1" /locus_tag="B9W62_RS07205" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07220" /product="polyprenyl synthetase family protein" /protein_id="WP_179235380.1" /sec_met_domain="PT_FPPS_like (E-value: 1e-45, bitscore: 154.9, seeds: 145, tool: rule-based-clusters)" /transl_table=11 /translation="MPGLPPTVAPPAGRVDAPLTVPDAANAVGTVLERVLDERLRHSLA IDPVFARELADRLIALVGRGGKRLRTAFTHCGWRAAGGSGDTGAVLRTGAALELLQACA LVHDDVMDGSVQRRGAPALHVELARAHWASGMHGSAEAFGTSAAVLAGDLALAWADDLL TETALGTPHGSLLLGEWRAMRTEMVAGQYLDLRAQAARSSGVDEALAIATLKSALYTVA RPLALGASLAGADAQVVDALRAAGRCAGLAFQLRDDLLGAFGDPALTGKPTDDDLRSRK LTYLLAVALQLADAADDHQAAARLAPDAVPRSEHAVQRVRAALQRTGARDLVEARIEEL TDMSLGHFVRTGAPPAVQDEFSTLVQHATGVRPRRAEEVA" gene 11140..12711 /gene="crtI" /locus_tag="B9W62_RS07210" /old_locus_tag="B9W62_07225" CDS 11140..12711 /EC_number="1.-.-.-" /GO_function="GO:0016166 - phytoene dehydrogenase activity [Evidence IEA]" /GO_process="GO:0016117 - carotenoid biosynthetic process [Evidence IEA]" /codon_start=1 /gene="crtI" /gene_functions="biosynthetic-additional (rule-based-clusters) DAO" /gene_functions="biosynthetic-additional (smcogs) SMCOG1222: dehydrogenase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018103068.1" /locus_tag="B9W62_RS07210" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07225" /product="phytoene desaturase family protein" /protein_id="WP_256976219.1" /sec_met_domain="DAO (E-value: 1.8e-17, bitscore: 62.7, seeds: 521, tool: rule-based-clusters)" /transl_table=11 /translation="MKRVPGPTDHVVVVGAGLSGLACALHLLGAGRRVTLVERDAGPGG RAGRVRRGGYELDTGPTVLTMPHLADEAFAAVGDSLHRRVELTALHPAYRACFADGSSL DVHTDGEAMEAEVRRFAGPAQAAGYRDLRQWLERLYRAQMRRFIDTNFDSPAQLLHPDL ARLAALGGFGRLDGRIGRFLSDDRLRRVFSFQALYAGVAPARALAAYAVIAYMDTVAGV WFPKGGMHALPRAMADAAADAGADLRWRAEVSTLERSSGRVRAVHLTSGERIPCDAVVL TCELPAAYRLLERTPRRPARLRHSPSAVILHAGTDRTWPHLAHHTISFGAAWERTFDEL TRTGELMSDPSLLITRPTSHDPDLAPPGRHLHYVLAPCPNTTVGPSAATWRDLGPRYRD RLAGELERRGLDGFTDSIEEELLVTPLDWTAQGHAAGSPFSVAHTFAQTGPFRPRNLVR EVENVVLAGCGTTPGVGVPTVLISGKLAAARVTGGSVARPVGARTPPAAADEPSVPASA GADDAA" gene 12698..13693 /locus_tag="B9W62_RS07215" /old_locus_tag="B9W62_07230" CDS 12698..13693 /EC_number="2.5.1.-" /codon_start=1 /gene_functions="biosynthetic (rule-based-clusters) terpene: PT_phytoene_like" /gene_functions="biosynthetic (rule-based-clusters) terpene: phytoene_synt" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018837937.1" /locus_tag="B9W62_RS07215" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07230" /product="phytoene/squalene synthase family protein" /protein_id="WP_087806291.1" /sec_met_domain="phytoene_synt (E-value: 2.2e-123, bitscore: 409.6, seeds: 8, tool: rule-based-clusters)" /sec_met_domain="PT_phytoene_like (E-value: 4.4e-64, bitscore: 215.4, seeds: 61, tool: rule-based-clusters)" /transl_table=11 /translation="MTRRELDAAGITDPALRTAYTRCRRLNARHGKTYFLATRLLPLER RSAVHALYGFARWADDIVDDLDRTLAPEERDRLLRRLESDLMSGLRSGGGDEPVVRAVA DTATRYAIEPVLFADFMSSMRADLTVTDYPTYADLQGYVHGSAAVIGLQMLPVLGTVTV REEAAPHAAALGVAFQLTNFLRDVGEDLDRGRVYLPGDLLAAHGVDRPLLEWSRHTGRT DPRIRAALVAAEAMTREVYRTAEPGIAMLDPRVRPCIRAAFTLYGGILDAIAEQEYTVL HRRAVVSRRRRAATAATGVLRVAGARWRAHAPSRKATAGGPAVERKEPVR" gene 13690..14700 /locus_tag="B9W62_RS07220" /old_locus_tag="B9W62_07235" CDS 13690..14700 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011026912.1" /locus_tag="B9W62_RS07220" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07235" /product="DUF5914 domain-containing protein" /protein_id="WP_087806293.1" /transl_table=11 /translation="MSERTGRQRWTPPLRLRRPGPDWAAQTPTWRQARPALIADALKRA SARPSGNWFVLGASRNVRADGRPYGRTVGGVEIVLWRSDTGDLRAGPGVCPHLGAPLRD SRVACGTLVCHWHGLALDGSPSPGWDPFPVHDDGVLVWVRLDQVGGEEPTDRPAVPVRP ATGSGVDAVFTAVGRCEPQDVVANRLDPWHGSWFHPYSFVDLTVSREPQGEEDDAFVVD VSFRVAGRLVVPVRAEFTAPEPRTVVMRITDGEGATSVVETHATPLTGAGHAHPRTAVV EATIAASDRPGFALARAAAPVLRPLMLHTAGRLWRDDLAYAERRWALRSTGRFPG" gene complement(14723..16291) /locus_tag="B9W62_RS07225" /old_locus_tag="B9W62_07240" CDS complement(14723..16291) /GO_function="GO:0016491 - oxidoreductase activity [Evidence IEA]" /codon_start=1 /gene_functions="biosynthetic-additional (rule-based-clusters) DAO" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003978679.1" /locus_tag="B9W62_RS07225" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07240" /product="FAD-dependent oxidoreductase" /protein_id="WP_087806295.1" /sec_met_domain="DAO (E-value: 1.1e-13, bitscore: 50.4, seeds: 521, tool: rule-based-clusters)" /transl_table=11 /translation="MTTRRTDPARRGRDRKAEVLLPAPGRSRFQPGEGPTVAVIGGGIA GLAAATLLAERGARVTLYEKEASLGGRLSGWPTRLADGSPVTMTRGFHAFFRQYYNLRG LLRRTDPALARLTPLPDYPLRHSGGLTDSFARVPRTPPLSALGFVALSPTFGWRDLAAM DARAALPLLDVRVPEVYERFDEFSATGFLEGVRFPEAAHHLAFEVFSRSFFADPRELSA AELLLMFHIYFLGSAEGLLFDVPSEPFPQALWDPLAGYLQRLGADIRTGTPVHGVLPAG DAGADVLTDTGTGRHQAVVLALDPGGLRRVVGASPGLGTSGWREDLAALRTAPPFLVSR LWLDRPVRADRPGFLGTSGYGGLDNISVLERYEGEAARWAERTGGSVVELHAYAVDTGA ERKETQDMLVDRLHEVYPETRRARVVDARHEWRSDCPLFPVGSYHRRPTVRTPHPWLTL AGDAIRCDLPVALMERAATTGFLAANALLADRGVRGQVLWTVPRAGRSSVLRALGAVAG RRSSP" gene complement(16288..17064) /locus_tag="B9W62_RS07230" /old_locus_tag="B9W62_07245" CDS complement(16288..17064) /EC_number="2.1.1.-" /GO_function="GO:0008168 - methyltransferase activity [Evidence IEA]" /codon_start=1 /gene_functions="biosynthetic-additional (smcogs) SMCOG1089: methyltransferase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019329054.1" /locus_tag="B9W62_RS07230" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07245" /product="class I SAM-dependent methyltransferase" /protein_id="WP_087806297.1" /transl_table=11 /translation="MSLLRDHDLARAFDHASHSYDRLTSLNPGYRTGLLRSARRLRLPD DGAGLHLLDLGCGTGASTRALLRAAPRARITAVDASAGMLRRALAKPWPVRVRFLHLTA EEVATAGEGPFDAVFAAYLFRNVTDPDAVLGSVRTLLRPGGRLAVHEYSLSGALRHRAL WSAVCRGVVVPAGTLTGDRALYRHLRHSVAVFDTAPSFADRLTRAGFTGVRVAPVAGWQ TGIVHTFVARGGPTPANTAPATSTPASSSPGREHAQ" gene complement(17061..18278) /locus_tag="B9W62_RS07235" /old_locus_tag="B9W62_07250" CDS complement(17061..18278) /codon_start=1 /gene_functions="biosynthetic (rule-based-clusters) terpene: Lycopene_cycl" /gene_kind="biosynthetic" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019329053.1" /locus_tag="B9W62_RS07235" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07250" /product="lycopene cyclase family protein" /protein_id="WP_087806299.1" /sec_met_domain="Lycopene_cycl (E-value: 1.1e-103, bitscore: 346.1, seeds: 20, tool: rule-based-clusters)" /transl_table=11 /translation="MTPRSTEVSDVIVVGGGAAGLGLAHRLTETGAATVTVIEPPDGPL RPAERTWCYWGEGADGLDEVVGASWSRLRLHGADGRPVTVDPAPFSYRMVRSTDFERLI HGRLARSDGGRLLRATVGSVKGVPGGAEVRCTLPGGWSLTLRARRVFDSRPLRALPPAR TQLLQHFRGWFVRTGSARFDPAVADLMDFRVPQPAHGLAFGYVLPLTPDRALVEYTEFS RDVLSTEAYESALGHYCRDVLGLGALTVERAEQGVIPMTDARFPRRVGAAVFRIGAAGG ATRPATGYTFAAMQRQSRAVAAALRDGHGTVLPAPHGRRALAMDAILLRALDTGRIDGP DFFTGLFRRVPAERLLRFLDGATSLREEWGIGLRTPVRPMLRTAAELPFLPRRSRRTAR NGDDHR" gene complement(18528..19778) /locus_tag="B9W62_RS07240" /old_locus_tag="B9W62_07255" CDS complement(18528..19778) /EC_number="1.-.-.-" /GO_function="GO:0009055 - electron transfer activity [Evidence IEA]" /GO_function="GO:0016491 - oxidoreductase activity [Evidence IEA]" /codon_start=1 /gene_functions="biosynthetic-additional (rule-based-clusters) DAO" /gene_functions="biosynthetic-additional (smcogs) SMCOG1222: dehydrogenase" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016325003.1" /locus_tag="B9W62_RS07240" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07255" /product="NAD(P)/FAD-dependent oxidoreductase" /protein_id="WP_087806301.1" /sec_met_domain="DAO (E-value: 3e-10, bitscore: 39.0, seeds: 521, tool: rule-based-clusters)" /transl_table=11 /translation="MSDEHRRAPDAVVVGAGLAGLACALDLCRAGWRVALLEASDGVGG RMRTDRRDGFLLDRGFQVFNTSYPQVKRRLDLRSLRLRPFTAGVIAHTSTGLVRLTDPT REPGAAGALLPGRILSARDLAALAALTARDAVLPVSTTRRRPDRPTSAALSRAGLSDAV ISDVLRPFLSGVFLEDRLETSARFFHLVWRSMVRGSLCLPAEGVGAVPARLADGLPDGV LRLGTPVAEITDAGVLLGDGTEVPARVVVVATDPATAARLLPGLTVPDTRTVTTYYHAT DRAPMAEPTLMVDSTGAVLNTCVLSEVAPTYAPPGTALVSTSVLGTDPPDRGRTVLRRL AELYGTDTSDWHQVAARTIEGALPAMLPPWPLSRSTRLGPGRYVCGDHRATGSVQGALA SGARAAREASADQGPER" gene complement(19992..21011) /locus_tag="B9W62_RS07245" /old_locus_tag="B9W62_07260" CDS complement(19992..21011) /GO_function="GO:0003677 - DNA binding [Evidence IEA]" /GO_process="GO:0006355 - regulation of DNA-templated transcription [Evidence IEA]" /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014157565.1" /locus_tag="B9W62_RS07245" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07260" /product="MerR family transcriptional regulator" /protein_id="WP_087806303.1" /transl_table=11 /translation="MSTQRESGEDRFGDGDGDRQEGGLTTGEVARRLGVAPTTVRTWDR RYGLGPDAHTDGRHRRWTAADVARLERMCALTATGLPPAEAARLARSEEPSAAPSRSAA GPPPASSPARRSRAGSGLRLGDARQECRGIARAALRLDAAVLDELLLAAITEHGLVAAW TEVIVPTLQAVGRKWETSGEKYVEVEHFLSWHVSGALRRAAPRTVADRPGATTVLACVP GENHTLPLEVLSAALTERGVPVRMFGGALPVESLVAAVRRTGPAAVGLWAQSRTTASRP LAQHVAAMEWGVRGARRRPVVLTLGPGWAGRTVAGLARPSGLAEAVAVLEPLVSAQFS" gene 21174..21782 /locus_tag="B9W62_RS07250" /old_locus_tag="B9W62_07265" CDS 21174..21782 /GO_function="GO:0003700 - DNA-binding transcription factor activity [Evidence IEA]" /GO_function="GO:0016987 - sigma factor activity [Evidence IEA]" /GO_process="GO:0006352 - DNA-templated transcription initiation [Evidence IEA]" /GO_process="GO:0006355 - regulation of DNA-templated transcription [Evidence IEA]" /codon_start=1 /gene_functions="regulatory (smcogs) SMCOG1032: RNA polymerase, sigma-24 subunit, ECF subfamily" /gene_kind="regulatory" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006135425.1" /locus_tag="B9W62_RS07250" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07265" /product="sigma-70 family RNA polymerase sigma factor" /protein_id="WP_087806305.1" /transl_table=11 /translation="MNTDTRRTATLVPPVEETRHEEELARGLACADEEAFAVIYRRWGA LVHTMATRSLGDTYEAEDVTQQVFVGAWRGRHGFRPERGALGAWLVGITRRKIVDALAA RTRRLALVESAAQDATPTRFVQQAPDEVLDRVLLVEALSRLPHAQREVLCLAFYEDLTQ AQIAERTGVPLGTVKSHARRGLHRLRAAVDRADVHGTGI" gene 21889..22725 /locus_tag="B9W62_RS07255" /old_locus_tag="B9W62_07270" CDS 21889..22725 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016823977.1" /locus_tag="B9W62_RS07255" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07270" /product="DUF4397 domain-containing protein" /protein_id="WP_087806307.1" /transl_table=11 /translation="MTSRTTVAVAASTGACALALGVTAPAIAAPDQAQDQAMVSVFHGI PGMTVDVYANGDELIGDFKPGTVTDPQSLDAGTYDIQVFEAGQGPEGKPALEKQVKVPE GGNATVAAHLSAGGKPELTAFTNDVSKVDAGKARLTVRHVAAAPAVDVRAGGQPVFTGL TNPDEGTAAVDAGTVNADVVLAGTDTVAIGPADLGLKEGTSNVLYAWGSADDKNLALAT QTFSGMESRPNAVHAGGSGAAVTPNSPDQWLAWAAAAGAVTLTGVLLARQVSGRRG" gene 22949..23374 /locus_tag="B9W62_RS07260" /old_locus_tag="B9W62_07275" CDS 22949..23374 /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004981399.1" /locus_tag="B9W62_RS07260" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07275" /product="class F sortase" /protein_id="WP_256976220.1" /transl_table=11 /translation="MRAVGLEARVRPVGVTERGAMTVPEGPAVAGWYRYGPAPGGREGS AVLVGHVDSETGALGEFAALYDIQRGDRVEVRRAAAAPVAYRVVSRTTVPKDELPPSVF RRTGDPVLTLITCAPPFEPERGGYLANLVVTAEPLPE" misc_feature complement(23965..23978) /note="TFBS match to ANR, Anaerobic transcriptional regulator, confidence: weak, score: 18.76" /tool="antismash" gene 24138..24866 /locus_tag="B9W62_RS07275" /old_locus_tag="B9W62_07290" CDS 24138..24866 /EC_number="1.-.-.-" /GO_function="GO:0016491 - oxidoreductase activity [Evidence IEA]" /codon_start=1 /gene_functions="biosynthetic-additional (smcogs) SMCOG1001: short-chain dehydrogenase/reductase SDR" /gene_kind="biosynthetic-additional" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018850966.1" /locus_tag="B9W62_RS07275" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07290" /product="SDR family oxidoreductase" /protein_id="WP_087806311.1" /transl_table=11 /translation="MRWDNHRVVVTAAGRDFGRTLAIRLADLGAEVFLSARRLAAAQRV RDEIRDRGHQRVHAYACDLTDPASIRDFASGVADHTDRVDVLVNNGSRYLAGPDLLSAT DADVVDTLASGATGTVLTTKSFLPLLLNSAKPDVVTMVSACGTPGHHRSDAHDAFYAAK SAQAGFTEILSKRLRPQGVRVISLYPPDFVNADPLSEEWETAPRGAEDALTSQSLVECV LFAVAQPRDCFIKAFHFEQL" gene complement(24944..25453) /locus_tag="B9W62_RS07280" /old_locus_tag="B9W62_07295" CDS complement(24944..25453) /GO_function="GO:0016531 - copper chaperone activity [Evidence IEA]" /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003978000.1" /locus_tag="B9W62_RS07280" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07295" /product="copper chaperone PCu(A)C" /protein_id="WP_087806313.1" /transl_table=11 /translation="MTSATWTPTRRRLADTAPAALAPVCACVLALGGLAVWTATGNAGT PARIGVTDARLFLPSRGVPETAAFFRITNTGGARDRLMEVTSSDVTDGIALSTHRMTAG GAAYRRPVESLPVPADGTLDMSPHSSDVTVPAAARWETGDLVPFTLHFEHSGPVEVLAV VVRPGS" gene complement(25450..27774) /locus_tag="B9W62_RS07285" /old_locus_tag="B9W62_07300" CDS complement(25450..27774) /codon_start=1 /gene_functions="other (smcogs) SMCOG1037: heavy metal translocating P-type ATPase" /gene_kind="other" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011027397.1" /locus_tag="B9W62_RS07285" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07300" /product="cation-translocating P-type ATPase" /protein_id="WP_087806315.1" /transl_table=11 /translation="MAGGPASVREVTDLAVGGMTCAACVKRVEKKLAGLDGVSASVNLA TGRARVHHPPEVLPEQLVAAVERAGYTAAPPEPPAERRGGSGNRGGSGHGDGSGDDTDT EAQRERNRLAVTALLAVPVLVLSMVPAWQFRNWQWLCFVLAAPVVVWGARPFHQRAARA LRHSASTMDTLVSLGVVASFAWSTYALFLGGAGDPGMRMPFSLVPAASDGVAHIYLEAA VGVPLFVLAGRHLEARARRGTGAALRSLAELAVKEVAVRDDAGERLVAIDALRVGQVFV VRPGERVATDGTVAEGSSAVDLSLVTGESEPAEVGFGAAVIGGSVNVGGLLAVRATAVG ADTRLARITHLVTEAQAGKARAQRLADQVAGVFVPVVLTLAATVLGFWLGAGADPQAAI TASVAVLVVACPCALGLATPTALMAATGRGARLGVLVSGPRALEGLQHVDAVLLDKTGT LTSGHMSVARITAAPRGIGEEQAMRLAGAVERGSEHPLGQAVTAHARGAAPAGTLPDVT GFAALPGRGVRGEVEGRLVEVLAPDDDLPAPLAEALTAAEAAAHTAVVVRVDGVTEALV EVGDVLRPGSYRAVDRLRRLGVRPVLATGDREAPARAVAAALRIDDVHARCTPEDKARL VRRLQDEGCRVAVVGDGVNDAAALAGADLGIAMGTGTDAAIGAADVTLVRGDIDALADA VRLSRSTLATIRVNLLWAFGYNAVTVPLAAVGLLTPMPAAAAMSVSSLLVVGNSLRLRA WQPSPARTRRSATPARGRSLR" gene complement(27774..28028) /locus_tag="B9W62_RS07290" /old_locus_tag="B9W62_07305" CDS complement(27774..28028) /codon_start=1 /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019521399.1" /locus_tag="B9W62_RS07290" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /old_locus_tag="B9W62_07305" /product="hypothetical protein" /protein_id="WP_179235381.1" /transl_table=11 /translation="MAQTRRSASGAPDGPVRRLLPPPALCGFLVLLLLVFAASYAVGRG IGPVAPGMHGPGITQDGHEGGGTDSEDGGMGGMNHGGGH" misc_feature complement(28147..28162) /note="TFBS match to CRP, Cyclic AMP receptor protein, confidence: medium, score: 17.13" /tool="antismash" ORIGIN 1 accgtcggtt tccggcggac ggtcgtgtag cgacatcacg tcgggtaccc gctcgttgcc 61 cccgccggct tccccggtac cgacgcaccg tgagggaagg aagcaacctc atgcagcacg 121 acgagatgac aggaaaagtg caggctcttg ctcagttgcc ggaccggggt tcggccgaac 181 gggcgacacg ggccgtactg gagacactgg ccgaacgact gccgtccgca ctcgcgaacc 241 acatggccgc ccagctcccg ccgaccctcg ccgcctccgt acgccagagg acggattctg 301 ccgccgacgg ccacggatcc acctcgggtg agcggttcga cctcaccgtg ttcgccggcc 361 gcatcgcagg ccgcgcggca acggacgagg agacggccat acgagaggcc gcaggcgtcc 421 tggaggtact cgacgccgcc ctcactccgg agttgacaga gaaaatggcc ggtgtgctgc 481 ccgcggacat ccgcgaactt ctgcccgcaa gccgcgccgt ggacgagagt ggctgaagac 541 cggcgccctg tgaccgacgg tgagtccggc tcctcgactt gggcggccac cgctgcgtgc 601 tgcccgccgc gtccaccgcc gtacggcgtg gctcccgccg caggcgtcac ggtgctgcct 661 tcctcgacgt caccgcgggc atgacgctcc atcctgatca gtaggaggat gccgtgtttg 721 aggaaccttt cggggccgtc ctggtggtcg atgccggatc gtcgagcctg cacctgaccg 781 tgttcgacga cgacctcggc gtcctcgccg aacgcgacag ttcctccgca cccggcgacc 841 atgccgtcgg tctgctgcgg caactgctgc atgaggctcc ggcacccgtg gctgtcggcc 901 accgcgtcgt ccacggcggc ccggcgctgc gggagcatct gctggtggac gatcacgtac 961 gcgacgccct ggacggggcg gccgacatgg cgccgctcca cgtcccgccc gcgctcaccg 1021 tgctggacgc cgcccgggac ctgctcccgg acatcccgca cgccgcttgc ctggacaccg 1081 ccttccacgc cggcctgccc gccgccgccc gggagtacgc ggtgcccggg gcgtggcgcg 1141 agcggtacgg gctgcgccgt tacgggttcc atggtctgtc ctactcgtgg gcactcggcc 1201 gggctgctga gctcctgggc cgacgccccg aacgattgca ggtggtgatc gcccacctcg 1261 gtggcggctg ttccgcctgc gcggtgcgcg acggccgcag tgtcgacacc accatggggt 1321 tcactccgct ggaggggctg gtcatggccc accggagcgg cagcatcgat cccggcgcgc 1381 tgacgtggct gcagaccagg caccacctgt ccgccgagga ggtcgacacc gcgctgaacc 1441 aggggggcgg tctactcgcg ctctccggca cgtcggacga cacccgcgac ctcgtacgcg 1501 ctcgtgccgg gggcgacgag cgcgccggct tcgcgttgga cgtcttcacc caccactgtc 1561 gccgtggcat cgccgcgatg gccgcttccc tcgaccggct ggacgccctg gtcttcaccg 1621 gggagatcgg cgaggaccag cccgaggttc gcgaggaggt gtgcgcgcgg ctcaccacgc 1681 tcggtctgac gagcgggctg caggtgcccg aaggtgcgac cgtggaccgc ccgaccgtgg 1741 tcagcgcacc gggagccgcc gtcccggtgg tcgtcgtgcc cacgggtgag gggcggcagg 1801 tcgaccgcga gacgcggacc ctgctgcggc agcaccaggc gtgacaggga gccgttccgg 1861 gggtcgatac cggtagtaca ggccgccggg caggccgtac tcggtgtcgt cggcgagcgt 1921 cttcctcgtc gcccgtcctg aagacggtcg cgaccggcac cgtcctcgac gacgtcgctg 1981 cccctctcgc gggaagaatc gcgttgctcg ccgttgacag caggcccggc gacaggccgg 2041 ttccgagcca caccgtcgtg ccgacactgg tgccgcagcc tgcgatgctg gaactggccg 2101 acgaaggctc tcctgctgct ggaagaggtg gccactatga acacgtccga gcgcacgtcg 2161 agaatgctca ccgagctgct ccgcgacagc cacctggccg ccttcgacga gcttccttcc 2221 ctggtcgccc ggtacaccga acaggcaggc atgcacgatg tacggatcta cctggccgat 2281 ctacgccagg agatcctgcg ggaagtcacc gggaaggggc tcagcgcagc gggtggcggc 2341 gaggtctacg ccgtcgaagg cacactgccg gggcgggcct acaccagcat ccggccccaa 2401 ctgttcgcga acggcgagca ggatcgctgg tggataccga tactggacgg taccgagcgg 2461 ctgggcctgc tgtgcggaag tctgaccgat gacgccgatc tcgaacagct tcaggcggtc 2521 gcctccctgg tgggcctgct ggtggtcagc aaacgaccca gcagtgacgc ggccgcacgg 2581 ctcgcacgca cgcaggcgat gaacgtggcg gcggaactgc aatggaatct gatgcctccg 2641 cgttcgttcg ccaacaagga cgtggtgatt tccgcggcca tggagccggc ctatgagatc 2701 ggtggagatg ccttcgacta tgcgatcgcc gacggccgag tgcatctcgg catcttcgac 2761 gccatgggtc acaacagtca cgccgggctg gccgccaacc tcgtggtggc cgcctgccgc 2821 aaccagcgcc ggcaaggaac ggatctggtc gccctcggag aacgggtgga ggagatcctc 2881 ctcgagcact tcgcacacga aacgttcgtc accgccacgc tggccgagct caacacctcg 2941 accgggctgc tgacctggat caatcgaggc catcacccgc cggtcctcat ccgggccgga 3001 cgctggacga ccgttctgca ctgcccaccc gcgcatccgc tcggcgccgg gttggaaacc 3061 accgcgacgc tgtgccggga acagctcgag cccggtgacc ggatcctgct gtacaccgac 3121 ggcatcaccg aggcccgcga caaacagggc caagagttcg gcctcacccg gttcaccgac 3181 ttcatcatcc gccaccacgc cgacggcttc cctgtcccgg agactctgcg ccgcctcatg 3241 cgcgctgtcc tgcaccacca cgacgggcga ctcaacgacg acgccacggt cgtgtgcgtg 3301 gagtggcacg gtcccagccg caacactcag ctcagcgcgg cgcctccgcc ggttccgccg 3361 cccggacggt cctgacgcgg atgacgcacc cctcagcggc ccggatgccg gaggggcgag 3421 ggaccaccgg accgctggca gacagtttgc gaccgcgccg gctgggaact cgtcgagcgc 3481 cacaacggat gccaggcact gacgacgccg cacccctgct caccagcacg tggcgtgtgc 3541 gccacgcacg aacgcggcgc tgcccggccc accacgagag gaccggccgg atgcccgaca 3601 ggagcacagc gggacgccac gacggcgtgc ggacggtgcc catgctcctc ttcgacggcg 3661 actgcggctt ctgcaccacg gcagtgcgat ggatcgaacg gaacgtgagt ccccattgtg 3721 aaagcgtcgc ctggcagcga gccgacctgc gaggcatcgg cgtgacgcga cagcgtgctc 3781 agcgcgaggc actgtgggtg acaccggctg ggacggtcta cggcggcgca gacgctgtgt 3841 cgaagcttct cctgagcgcg ggcggcggat ggagcctgct cggggccctg ctcatggtga 3901 agccggtacg ccgggtcgcg cacggcgtct accggctcgt ggcagacaac cgctcgcggc 3961 taccggggac gaccgacgcg tgctcccgtt ccaaagcgcg tacacgtcac acgtgaacgc 4021 ggcaccggcg cctcggccgg ccggaccgtc acggacccgg acgggcgcac cgtccgcacg 4081 gcggcgtgtt ccgcaagcgt tcgccacagc agcccgcgcg ggcgcagcag tgcctcgtga 4141 acagaaggcc gggccgggga ggttcacttt tcgtacgtcg acaaggccag accgatcgcg 4201 aagaaggtcg gagcccattc cccgacgaag atgccccaac ggtcggcgcg ttcttctccc 4261 ttcttctcca cgcgcaggga agtgcaccag gtgaccactg acaacccgat ggaaacgaag 4321 ccggccgcgt aagcgtgctc ggctcggatt ccggactcgt ggagcttctt gaccagcatg 4381 accgactcac cttcagacgt gggggtttcc cctcattcca tccgcactgc ggccgcccgg 4441 cgagccacgc gccgccaaac ggtcaaagga cgtgtcaagg ggcgacaccc agtgaggccg 4501 atgcaccacc gactccatcc gtgcgccggt cagaagatgc accaaccgcc gactgcggcc 4561 atcgtggcgt cacacgtccc ctttacgatc aacgctcggg caggacacat ggacaagaag 4621 caactcgccg aggaaaccat caacaccgcg gcgaacggag gcaaggacgc accactcacg 4681 tccaccgacg tggagcgggt gttcgacctc ctcttcggca ctgtggagca ccccggctcg 4741 atcgccgagg cgctgaacag gcgcgagaac gtctctctgg gcagtttcgg cagcttccgc 4801 atggacggtg ggacagccac gttccgcccg ggcacagcgc ttaccgagtt cctgcagaac 4861 aagacaggat gaacgaaccg ccgccgctcg gctgccgaac cgaggctacc tcgagggttt 4921 cgtgccgtgc tcatcgtcac ccgccgggag cgggcgtcgc acagaggagg tgcatgccgc 4981 ccaggccagc cccgcggcca ggagatcctc ggcggcggcc gcgactgcga cagaccagcc 5041 cggtgagctt gctgctccgc gccaacgcgc gccggcgaac gcgccggcca cggccgtcga 5101 gacacccgtg gccacggccg tgaacggcgg gagtgcccca cgacgctgtg cgagtgctgc 5161 gccggcgagc gcgccaagca caacccgagg ccccatgccc tgagcgctca gccgactggg 5221 ggtggacggc gctttgtcgg cgacgaactc cgctgccgtg gtgaagcagg tgagagcacg 5281 ggcccacggg cctgtcagga gcgccgaggg cagtggccgt cccgcggccg gcggggtact 5341 ccaggacagc gccgcgactc cccactgact gcgcagacca gtagccgcgc cgatcagccc 5401 cactctgatc agggtctccg ccgccgggtt catgagcggt gctccaccgc ggtgacgagg 5461 ccgctcagcc acacgggcat ccgggcgccc gtgcgccgcg gcatggagac ggtggctccg 5521 gccccgcagc cgacagcgat ggccgacagg gcgcccgggc cggtgagccg gttgtccctg 5581 accgtccccg gatgcaccgc ctcctccgcg gcgagccgcg ccacgacctg ggcaggcacc 5641 tcgcgggcgg accgtccccg tcggaccgtg tcggcgtagg tcacagcgtt gcgtacggtc 5701 gtagaggcga cacccgccgc gccaccgtgc accatgctct ggatcatgct gcgccgggtt 5761 ccccgaccgg ggcaccccaa tcgtgcacgg cccgctcacg gttcgcccca gcgcgagtga 5821 accgactccc acccgatcaa ggccgtgacc cgggacgtgc agaggacgac cccaagcccg 5881 gtctgcgcgg cgtcaccgtc gtcgtccggc cgtacgggaa cccgctgccc cgcggagcac 5941 cgccgcgtct tgcgctccgc cgggcgcagt cgcggaggcc ggttgtagtc gagcccggct 6001 ccgtcctccg accacttcag ggcggcgtcg ggaggcagct gtgagccacc gcgcccgacc 6061 acccacggaa atggccgcac ggtgccacac aaccgtccat ggggcgtccg tcgcacccac 6121 gtctcgacgt gatcctcggt gaggtacggc agccggacgg atccgagtct ttcgcggagg 6181 cgggccaacg tcatcgtgta gccgcgggca gtgctctcta cgtcacggtc gggtacctcc 6241 acgcgggccg cctcgtcgcg acgggcgtgc cgatgctgtg ggcgtcatga tctcggtagg 6301 acagtgtgcg gtggagcgag ccggaagggg caggacagtg gcggtgaccg gtcggcggta 6361 cgcggagggc ctgcactgtc cggtgaccgg cgcgtccggg tacgtcggtg cgcacctggt 6421 gccggagctg ctggagtccg gtcaccgggt gcggtgtctg gcccgctcat ccgccaacct 6481 ccgcgaccag ccctgggcgg cggacgcgga gagcgttcag ggagacgtgc ccgacccgcg 6541 ttcggtcgcc gacgccatgc gtggtgccga cgtcgcgtac tgcctggtgc acgccctcgg 6601 ttcgggtgcg cacccacccc atcggagtcc gcgacgtgtt gcgctacctg gtcgggtccg 6661 ccaccatgcc ggacgacgtg gaccgggcct tcgacgtcgg cggagcggac atgcccacct 6721 accgcgaatt gatgcgccgc ttcgccgacg tcgcagggct ccgacggcgg ggcatcgtgc 6781 cggtaccggt cctcgccccc gcggtcgtcc ggccactggg tcggcctggt gacaacggtc 6841 ccgggttgtc cgcctccgtc cacggaactc ccggcgaact cctcccgacg catccgacgt 6901 cgccgtcccc gccgaagcac ggagcactga gcacggagca ccaggcaccg tccccgtacc 6961 actgcgtcag gagtgaggcc atgaccgtcg cggtcgtcct gttcacctcg gacctgcgtc 7021 tgcacgacca tccaccgatg cacgccgcgc tgaaggcggc ggacgaggtc gtgccgttgt 7081 tcgtactcga tcccggcatc cgcacggccc acttcgacgc acccaaccgg cgggccttcc 7141 tcgcggactg cctgaacgac ctcgacgcct cattgcgccg ccgcggtggc cgcctcgtgg 7201 tccgcgcggg gccggtcgcc cgggaggtcc gcgcggtcgc cgccgagtgc ggggcgggcg 7261 aggtgcacat ggccgccggc gtcacggcgt acgcccggca ccgtgaggag cggctgcgca 7321 aggagctcga cgacagcgac gtggcgctgc gcgtgcacga ggcggtgctc accgccctcg 7381 cgccgggagc ggtgaccccc tccggctccg accacttcgc cgtgttcacc ccttacttcc 7441 gccgctggtc ccaggagtcg ctgcgccagc ctctccgggc gccgcgcacc gtccgcgtcc 7501 ccgacggggt gcgcggcgaa ccgctgcccg agcgtgccga cgtctccggc acctcgcccg 7561 gccttccggc cggaggagaa acggctgggc gtgaccggtt ctcgcggtgg tcgcggtccg 7621 gactgtcccg ttacgcggac cggcacgacg acctcgccgg cgatgccacg tccaggctgt 7681 cgccgtatct ccacttcggc gtgctgtccc cggcggagct cgtgcacagg tcccgcgagc 7741 gcggggggcc gggcgcggag gcgttcgtgc gccaggtgtg ctggcgggac ttccatcacc 7801 aggtgatggc ggcacggcct tcggcgtcgg ggaaggacta ccgcacccgc cacgaccggt 7861 ggcggggcga ggacgaggcg gccgaggaca tcgcggcgtg gcgggagggg cgcacgggat 7921 atccggtcgt cgacgcggcg atgcgccagc tgcgccacga gggctggatg cacaaccgcg 7981 cgcggctgct ggcggcgagc ttcctgacca agacgctgta cgtggactgg cggatcggcg 8041 cccggcactt cctggacctc ctcgtggacg gcgacgtcgt caacaaccag ctcaactggc 8101 agtgggtcgc gggcaccggc accgacacgc gcccccaccg cgtcctgaac cccctcgtcc 8161 aggccaggcg gttcgacccg gacggtacct acgtacgccg ctgggtcccc gaactcaccg 8221 gcgtggacgg caaaagggtg cacgagccct ggcgcctccc ggccgggcaa cgcgacgcgc 8281 tcgactatcc ggaaccggtc atcgacctcg ccgacgggct cgcgcgcttc aagcacgccc 8341 gcggccgcga ctgaggaggc acccgtcgtg cactggcttt tcggcgacca actcggcccc 8401 cacttcctca ccccgggcga cgaaggcccc gggcacgaca ctcccttgct catgatcgag 8461 gcacgctccg tcttccgacg acgccgcttc caccgcgcaa aggcacatct cgtgctgtcc 8521 gccatgcgcc accgcgcggc cgaactgggc gaccgcgtca cctacgtgcg ggccgacacc 8581 taccgcgacg gcctcgaccg ggcggcacgc gggcgccggg tcggcctcca ccaccccact 8641 tcacacgccg cgctgcgcct ggttcgcgcc cttccccggg tggcggtggg ccccgctcgg 8701 ggcttcctgg tgccgatggc cgacttcacc gcatgggcgg acgaccacgg cggcaagcgc 8761 ctgcggcagg aggacttcta tcactgggtc cggcgtggcc acgacctgct catggacggc 8821 gggcaacccg cgggaggcca gtggaacctg gaccacgaca accgggaacc cccgccccgc 8881 gacaccactt ccctgcaggt cggccgtccc taccggcccc gcgaggacga catcgacgac 8941 gaggtccgcc acgacctcga ccgctgggag cgcgacgggg acgtctcctt cgtcgggcgg 9001 gacggacccc gcctgttccc cgcgacccgg gcggaggcgc ggcgggcgct gcggcgcttc 9061 gtcgagcacc ggctcgccac cttcggcccg tacgaggacg ccatgctcgc cgccgacccc 9121 gtcatgagcc acagcctgct ctcctcctcc ctgaacctcg gcctgctcga ccccgccgaa 9181 tgcgtcgaga cggccgagcg ggcgtggcgc gagggccggg cgccgctgaa cagcgtggag 9241 ggcttcgtgc gtcaggtcgc cggctggcgc gagtacgtct ggcagctgta ctggtacttc 9301 ggcgaggact accggcgctc caacacgctc cggcacacca ccccgctgcc cgactggtgg 9361 aacgacctgg acgcggacgc cgtccgggcg aactgcctgc acaccgtcct ctcccaggtg 9421 cgcgacaccg gatggacgca ccacatcccc cggctgatga tcctgggcag tcacgcactc 9481 cagcgcggct gggaccccgc cgccgtcacg gactggttcc accgctgctt cgtggacggc 9541 tacgactggg tcatgctccc caacgtcgtc ggcatgtccc agtacgccga cggcggccgg 9601 atgacgacca agccctacac ctcgggcggc gcctacatca agcgcatgag cgacctgtgc 9661 ggtccctgcg cctaccggcc cggcgaccgc accggcgagc gcgcgtgccc gtacaccgcc 9721 ggctactggg ccttcctcga ccggcaccgg gaccggctcg ccggcaacca gcgcatcgcc 9781 cagccggtac ggcaactcga ccggctcagc gacctcacgg aggtacgcga gcaggaacgc 9841 gcacgaggtg acacaccgcc atgagtaccg tgccgttggg ccactgaggt gacgccggtc 9901 ggcaagcggt gtggaaccgg acaggaatca ctgcatccgt gacagggcac ccggcggaaa 9961 cagtgtcgtg cggataacca aacgtgagaa ggggttgtgc atgcccggac tgccgccgac 10021 ggtggcccca ccggccggac gcgtcgacgc ccctctcacc gtgcccgacg ccgcgaacgc 10081 cgtcggcacc gtcctcgagc gggtactcga cgaacgactc cggcattccc ttgccatcga 10141 tcccgtcttc gcccgggagc tggccgaccg cctgatcgcc ctggtcgggc ggggcggcaa 10201 acggctgcgt acggcgttca cccactgcgg ctggcgcgcc gcgggcggct cgggcgacac 10261 cggcgccgtc ctgcgtacgg gagccgccct cgaactgctc caggcgtgcg ccctcgtaca 10321 cgacgacgtg atggacgggt ccgtgcagcg caggggtgcc cccgccctgc acgtcgaact 10381 cgcccgggca cactgggcct cgggcatgca cggctccgcg gaggcgttcg gcacgtcggc 10441 cgccgtgctc gccggtgacc tggcgctggc ctgggcggac gacctgctga cggagaccgc 10501 gctcggcacg ccgcacggct ctctcctgct cggggaatgg cgggcgatgc gcaccgagat 10561 ggtcgccgga cagtacctgg acctgcgcgc ccaggccgcg cgttcgtccg gagtggacga 10621 ggcgctggcc atcgccacgt tgaagagtgc cttgtacacg gtcgcccggc cgcttgcgct 10681 gggcgcgtcg ctggccggtg ccgacgcgca ggtggtggac gcgctgcggg cggcgggccg 10741 gtgcgccgga ctggccttcc agctccgcga cgacctcctg ggtgccttcg gcgacccggc 10801 actgaccggc aaaccgaccg acgacgacct gcgctcccgc aagctcacct acctcctcgc 10861 cgtagccctg caactcgccg acgccgccga cgatcaccag gccgccgcgc gactggcccc 10921 ggatgccgtc ccgcggtccg agcacgccgt acaacgcgtg cgggcggcct tgcaacgcac 10981 cggtgccagg gatctcgtgg aggcgaggat cgaggagctg acggacatga gcctcggtca 11041 cttcgtgcgc accggcgcgc ccccggccgt acaggacgag ttctcgacac tggtccaaca 11101 cgccaccggc gtgcgcccgc gtcgtgccga ggaggtcgcg tgaagagggt gccgggtccg 11161 accgaccacg tcgtcgtggt gggcgccggg ctgtccggcc tggcctgtgc cctgcatctg 11221 ctgggtgcgg gccgccgggt caccctggtc gaacgggacg ccggccccgg cggccgggcc 11281 ggccgggtac gacggggcgg ctacgagctg gacaccggcc ccacggtgct gaccatgccg 11341 cacctggcgg acgaggcgtt cgccgccgtc ggggacagcc tccaccgccg cgtcgagctg 11401 acggctctgc accccgccta ccgggcgtgc ttcgcggacg gttcctcgct cgacgtgcac 11461 accgacggcg aggcgatgga ggcggaggtg cgccgcttcg ccggaccggc gcaggcggcc 11521 ggctaccgcg atctgcggca gtggctggag cgtctgtacc gggcgcagat gcgccgtttc 11581 atcgacacga acttcgactc gcccgcccag ctgctgcacc cggatctggc ccggctggcg 11641 gcgctgggcg gtttcgggcg gctggacggc cgcatcggcc gcttcctgtc cgacgaccgc 11701 ctgcgccgcg tcttctcctt ccaggcgctg tacgccgggg tcgccccggc ccgcgcgctc 11761 gcggcctacg cggtgatcgc gtacatggac accgtggccg gcgtctggtt ccccaagggc 11821 ggcatgcacg cactcccccg cgccatggcc gacgcggcgg cggacgccgg tgccgacctg 11881 cgctggcggg ccgaggtgag cacgctggaa cgctcgtcgg gccgcgtgcg ggccgtccac 11941 ctgacctccg gcgagcgcat cccgtgcgac gcggtggtgc tcacgtgcga gttgcccgcc 12001 gcctaccgac tgctggaacg gacgccccgg aggccggccc ggctgcgcca ctcaccgtcc 12061 gccgtgatcc tgcacgcggg caccgaccgc acctggccgc acctcgccca ccacaccatc 12121 tccttcggcg ccgcctggga gcgcaccttc gacgagctga cccgcacggg cgagctgatg 12181 agcgacccgt cgctgctgat cacccgcccc acgtcgcatg atcccgacct ggcgccgccg 12241 gggcggcatc tccactacgt cctcgccccc tgcccgaaca cgacggtcgg cccctcggcc 12301 gccacctggc gggatctcgg cccccgctac cgcgaccgcc tggccggcga actggaacgc 12361 cggggcctgg acggcttcac ggacagcatc gaggaggaac tcctggtgac cccgctcgac 12421 tggacggcgc agggccatgc ggccggcagc cccttctcgg tggcccacac cttcgcccag 12481 accgggccgt tccgcccgcg caacctcgta cgggaggtgg agaacgtggt actggcggga 12541 tgcggtacga ctccgggcgt cggcgtgccg acggtgctca tcagcggcaa gctcgctgcc 12601 gcccgcgtaa ccggcggcag cgtcgctcgt cccgtgggag cgcgtacgcc cccggccgcg 12661 gccgacgaac cctccgtccc ggcctcggca ggtgccgatg acgcggcgtg agctggacgc 12721 ggccgggatc accgatcccg cgctgcgcac cgcctacacc cggtgccgac gactcaacgc 12781 ccgccacggc aagacctact tcctggccac ccgcctgctg ccgctcgaac gccgttccgc 12841 cgtgcacgcc ctgtacggct tcgcccgctg ggcggacgac atcgtcgacg acctcgaccg 12901 caccctggca cccgaggagc gcgaccggct gctccgccgt ctggagagcg acctcatgag 12961 cgggctgcgc tccggcggcg gcgacgaacc ggtggtccgg gccgtggccg acaccgccac 13021 ccggtacgcc atcgagcccg tcctcttcgc cgatttcatg tcctcgatgc gtgccgacct 13081 gaccgtcacg gactatccga cctacgccga cctgcaaggg tacgtgcacg gttcggccgc 13141 ggtgatcggt ctgcagatgc tgccggtcct cggtacggtc acggtccggg aggaggcggc 13201 accgcacgcg gcagcgctcg gtgtggcgtt ccagctgacc aacttcctgc gggacgtggg 13261 tgaggacctc gaccggggcc gcgtctacct gcccggcgat ctgctggccg cgcacggtgt 13321 ggacaggccg ctgctggagt ggagccgcca caccggccgc accgatccgc ggatccgcgc 13381 ggccctcgtc gccgcggagg ccatgacgcg ggaggtgtac cggacggcgg agccgggcat 13441 cgccatgctc gatccacggg tgcgtccctg catccgcgcc gctttcaccc tctacggcgg 13501 cattctcgac gcgatcgcgg agcaggagta caccgtgctg catcgtcgtg cggtggtgtc 13561 ccgtcggcgt cgcgcggcca cggccgcgac gggtgtgctg cgcgtggccg gcgcccggtg 13621 gcgggctcac gccccgagtc ggaaggcgac ggcgggtggt ccggcagtcg aacggaagga 13681 accggtgcgg tgagtgagcg aacgggcagg cagcgctgga ccccaccgtt gcggctccgg 13741 cgtcccggcc cggactgggc ggcgcagacc ccgacctggc gtcaggcccg accggcactc 13801 atcgccgacg cgctcaagcg ggcgtccgcc cgtccgtccg gaaactggtt cgtgctcggc 13861 gcctcccgga acgtgcgcgc ggacggccgc ccgtacggca ggacggtcgg cggcgtggag 13921 atcgtgctgt ggcgctcgga caccggcgac ctgcgcgcgg ggccgggtgt ctgcccgcac 13981 ctgggcgctc ccctgcgcga cagccgggtc gcctgcggca cgctcgtctg ccactggcac 14041 ggcctcgccc tggacggttc tccgtccccc ggctgggacc cgttcccggt tcacgacgac 14101 ggggtgctgg tctgggtacg gctggaccag gtgggcggcg aggagccgac cgaccggcct 14161 gcggtgcccg tgcggcccgc gacgggcagc ggggtggacg cggtgttcac ggcggtgggg 14221 cggtgcgaac cgcaggacgt ggtggccaac cggctcgacc cgtggcacgg ctcgtggttc 14281 cacccgtact cgttcgtcga cctgacggtg tcacgggagc cgcagggcga ggaggacgac 14341 gccttcgtcg tcgacgtctc cttccgggtg gccgggcgcc tggtggtccc ggtacgggcc 14401 gagttcaccg cgccggagcc gcgcacggtc gtcatgcgca tcaccgacgg agagggggcc 14461 acgtccgtgg tggagacgca cgcgaccccg ctcacaggag cgggccacgc acatccgcgc 14521 accgccgtgg tcgaggcgac gatcgccgct tcggatcggc ccggcttcgc cctcgcgcgg 14581 gccgccgccc ccgtgctgcg cccgttgatg ctccatacgg ccgggcgcct gtggcgcgac 14641 gacctggcct acgccgaacg ccggtgggca ctgcgcagca ccgggcggtt ccccggctga 14701 ccgtacgccc gacgggcacc ggtcaggggg aggagcggcg tccggcgacg gctccgagcg 14761 cccggaggac ggaggaacgt cccgcgcggg ggacggtcca cagcacctgg ccgcgtacgc 14821 cccggtcggc gagcagcgcg ttcgccgcca ggaaaccggt ggtggcggcc cgttccatga 14881 gggcgacggg caggtcgcag cggatggcgt cgccggccag ggtcagccag gggtgggggg 14941 tgcgtacggt ggggcgccgg tggtacgaac cgaccgggaa gagcgggcag tcggagcgcc 15001 actcgtgccg ggcgtcgacg acgcgcgccc ggcgggtctc cgggtacacc tcgtgcagcc 15061 ggtcgacgag catgtcctgc gtctccttcc gttcggcacc cgtgtccacg gcgtacgcgt 15121 gcagctccac caccgaaccg ccggtccgct cggcccagcg ggcggcctcg ccctcgtagc 15181 gttccaggac gctgatgttg tccagcccgc cgtagccgct ggtgccgagg aaccccgggc 15241 ggtcggcgcg gaccggccgg tcgagccaga gccgggagac gaggaacggc ggtgcggtgc 15301 gcagggcggc gaggtcctcg cgccagccgg aggtgcccag gccgggtgac gccccgacga 15361 cccggcgcag tcctccgggg tcgagggcga gcaccaccgc ctggtggcgt cccgtgccgg 15421 tgtcggtcag cacgtccgct cccgcgtcgc cggcgggcag cacgccgtgc acgggcgtac 15481 cggtgcggat gtccgccccg aggcgctgga ggtagccggc gagcgggtcc cagagcgcct 15541 gggggaaggg ttcgctcggc acgtcgaaga gcaggccctc ggccgaaccg aggaagtaga 15601 tgtggaacat cagcagcagt tcggcggcgg acagttcgcg cgggtcggcg aagaagctgc 15661 gcgagaacac ctcgaaggcc agatggtgcg cggcttccgg gaagcggacg ccctccagga 15721 agccggtggc gctgaactcg tcgaagcgtt cgtagacctc gggcacccgc acgtcgagga 15781 gcggtagtgc ggctcgggcg tccatggcgg cgaggtcccg ccagccgaag gtggggctga 15841 gggcgacgaa tccgagcgcg ctgaggggcg gggtcctcgg gacacgggcg aaactgtcgg 15901 tcaggccgcc gctgtgccgc agggggtagt cgggcagcgg ggtgagccgc gcgagggcgg 15961 ggtcggtgcg gcggagcagg ccgcggaggt tgtagtactg gcggaagaag gcgtggaaac 16021 cgcgggtcat ggtgaccggg gagccgtccg ccagccgggt gggccatccg gagagccgtc 16081 cgccgagcga tgcctccttc tcgtacaggg tcacgcgggc gccgcgttcg gcgagcaggg 16141 tggccgcggc gagaccggcg attccgccgc cgatcaccgc gacggtcggc ccttccccgg 16201 gctggaaccg gctcctgccg ggtgcgggca gcagaacctc tgccttgcgg tcgcggcccc 16261 ttcgggcggg atcggtgcgg cgggtggtca ctgagcgtgc tcccttcccg gggacgagga 16321 ggccggcgtc gatgttgccg gggcggtatt cgccggggtc gggccgcctc gggcgacgaa 16381 ggtgtgcacg atgcccgtct gccatccggc gaccggggcg acgcgcaccc cggtgaagcc 16441 cgctctcgtc agccggtcgg cgaaagaggg cgcggtgtcg aagacggcga cactgtgccg 16501 caggtgtcgg tagagggcgc ggtcgccggt cagcgtgccc gccgggacga cgacgccccg 16561 gcacaccgcc gaccacaggg cccggtgcct gagcgcgccg ctgagactgt actcgtggac 16621 ggcgagccgc ccgcccggtc gcagcagggt ccggacgctg ccgaggaccg cgtccgggtc 16681 ggtgacgttg cggaacaggt aggcggcgaa gaccgcgtcg aaggggccct cgccggccgt 16741 cgcgacctcc tcggcggtca ggtgcaggaa gcgcacccgt acgggccacg gcttcgccag 16801 tgcgcggcgg agcatgcccg cggaggcgtc cacagcggtg atccgggccc ggggcgcggc 16861 cctgagcagg gcccgggtgg aggcgccggt gccgcatccg aggtcgagga ggtgcagtcc 16921 cgctccatcg tcgggcaggc gcaggcggcg ggccgagcgc agcaggccgg tgcggtagcc 16981 ggggttgagg gaggtgagac ggtcgtagct gtgggaggcg tggtcgaagg cgcgggccag 17041 gtcgtggtcg cgcaggaggc tcatcggtgg tcgtctccgt ttctcgcggt gcgccgggag 17101 cggcggggca ggaagggtag ttcggccgcc gtgcggagca tcggccggac cggggtgcgc 17161 agtccgatgc cccactcctc gcggagtgag gtggcgccgt cgaggaagcg cagcaggcgc 17221 tcggcgggga cgcggcggaa caacccggtg aagaagtcgg gcccgtcgat ccgtccggtg 17281 tccagggcgc gcagcagtat cgcgtccatg gccagcgccc gccgtccgtg gggagcgggc 17341 aggacggtgc cgtgcccgtc gcgcagcgcg gcggcgacgg ccctgctctg ccgttgcatc 17401 gccgcgaagg tgtagcccgt ggcgggacgg gtggcgccgc ccgctgcacc gatacggaac 17461 acggcagccc cgacgcgccg cggaaagcgg gcgtcggtca tggggatgac cccttgctcg 17521 gcccgctcca cggtgagcgc gccgagcccg aggacgtcgc ggcagtagtg accgagcgcc 17581 gactcgtacg cctcggtgga cagcacgtcg cgggagaact cggtgtactc cacgagtgcc 17641 cggtccggcg tgagcggcag aacgtaaccg aaggcgagcc cgtgcgccgg ctggggcacg 17701 cggaagtcca tcagatcggc gacggcggga tcgaaccgcg cgctgcccgt gcggacgaac 17761 cagccgcgga agtgctggag cagctgcgtc cgggcgggcg gcagggcgcg caacggccgc 17821 gagtcgaaca cgcgccgggc acgcaacgtc agggaccagc cccccggcag ggtgcagcgg 17881 acctccgcgc ccccgggcac acccttcacc gatcccaccg tcgcacgcag gagacgcccc 17941 ccgtccgaac gagccaggcg accgtggatc agccgttcga agtcggtgga gcgcaccatg 18001 cggtagctga acggggccgg gtcgacggtg accggccggc cgtcggcgcc gtgcaggcgc 18061 agccgggacc acgaggcgcc gacgacctcg tcgaggccgt cggcaccctc gccccagtag 18121 caccaggtcc gctccgcggg acgcagcggg ccgtccggcg gctcgatcac ggtcacggtg 18181 gcggccccgg tctccgtcag ccggtgcgcg aggccgagac cggccgcgcc gccaccgacg 18241 acgatgacgt cggagacctc ggtggagcgg ggtgtcaccc tgcccgccgt tcggcgaggg 18301 gcgttccggc cgggcccggg cgcagggcaa ccctcgcgcc tcgggcggct gcttcggcgg 18361 tgatcgcacc acccggcacg ccggtggcac cggccaccgg tactcgggcc tgactcagct 18421 ccacgctcac tcccccgcac acggacacgg acaacaaata gaatcgatcg tcactgcttc 18481 cgcagcgggg cagtgtgcgg atgcgcacct cacggctcgc tcgcgcctca tctctccggc 18541 ccctggtcgg cggacgcctc tcgcgcggcg cgcgctccgg aggccagggc gccctgcacc 18601 gaaccggtcg cccggtggtc cccgcacacg tacctgcccg ggcctagccg agtgctgcgg 18661 ctgagcggcc agggcgggag catcgcgggc aacgcgccct cgatcgtgcg ggccgcgacc 18721 tgatgccagt cgctcgtgtc ggtgccgtac agctcggcca gacgccggag caccgtccgc 18781 cccctgtcgg gcgggtccgt gcccagtacg gaggtggaga ccagtgcggt gccgggcgga 18841 gcgtaggtgg gcgcgacctc gctgaggacg caggtgttga gaaccgctcc ggtgctgtcc 18901 accatcaggg tcggctcggc catcggtgcc ctgtcggtgg cgtggtagta ggtggtcacg 18961 gtgcgggtgt ccgggacggt caggcccggc agcaggcggg ccgcggtcgc cgggtccgtc 19021 gccaccacga cgacccgggc cggcacctcg gtcccgtccc cgagcaggac cccggcgtcc 19081 gtgatctcgg cgacgggcgt accgaggcgc aggacgccgt cgggcagacc gtcggcgagc 19141 cgggcgggca cggcgccgac gccctccgcc ggcaggcaca gcgatccgcg gaccatgctg 19201 cgccagacca ggtggaagaa ccgggcggag gtctccagcc ggtcctccag gaagacgccg 19261 gacaggaacg gccgcaggac gtcggagatc acggcgtccg acagccccgc tctggacagg 19321 gccgccgagg tggggcgatc cgggcgccgt ctggtcgtgg acacgggcag aacggcgtcc 19381 cgtgcggtga gtgccgccag cgcggccaga tcacgggccg acaggatccg gcccggcagg 19441 agtgcccctg ccgcaccggg ttccctggtc ggatcggtga ggcggaccag tccggtcgac 19501 gtatgcgcga tcacaccggc ggtgaacggc cgcaggcgca ggctcctcag gtcgaggcgc 19561 cgcttcacct gcggatacga ggtgttgaac acctggaaac cccggtcgag gaggaacccg 19621 tcccgccggt ccgtgcgcat ccggccgccc acaccgtcgg acgcctccag cagggccacc 19681 cgccatccgg cccggcacag atccagggcg cacgccaggc cggccagacc tgcaccgacg 19741 acgactgcgt ccggtgcccg gcgatgttcg tcgctcatac gtctcctcgg tcgacgccgt 19801 acacgtggcc cccggagccg aaatagtccg cccggcctcc gtctcggcca gtacccgcaa 19861 ccgcccggca cagaccggtg ccggaacgcg gcgccggcac ccgtcggccg ggcgtggcca 19921 cctgtccagc ggaccacgtc ccccggccga caccccggcg ccacgaccga gagtcccggg 19981 acgacaacac ctcacgagaa ctgcgccgac accaacggct ccagaacggc gaccgcttcg 20041 gcgagccccg acggacgcgc cagaccggcg accgtccgtc ccgcccagcc ggggcccagg 20101 gtcagcacga ccggcctgcg acgggcgccg cgcactcccc actccatcgc ggccacgtgc 20161 tgggccagcg gcctgctggc ggtggtgcgc gactgtgccc acagtcccac ggcggccggc 20221 ccggtcctgc gcaccgccgc gacgagcgac tcgaccggca gggcaccgcc gaacatccgc 20281 accggaacac ctcgctcggt cagtgccgcc gacaggacct ccaacggcag ggtgtggttc 20341 tcccccggca cacaggcgag caccgtggtg gcgccgggac ggtccgcgac cgtgcggggc 20401 gcggcgcgcc gaagcgcccc ggagacgtgc caggacagga agtgctctac ctcgacgtac 20461 ttctcgcccg acgtctccca cttgcggccg acggcctgga gcgtcggcac gatcacctcg 20521 gtccaggcgg cgaccagtcc gtgttcggtg atcgcggcga gcagcagttc gtccaggacg 20581 gcggcgtcca ggcgcagggc ggccctggcg attcccctgc actcctgacg tgcgtcgccc 20641 agtcgcaggc cgctacccgc ccggctccgg cgcgcgggcg acgacgccgg tggtggcccg 20701 gcggcgcttc gagacggagc cgccgacggc tcctcgctgc gcgccagccg tgccgcctcg 20761 gccggtggca gcccggtcgc cgtcagcgcg cacatccgtt ccagcctcgc cacgtccgca 20821 gccgtccagc gccggtgccg gccgtccgtg tgggcgtccg gcccgagccc gtaccggcgg 20881 tcccacgtac ggaccgtggt gggcgccacc cccagtcgcc gcgccacctc tccggtggtc 20941 agcccgcctt cctgccggtc gccgtcgccg tcgccgaacc ggtcctcgcc gctttccctc 21001 tgtgtgctca cgcctccacc gtacgacgca caaacgacgc atgttgcctg cgtcgtttca 21061 gcctcgaaaa ctggagcaga gcgcacggca gccgcgcagg gcagaccggc ggacccgccg 21121 ttcggcggac ggcccgttcc gtccccgtac gacgcacggc aggaggaacg gccatgaaca 21181 cggacacacg acggaccgcg acgctcgtcc cgcccgtgga ggaaacgcgg cacgaggagg 21241 aactggctcg tggcctcgcc tgcgccgacg aggaagcgtt cgcggtcatc taccgccgct 21301 ggggagcact cgtgcacacg atggccaccc ggtccctcgg tgacacgtac gaggccgagg 21361 acgtgaccca gcaggtcttt gtcggagcct ggcgcggtcg gcacggcttc cgtcccgagc 21421 ggggcgcgct cggcgcgtgg ctggtcggga tcacgcgccg caagatcgtc gacgcgctgg 21481 cggccaggac acggcgtctg gccctggtcg agtcggctgc tcaggacgcg acaccgaccc 21541 ggttcgtcca gcaggcaccg gacgaggttc tcgaccgggt gctgctcgtg gaggccctgt 21601 cccggttgcc gcacgctcag cgggaggtgc tgtgtctggc cttctacgag gatctgacgc 21661 aggcccagat cgctgagcgc accggtgtcc ccctgggcac ggtgaagagc cacgcccgtc 21721 gcggcctgca tcggttgcgc gcggccgtcg accgggccga cgtgcacggc acgggcatct 21781 gaccggggcg gagcagtgac gccccgggcg catccacggc accgcccacc ggcgaaaccc 21841 atgtgagacc cacagcaggc aaagcctccc tttgaaggga tgatccccat gacctcccgg 21901 accaccgtcg ccgtcgccgc ctccaccggg gcctgcgccc tggcactcgg ggtgaccgca 21961 cccgccattg cggcccccga ccaggcacag gaccaagcca tggtgtccgt ctttcacggc 22021 atccccggca tgacggtcga cgtctacgcc aacggcgacg aactgatcgg cgacttcaag 22081 cccggcacgg tgaccgaccc gcagtccctc gacgccggga cctacgacat ccaggtcttc 22141 gaagccggcc agggccccga gggcaagccc gccctggaga agcaggtgaa ggtacccgaa 22201 ggcggcaacg ccacggtcgc cgcccacctc tccgccgggg gcaagcccga gctgacggcg 22261 ttcaccaacg acgtctcgaa ggtggacgcc ggcaaggccc gcctgacggt ccgccatgtc 22321 gcggccgcac cggccgtgga cgtgcgggcg ggcgggcagc ccgtcttcac cggcctgacg 22381 aaccccgacg agggcaccgc agccgtcgac gcgggcaccg tgaacgccga tgtcgtcctc 22441 gccggcacgg acaccgtggc catcggaccg gccgacctcg gcctcaagga gggcacgagc 22501 aacgtcctct acgcctgggg cagcgccgac gacaagaacc tggctctcgc cacccagacg 22561 ttcagcggca tggaatcgag gcccaacgcg gtccacgcgg gcggcagcgg cgccgccgtc 22621 acgccgaact cccccgacca gtggctggcg tgggccgccg cggccggagc ggtcacgctg 22681 accggcgtgc tgctggcccg ccaggtctcc ggccgacgtg ggtgaggccg cgcgacggct 22741 gaccggactg gcgacggcgg ctgccgctct ggccaccgtc ctcgccgttt cgggcgggcc 22801 gtcggcaacc gccccggccc cgcccgactt cgggcccgcg ccctcgcaag ctgcgcagca 22861 gcagccggcg gcggcggcgg ccccgttgct cagcagtaca cagtcgcggt ccgacgccgc 22921 tgcggcagcg ccgcccgagc gtgtcgaggt gcgagccgtg ggcctggagg cccgtgtacg 22981 gccggtgggc gtgaccgagc ggggggccat gaccgtcccc gagggccccg ccgtggccgg 23041 ctggtaccgc tacggcccgg cccccggggg ccgcgaggga tcggccgtac tcgtcggaca 23101 cgtggacagc gagacgggcg ccctcggcga gttcgccgca ctttacgaca tacaacgtgg 23161 tgaccgcgtc gaggtgcggc gagcggcggc cgcgccggtc gcctaccgcg tcgtctcgcg 23221 caccaccgtg cccaaggacg agctgccgcc ctccgtgttc cgccggacgg gcgaccccgt 23281 cctcacactg atcacatgtg cgccgccgtt cgaacccgag cgcggaggct acctcgccaa 23341 cctcgtcgtc accgctgagc cgctgcccga gtgacagccg cacgcgccgg cggctgcgcg 23401 cgcagagacc aggaagcccg agacacagga gagcgcagtg ggtcacgtgg agccgtccca 23461 cctggtggaa ctggcactcg gtcatgtgag cggcgccgag gacgccgacg ccctgcggca 23521 tgtcgcgtcg tgcccgcgct gtcgcggtga actcgccgag acgacccgcg tggtggccgc 23581 cgcgcgtggt gcgcgggcgg gggacctgcc ggcggctccg cccgagcgcg tgtggcagcg 23641 cgtcacgcag gaggtgttcc gtcgtgccga caggctcccg caccccggag agcaccctgc 23701 aggtcggcct gcacctgagg ggaagcgcgg tgtccggagc ctacggacgg tcggctccgg 23761 ggcccgcgag ggcttcttcg cactggctct ggccaccgga gtccttctcg tccggtggtt 23821 gcggatccgg gctggtcgcg ggcctggaag agctccgggg gtccgagccg cttctgggcg 23881 aaggcacgac cctcccggtt gaggatgaca gcgccgacgg tccagacctc gcccgcgccg 23941 gggctgggca cttcgacatc atggttgatc gtgaacgacc gtgtgtgagt ggccatgacc 24001 gatagcgtag ggctggcttt cgtccggctt ccaggcgtcg tggtggtcgg cggcggcgtc 24061 atccgtccgg taccgggccg cccgtgccgc cgcgatgcac cgcacccctc cgatcaaaga 24121 gaacctgggg cgacaccatg agatgggaca accaccgcgt cgtcgtcacg gccgccggcc 24181 gggacttcgg acgtaccctg gccatccgcc tcgcggacct cggcgccgag gtcttcctct 24241 cggcgcgccg gctcgccgcc gcccagcgag tccgcgacga gatccgcgac cgcggacacc 24301 agcgggtgca cgcctacgcg tgcgacctga cggatcccgc ctcgatccgc gacttcgcct 24361 ccggcgtcgc ggatcacacg gaccgtgtcg acgtactcgt caacaacggc tctcgctacc 24421 tggccgggcc ggacctgctg tccgcgaccg acgccgacgt cgtcgacacc ctcgcctccg 24481 gcgccaccgg cacggtgctg accacgaaga gcttcctgcc cctcttgctc aactcggcca 24541 agccggatgt cgtgacgatg gtctccgcat gcggaacgcc gggccaccac cggtccgacg 24601 cgcacgacgc cttctacgcg gccaagagcg cccaggcagg gttcaccgag atcctctcca 24661 agcgcctgcg accgcaagga gtgcgggtga tctcgctcta cccgcccgac ttcgtcaacg 24721 ccgatccgct ttccgaggag tgggagaccg cgccgcgcgg agccgaggac gccctcacgt 24781 cccagtccct cgtggagtgc gtcctcttcg ccgtcgccca gccccgcgac tgcttcatca 24841 aggcgttcca cttcgaacag ctgtgaccca cgctcgcccg cggccgcacg acggctgggg 24901 ccgggccgcg tctcacggcg gctcccgcgt cgcggcggcg gcgtcaggaa ccgggccgga 24961 cgaccaccgc caaaacctcc acgggcccgc tgtgttcgaa gtggagggtg aagggcacca 25021 ggtcgcccgt ctcccagcgt gccgcggccg ggacggtcac gtcgctgctg tgcggtgaca 25081 tgtcgagggt gccgtcggcg gggacgggca gtgactccac gggccggcgg taggcggctc 25141 cgccggcggt catccgatgg gtgctgaggg cgatgccgtc ggtgacgtcg gacgacgtca 25201 cctccatcag ccggtcccgg gcgcccccgg tgttggtgat cctgaagaac gccgccgtct 25261 ccgggactcc ccgggacggg aggaagagcc gcgcgtcggt gacgccgatg cgggcggggg 25321 tgccggcgtt gcccgtggcg gtccagaccg cgagaccgcc cagggccagt acgcaggcgc 25381 agaccggcgc cagagcggcc ggcgcggtgt ccgcgaggcg gcgccgggtg ggcgtccagg 25441 tggcggacgt catcgaaggc tcctcccgcg ggcgggcgtg gcggaccgac gcgtgcgggc 25501 cggtgacggc tgccaggcgc gcagccgcag gctgttgccg accaccagca gcgagctgac 25561 ggacatcgcg gcggcggccg gcatcggggt gagcagaccg accgcggcca ggggcaccgt 25621 cacggcgttg tagccgaagg cccacagcag attgacgcgg atggtggcca gcgtgctgcg 25681 ggagaggcgg acggcgtccg cgagggcgtc gatgtccccg cggaccaggg tgacgtcggc 25741 ggctccgatc gccgcatcgg tgccggtgcc catggcgatg ccgaggtcgg ctccggccag 25801 ggcggcggcg tcgttgacgc cgtcgccgac gacggccacc cggcagccct cgtcctgtag 25861 tcggcggacg aggcgggcct tgtcctcggg ggtgcagcgg gcgtgtacgt cgtcgatgcg 25921 cagggcggcg gcgacggcgc gggcgggtgc ctcccggtcc ccggtggcca gcaccggccg 25981 tacgcccagg cggcgcagcc ggtccacggc ccggtagctg cccggccgca gtacgtctcc 26041 gacctcgacg agcgcttcgg tcaccccgtc gacgcggacc acgacggctg tgtgggcggc 26101 cgcctccgcg gcggtcagcg cctcggccag cggtgcgggc aggtcgtcgt ccggggccag 26161 tacctccacc aggcgtccct cgacctcgcc gcgcacgccg cgtcccggca gagcggcgaa 26221 gccggtcacg tccggcagcg tcccggccgg cgcggcgccc cgggcgtggg ccgtgacggc 26281 ttgtcccaac gggtgctccg agccccgctc caccgcgccc gcgagcctca tcgcctgttc 26341 ctcgccgatg cctcggggcg cggccgtgat ccgggcgacg ctcatgtgcc ccgaggtcag 26401 ggtgccggtc ttgtcgagca gcaccgcgtc gacgtgctgg agtccctcaa gggcccgcgg 26461 accactgacc aggacgccca gtcgggcgcc gcgtccggtg gcggccatca gggcggtcgg 26521 ggtggcgagg ccgagggcgc agggacaggc gacgaccagg acggcgacgg acgcggtgat 26581 cgcggcctgc gggtcggcgc ccgcgcccag ccagaagccg aggaccgtcg cggccagggt 26641 gagcacgacc ggcacgaaga cgccggcgac ctggtcggcc aggcgctgtg cccgtgcctt 26701 gccggcttgc gcctcggtca ccaggtgggt gatgcgggcc agtcgggtgt cggcgccgac 26761 cgcggtggcc cgtaccgcca gcaggccgcc gacgttcacc gaaccgccga tcacggcggc 26821 gccaaacccc acctcggccg gctcgctctc cccggtgacc agggagaggt cgacggccga 26881 actgccttcg gcgaccgtgc cgtcggtggc cacgcgttcc ccgggccgta cgacgaagac 26941 ctggcccacc cggagagcgt cgatcgcgac gagacgctcc cccgcgtcgt cacgtacggc 27001 gacctccttg accgccagtt ccgccaggga gcgcagggcc gctccggtgc cacgacgggc 27061 ccgtgcctcc aggtgccggc ccgccaggac gaacaggggg acgccgaccg cggcctcgag 27121 gtagatgtgg gcgacgccgt ccgacgccgc cggcaccagg ctgaagggca tccgcatgcc 27181 cgggtcgccg gcgccgccga ggaacagggc gtaggtggac caggcgaagg aggccacgac 27241 ccccagcgac accagcgtgt ccatggtcga cgccgagtgg cgcagcgccc gcgccgcccg 27301 ctggtggaag ggccgggcgc cccagaccac gacgggcgcg gcgagcacga agcacagcca 27361 ctgccagttg cggaactgcc aggcggggac catcgacagg accagcacgg ggacggcgag 27421 cagggccgtg accgccagcc ggttgcgttc ccgctgcgcc tccgtgtccg tgtcgtcgcc 27481 gctcccgtcg ccgtggccgc tcccgcccct gttgccgctc ccgccgcgcc gctctgcggg 27541 cggttcgggc ggcgcggcgg tgtatccggc ccgctcgacg gcggcgacga gctgctccgg 27601 gagcacctcg ggcggatggt gcacccgggc ccgcccggtg gcgaggttca cgctcgcgct 27661 gacgccgtcg agcccggcca gcttcttctc cacccgtttc acgcaggccg cgcaggtcat 27721 gccgccgacc gcgaggtcgg tcacctcccg tacggacgcc ggtccgccgg ccatcagtgc 27781 ccgcctccgt ggttcatgcc gcccatgccg ccgtcctccg agtccgtgcc gccaccctca 27841 tggccgtcct gggtgatccc gggaccgtgc atgccgggtg cgacggggcc gatgccccgg 27901 ccgacggcgt aggaggcggc gaacaccagg agcagcagca cgaggaaccc gcacagggcg 27961 ggaggaggca agagccgccg caccgggccg tccggcgcac cggatgcgga tcgccgtgtc 28021 tgcgccatct gtcgactctc ccgatcgctc gcgtctcgcg cctcgcgcct cgcacggctg 28081 ccaccggacc gagccgcacc ggtacaggag tcggacgcgc cggcacttca gttcccccgg 28141 gttgcatgtg acgcctgtca ccggacgggg ccggagtgcc ttggagcgga cgctgcccgg 28201 tcactcccta gacgaacagg gcgaacatca ccgccatgcc cagggccatc acggcgtggc 28261 aggccgggtc gagtgcgg //