close all open/close all

CDS information : Ncarz_00330


close this sectionLocation

Organism
StrainATCC 15944
Entry nameNeocarzinostatin
Contig
Start / Stop / Direction58,477 / 52,544 / - [in whole cluster]
58,477 / 52,544 / - [in contig]
Locationcomplement(52544..58477) [in whole cluster]
complement(52544..58477) [in contig]
TypeCDS
Length5,934 bp (1,977 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.1 PKS
Productpolyketide synthase
Product (GenBank)warhead-forming iterative polyketide synthase
GenencsE
Gene (GenBank)
EC number
Keyword
  • iterative
  • enediyne core
Note
Note (GenBank)
Reference
ACC
PmId
[15797213] The neocarzinostatin biosynthetic gene cluster from Streptomyces carzinostaticus ATCC 15944 involving two iterative type I polyketide synthases. (Chem Biol. , 2005)
[12536216] A genomics-guided approach for discovering and expressing cryptic metabolic pathways. (Nat Biotechnol. , 2003)
[18223152] A phosphopantetheinylating polyketide synthase producing a linear polyene to initiate enediyne antitumor antibiotic biosynthesis. (Proc Natl Acad Sci U S A. , 2008)
[25019332] Enediyne polyketide synthases stereoselectively reduce the beta-ketoacyl intermediates to beta-D-hydroxyacyl intermediates in enediyne core biosynthesis. (Org Lett. , 2014)
Related Reference
ACC
Q8GME1
NITE
C1027_00520
PmId
[12183628] Biosynthesis of the enediyne antitumor antibiotic C-1027. (Science. , 2002)
[12536216] A genomics-guided approach for discovering and expressing cryptic metabolic pathways. (Nat Biotechnol. , 2003)
[14528002] Rapid PCR amplification of minimal enediyne polyketide synthase cassettes leads to a predictive familial classification model. (Proc Natl Acad Sci U S A. , 2003)
[18223152] A phosphopantetheinylating polyketide synthase producing a linear polyene to initiate enediyne antitumor antibiotic biosynthesis. (Proc Natl Acad Sci U S A. , 2008)
[25019332] Enediyne polyketide synthases stereoselectively reduce the beta-ketoacyl intermediates to beta-D-hydroxyacyl intermediates in enediyne core biosynthesis. (Org Lett. , 2014)
ACC
Q84HI8
NITE
Dynm_00180
PmId
[18328078] The biosynthetic genes encoding for the production of the dynemicin enediyne core in Micromonospora chersina ATCC53710. (FEMS Microbiol Lett. , 2008)
[22589546] Crystal structure of the acyltransferase domain of the iterative polyketide synthase in enediyne biosynthesis. (J Biol Chem. , 2012)
[12536216] A genomics-guided approach for discovering and expressing cryptic metabolic pathways. (Nat Biotechnol. , 2003)
[25019332] Enediyne polyketide synthases stereoselectively reduce the beta-ketoacyl intermediates to beta-D-hydroxyacyl intermediates in enediyne core biosynthesis. (Org Lett. , 2014)

close this sectionPKS/NRPS Module

A1 acetyl-CoA
malonyl-CoA
KS2..412
AT619..835
ACP956..1057
KR1237..1420
DH1489..1644
PPT1746..1956

close this sectionSequence

selected fasta
>polyketide synthase [warhead-forming iterative polyketide synthase]
MTRIAIVGMACRYPDATSPAELWANAIAGRRAFRRLPEERIRLEDYWDADPSTPDTFYAR
NAAVLEGYSFDRVTHRIAGSTFRSTDMTHWLALDTAGRALADAGFPAGEGLPHERTGVVM
GNTLTGEFTRANVMRLRWPYVRRVMAAALAGQQDWDEARVTAFLEEVETSYKAPFPPVDE
DTLAGGLSNTIAGRICNHFDLNGGGYTVDGACSSSLLSVTTAGTALVNGDLDVAVAGGVD
LSIDPFEIIGFAKTGALARGEMKLYDKGSNGFWPGEGCGVVVLMREEDAIARGHRIYATV
AGWGVSSDGQGGITRPEVDGYRLALERAYARAGFGIETVPLFEGHGTGTAVGDATELAAL
IKARSAADPQAPVAAIGSIKGMIGHTKAAAGVAGLIKAALAVDNQTLPPSIGTSDPHELL
TEPGANLKALRKAETWPRELPRRAGITAMGFGGINTHVVLDEPSGRRRPASVRRLTPLAD
SMQDSELLLFEGASARELSHRLSEVADYTVRLSYGEIADLAATLQRELRGLPHRAAAVVT
SPDDAENRLRHLADLLDRGETEHWAADGRTLLGKATGRKRIGLLFPGQGSGRGTGGGALS
RRFPEVAEVLARAGSAAGSDTVATEVAQPRIVTGSAAGLRVLDELRVEASVGIGHSLGEL
SALCWAGALDEDVLIEAAGVRGRAMAEHGSSGTMASLGAAPEQAEELIGALSVVVAGYNG
PQQTVVSGPVHEVEEVRRRAARSGVTCTPLAVSHAFHSPLVASAAESFGNWLKSVDFREP
AGRVVSTVTGAELTPGTDLSALLREQITAAVRFTEAVRAAAQDVDLFIEVGPGRVLGHLA
GTATNIPAVSLDTDDESLRSLLQVVGAAFVVGAPVAPERLFRDRLIRPLRIGQELSFLAS
PCEQAPATTLPVSRRSAQPPAVPADREQEPQPAAVSPPAAQNSPASNDTSTASTASTAGS
ERTPQEEESIGAKALDVLSALVVERAELPAHLVDPDSRLLDDLHLSSITVGQIVNQAMAQ
LGIAPAAQEPTNFATATLAELAEALESLASTGGPADAGAASFIAGAAPWARPFAVDLDAV
ARPPARPAAVRGTWELFAPAGYGIAATLRAALQDAQAGSGVLVCLPPQCSADGIDLALAA
AKRALAAPKDSRFVLVQHGRAAAGLVKTLHQEASHLVTTVVDTPLTEDTVDRVVAEVSAT
TRFSEVHYSADGVRRVPTLRALPMSPEQQDKPLSASDVLLVTGGGKGISAECALAIAQDS
GTRLAVLGRSDPATDRELADNLKRMEDSGVTMRYARADVTNPEQVRTAVAELRGELGPIT
GVLHGAGRNEPGPLHALEPEDFRRTFAPKVDGLRTVLEAVDAEELKLLVTFGSIIGRAGL
RGEAHYATANEWLADLTEEIARTHPQVRARCVEWSVWSGVGMGEKLSVVESLSRQGIVPV
SPDQGVEILLRLIRDPDAPVVTVVSGRTEGIETVRRDLPPLPLLRFTGTPLVRYHGVELV
TEVELNAGTDPYLGDHLLDGNLLLPAVMGMEAMVQVAAAATGWPGTPVIEGARFLRPIVV
PPDGSTTIRVAATVTGPDTVDVAVHASDTGFAAEHFRARLVYSVAGVPDGPPLQTGSDTP
EVPLDPASDLYGGILFQGSRFQRLRRFHRMAARHVDADVTVRRPEGWFAGFLPAEMLLAD
PGMRDALMHGNQVCVPDATLLPSGVERVHPLGNSGNVPDQLRYCAVERSRDGDTYVYDIA
VRDAEGTVVERWEGLTLHAVRKTNGSGPWVAPLLGPYLERTLEEVLGAHIAVTVEPHGDN
PAGSVAERRALTTIAASRTLGAAVTVRHRPDGRPEVDGGWHISASHGLELTVSAVARAEV
ACDIEAVSMREPSEWQGLLGEYAAVAELVARETGEAPDTAATRVWSAVECLRKAGAMAGT
PLTVLPQKKEAWVVFTAGDLRIATFVTALRDALEPAVFAFLTRTPELLEGRSQDYVG
selected fasta
>polyketide synthase [warhead-forming iterative polyketide synthase]
ATGACCAGAATCGCCATCGTCGGCATGGCCTGCCGCTACCCCGACGCCACCAGTCCCGCC
GAACTGTGGGCCAACGCCATTGCCGGACGCCGAGCCTTCCGACGCCTCCCCGAGGAACGA
ATACGTCTGGAGGACTACTGGGACGCCGATCCGTCCACACCCGACACCTTCTACGCCCGC
AACGCGGCCGTGCTCGAGGGGTATTCCTTCGACCGCGTTACCCACCGGATCGCCGGCAGT
ACGTTCAGGTCCACCGACATGACGCACTGGCTCGCCCTGGACACTGCCGGGCGGGCGCTG
GCCGACGCCGGGTTCCCGGCGGGTGAGGGGCTGCCTCACGAGCGGACCGGCGTCGTCATG
GGCAACACGCTCACCGGTGAATTCACCCGTGCCAACGTCATGCGGCTGCGCTGGCCGTAC
GTGCGGCGGGTGATGGCGGCCGCGCTCGCCGGACAGCAGGACTGGGACGAGGCCCGGGTC
ACCGCGTTCCTCGAGGAGGTCGAAACCTCCTACAAGGCGCCGTTCCCGCCCGTCGACGAG
GACACTCTGGCCGGTGGGCTCTCCAACACCATCGCCGGCCGGATCTGCAACCACTTCGAC
CTCAACGGCGGCGGATACACCGTCGACGGAGCCTGCTCCTCCTCGCTGCTGTCGGTCACC
ACCGCCGGAACAGCTCTGGTCAACGGTGACTTGGACGTCGCCGTCGCCGGTGGTGTCGAC
CTGTCCATCGACCCGTTCGAGATCATCGGCTTCGCCAAGACCGGTGCTCTGGCCCGGGGG
GAGATGAAGCTGTACGACAAGGGCTCCAACGGTTTCTGGCCCGGCGAGGGCTGCGGAGTG
GTCGTGCTGATGCGGGAAGAGGACGCGATCGCACGCGGCCACCGCATCTACGCGACCGTC
GCAGGCTGGGGGGTGTCCTCGGACGGTCAGGGCGGGATCACCCGGCCCGAGGTCGACGGC
TACCGCCTGGCCCTCGAGCGTGCCTACGCGCGTGCCGGGTTCGGCATCGAGACCGTCCCC
CTCTTCGAGGGCCACGGCACGGGAACGGCCGTTGGTGACGCGACGGAGCTGGCGGCGCTG
ATAAAGGCCCGCTCGGCAGCCGACCCGCAGGCGCCTGTCGCCGCCATCGGCTCCATCAAG
GGCATGATCGGTCACACCAAGGCGGCAGCAGGCGTGGCAGGTCTGATCAAGGCGGCCCTG
GCGGTGGACAACCAGACCCTGCCGCCCTCCATCGGCACCTCCGATCCGCACGAGCTGCTC
ACCGAGCCAGGGGCCAACCTCAAGGCGCTGCGCAAGGCGGAAACCTGGCCCCGGGAACTG
CCGCGCCGCGCGGGCATCACCGCCATGGGGTTCGGCGGCATCAACACGCACGTAGTCCTG
GACGAGCCGTCCGGCCGGCGCCGGCCGGCTTCCGTCCGCCGGCTCACCCCCCTGGCCGAC
TCCATGCAGGACAGCGAACTCCTGCTGTTCGAGGGGGCCTCGGCCCGAGAGCTGAGCCAC
AGGTTGTCCGAGGTCGCGGACTACACTGTGAGGCTCTCGTACGGGGAGATCGCCGACCTC
GCCGCCACTCTCCAGCGCGAGCTCCGGGGCCTTCCGCACCGGGCAGCGGCGGTGGTGACC
TCTCCGGACGACGCCGAGAACCGGCTGCGCCACCTCGCGGACCTTCTGGACCGGGGGGAG
ACGGAGCACTGGGCCGCGGACGGCCGGACCCTCCTTGGAAAGGCCACCGGCCGCAAACGG
ATCGGTCTGCTGTTTCCCGGCCAGGGCTCTGGACGCGGCACCGGTGGCGGTGCGTTGAGC
CGCCGCTTCCCCGAGGTCGCCGAGGTGCTGGCTCGCGCCGGGTCGGCGGCGGGCTCGGAC
ACCGTGGCCACCGAAGTGGCCCAGCCGCGCATCGTCACTGGTTCGGCAGCGGGTCTGCGT
GTTCTGGACGAGCTGCGGGTGGAGGCGTCCGTCGGTATCGGACACAGCCTCGGCGAGCTC
TCCGCCCTGTGCTGGGCCGGGGCTCTCGACGAGGACGTCCTGATCGAGGCGGCGGGCGTG
CGTGGCAGGGCAATGGCGGAGCACGGGTCGTCGGGAACCATGGCGTCACTGGGTGCCGCA
CCGGAGCAGGCGGAGGAGCTCATCGGCGCCCTCTCCGTGGTCGTGGCCGGCTACAACGGT
CCGCAGCAGACGGTCGTCTCGGGTCCCGTGCACGAAGTGGAGGAGGTGCGCAGGCGGGCC
GCTCGCTCCGGCGTGACGTGTACGCCGCTTGCCGTGTCCCACGCGTTCCACTCACCGCTC
GTGGCGTCCGCCGCCGAGTCGTTCGGCAACTGGCTGAAGAGCGTTGACTTTCGCGAGCCC
GCCGGACGTGTGGTGTCCACGGTCACCGGGGCCGAGCTGACACCGGGCACTGACCTGTCG
GCGCTGCTGCGGGAGCAGATCACCGCTGCGGTGCGTTTCACCGAAGCGGTCAGGGCCGCG
GCCCAGGACGTCGACCTGTTCATCGAGGTGGGACCCGGCCGGGTGCTCGGCCACCTGGCC
GGGACGGCGACGAACATTCCCGCGGTTTCCCTCGACACGGACGACGAGTCCCTGCGATCG
CTCCTGCAGGTGGTGGGCGCCGCGTTCGTCGTCGGCGCGCCCGTCGCCCCCGAACGCCTC
TTCCGGGACCGGTTGATACGCCCGCTCCGGATTGGCCAGGAGCTCTCCTTCCTGGCCAGT
CCATGCGAACAGGCACCGGCGACGACCCTACCCGTATCGCGCCGGTCCGCCCAGCCGCCC
GCCGTACCTGCTGATCGCGAACAAGAGCCGCAGCCCGCGGCCGTGTCACCTCCGGCAGCA
CAGAACTCCCCGGCCTCGAACGACACCTCCACCGCGTCCACCGCGTCCACCGCCGGGTCC
GAGCGGACGCCTCAGGAGGAGGAGAGCATCGGCGCCAAGGCCCTCGATGTCCTCAGTGCC
CTGGTCGTCGAGCGAGCCGAACTCCCGGCCCACCTGGTGGACCCGGACAGCAGGCTCCTG
GACGACCTGCACCTGAGCTCCATCACCGTCGGCCAGATCGTGAACCAGGCCATGGCGCAA
CTCGGTATCGCCCCGGCAGCGCAGGAGCCGACGAACTTCGCCACTGCCACGCTGGCGGAA
CTGGCCGAAGCGCTCGAGAGCCTGGCCAGTACCGGCGGCCCGGCCGATGCCGGTGCGGCT
TCGTTCATCGCCGGAGCGGCGCCGTGGGCGCGTCCCTTCGCGGTGGACCTGGACGCGGTC
GCCCGGCCGCCGGCGCGTCCGGCAGCGGTTCGCGGCACCTGGGAGCTGTTCGCACCGGCC
GGGTATGGGATCGCCGCGACACTGCGCGCGGCGCTCCAGGACGCCCAGGCGGGTTCCGGA
GTGCTGGTCTGTCTGCCGCCCCAGTGCTCTGCCGACGGGATCGACCTGGCGCTAGCAGCG
GCGAAGCGGGCGCTCGCCGCCCCGAAGGACAGCCGTTTCGTGCTGGTGCAGCACGGCCGC
GCTGCCGCCGGCCTGGTCAAGACCCTCCACCAGGAGGCGTCCCACCTGGTGACGACTGTC
GTCGACACCCCCCTCACCGAGGACACGGTGGACCGGGTGGTCGCCGAGGTGTCGGCGACC
ACCCGGTTCTCCGAGGTGCACTACAGCGCGGACGGAGTCCGCCGCGTCCCCACGCTGCGG
GCACTCCCCATGAGCCCGGAGCAACAGGACAAACCGCTCAGCGCATCCGACGTCCTGCTG
GTCACCGGGGGTGGCAAGGGCATCTCCGCCGAGTGCGCCCTGGCGATCGCCCAGGACAGC
GGGACACGGCTTGCGGTGCTGGGACGCTCCGACCCGGCCACGGACCGAGAACTGGCCGAC
AACCTGAAGCGGATGGAGGACAGCGGTGTAACCATGCGGTACGCGCGCGCCGACGTCACC
AATCCGGAGCAGGTCCGGACGGCAGTCGCCGAGCTGCGCGGCGAGCTGGGTCCGATCACC
GGCGTGCTGCACGGCGCCGGACGTAACGAACCCGGGCCGTTGCATGCGTTGGAACCGGAG
GACTTCCGGCGTACCTTCGCTCCCAAGGTGGACGGCCTACGGACCGTACTCGAGGCAGTG
GACGCCGAGGAACTGAAACTGCTCGTCACGTTCGGCAGCATCATCGGCCGTGCCGGCCTG
CGGGGCGAGGCGCACTACGCCACCGCGAACGAGTGGCTGGCCGACCTCACCGAGGAGATC
GCACGCACGCACCCGCAGGTACGCGCCCGCTGCGTGGAATGGTCGGTGTGGTCCGGGGTC
GGGATGGGTGAGAAGCTCTCGGTCGTCGAGTCGCTCTCCCGCCAAGGCATCGTCCCGGTC
TCCCCGGATCAGGGGGTAGAGATCCTCCTGCGGCTGATCCGGGATCCCGACGCGCCGGTG
GTGACGGTCGTCAGCGGCCGTACCGAAGGCATCGAGACGGTGCGCCGTGACCTGCCGCCC
CTGCCGCTTCTCCGGTTCACCGGCACCCCGCTGGTGCGCTACCACGGCGTGGAGCTCGTC
ACCGAGGTCGAGCTGAACGCGGGCACGGACCCCTACCTCGGCGACCACCTGCTGGACGGC
AATCTCCTGCTGCCTGCGGTGATGGGGATGGAAGCCATGGTTCAGGTCGCGGCCGCGGCC
ACCGGCTGGCCGGGGACACCGGTCATCGAGGGCGCGCGCTTCCTGCGTCCCATCGTGGTT
CCACCCGACGGGAGCACCACCATCCGTGTCGCCGCGACGGTGACCGGACCGGACACGGTC
GACGTCGCCGTCCACGCCAGCGACACCGGATTCGCCGCAGAGCACTTCCGCGCCCGGCTG
GTGTATTCCGTCGCCGGTGTCCCGGACGGGCCGCCGCTGCAGACGGGCTCCGACACCCCG
GAAGTTCCTCTGGACCCAGCAAGCGACCTCTACGGCGGCATCCTCTTCCAGGGCTCCCGC
TTCCAGCGGTTGCGGCGATTCCACCGAATGGCGGCCCGGCACGTGGACGCCGACGTGACA
GTGCGAAGGCCGGAGGGCTGGTTCGCCGGCTTCCTCCCTGCGGAGATGCTTCTGGCCGAC
CCCGGCATGCGCGACGCGCTGATGCACGGCAACCAAGTGTGCGTGCCCGACGCCACGCTG
CTTCCTTCGGGGGTCGAGCGTGTCCACCCCCTGGGCAACAGCGGGAATGTACCCGACCAA
CTGCGTTACTGCGCGGTCGAGCGCAGCCGTGACGGCGACACATACGTGTACGACATCGCG
GTACGCGACGCCGAGGGCACCGTCGTCGAACGCTGGGAAGGTCTGACCCTGCACGCGGTG
CGCAAGACCAACGGCTCCGGCCCCTGGGTCGCGCCCCTGTTGGGACCGTACCTGGAGCGG
ACCCTCGAGGAAGTGCTCGGTGCGCACATCGCGGTGACGGTCGAACCGCACGGCGACAAC
CCGGCTGGGTCGGTCGCCGAACGTCGGGCCCTGACCACCATCGCGGCCTCCCGGACCCTC
GGGGCCGCCGTGACCGTGCGTCACCGGCCCGACGGGCGGCCGGAGGTGGATGGTGGGTGG
CACATCTCGGCCTCCCACGGCCTGGAACTCACCGTGAGCGCTGTGGCCCGGGCGGAGGTT
GCCTGTGACATAGAGGCGGTCAGCATGCGGGAGCCGAGCGAGTGGCAGGGGCTGCTCGGC
GAGTACGCCGCGGTCGCCGAACTCGTCGCCCGGGAGACCGGCGAAGCTCCCGACACGGCC
GCCACCCGGGTGTGGAGCGCGGTCGAGTGCCTGAGGAAGGCGGGCGCCATGGCGGGCACA
CCGCTGACCGTACTGCCGCAGAAGAAGGAAGCGTGGGTGGTCTTCACCGCCGGCGACCTC
CGGATCGCGACCTTCGTCACGGCCCTGCGGGACGCTCTGGAACCCGCCGTCTTCGCATTC
TTGACGCGCACACCGGAACTGCTGGAAGGACGGTCCCAGGACTATGTCGGATGA
[1] KS2..412
[1] AT619..835
[1] acetyl-CoA malonyl-CoA754..758
[1] ACP956..1057
[1] KR1237..1420
[1] DH1489..1644
[1] PPT1746..1956
[1] KS4..1236
[1] AT1855..2505
[1] acetyl-CoA malonyl-CoA2260..2274
[1] ACP2866..3171
[1] KR3709..4260
[1] DH4465..4932
[1] PPT5236..5868

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [575-687]  7.5e-39 G3DSA:3.40.366.10 [754-862]  7.5e-39 G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR009081 Acyl carrier protein-like (Domain)
 [974-1070]  2.6999988713206e-08 SSF47336
SSF47336   ACP_like
 [975-1049]  1.9e-05 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
IPR013968 Polyketide synthase, KR (Domain)
 [1239-1419]  5.00000000000001e-36 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [2-289]  6e-51 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [297-412]  2.4e-28 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [619-835]  7e-26 PF00698
PF00698   Acyl_transf_1
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [580-861]  2.60000517657733e-42 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [690-753]  1e-15 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [2-125]  1.09999999999999e-61 G3DSA:3.40.47.10 [159-301]  1.09999999999999e-61 G3DSA:3.40.47.10 [302-463]  5.30000000000001e-41 G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [1-410]  4.39999001237725e-61 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1236-1426]  8.20000000000006e-33 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
SignalP No significant hit
TMHMM No significant hit