close all open/close all

CDS information : Pimar_00170


close this sectionLocation

Organism
StrainATCC 27448 (=NBRC 13367)
Entry namePimaricin
Contig
Start / Stop / Direction87,896 / 67,503 / - [in whole cluster]
81,152 / 60,759 / - [in contig]
Locationcomplement(67503..87896) [in whole cluster]
complement(60759..81152) [in contig]
TypeCDS
Length20,394 bp (6,797 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.1 PKS
Productpolyketide synthase
Product (GenBank)PimS1 protein
Gene
Gene (GenBank)pimS1
EC number
Keyword
Note
Note (GenBank)
Reference
ACC
PmId
[10187796] The biosynthetic gene cluster for the 26-membered ring polyene macrolide pimaricin. A new polyketide synthase organization encoded by two subclusters separated by functionalization genes. (J Biol Chem. , 1999)
[11094342] A complex multienzyme system encoded by five polyketide synthase genes is involved in the biosynthesis of the 26-membered polyene macrolide pimaricin in Streptomyces natalensis. (Chem Biol. , 2000)

close this sectionPKS/NRPS Module

1 malonyl-CoA
2 malonyl-CoA
3 malonyl-CoA
4 malonyl-CoA
KS34..410
AT565..885
KR1191..1375
ACP1470..1540
KS1561..1933
AT2104..2414
DH2463..2629
KR2928..3104
ACP3204..3274
KS3299..3673
AT3829..4140
DH4189..4349
KR4631..4807
ACP4905..4975
KS5002..5374
AT5543..5854
DH5903..6063
KR6357..6533
ACP6636..6706

close this sectionSequence

selected fasta
>polyketide synthase [PimS1 protein]
MSNEEKLREYLKRAIADLHETRQQLDETEAKQREPLAIVSMACRFPGGVRSPEELWELLR
DGVDAVSSFPRNRGWDLDALYHSDPAHQGTSYAREGGFLHDAGEFDPGFFGISPREALAM
DPQQRLLLETAWEAVERAGIDPESLAGSRTGVFVGTGHGGYDAEGRRRADEVGGHLLTGN
HISIASGRISYVLGLEGPALTVDTACSSSLVALHLAMHALRRDECAMALVGGATVMSTPQ
MFVEFSRQRGLAPDGRCKPFAAAADGTGWSEGVGLLLVERLSDAVRNGYPVLAVLKGSAV
NQDGASNGLTAPNGPSQQRVIRQALTGAGLAASDIDAVEAHGTGTTLGDPVEAHALLATY
GQQRAADRPCGLGSMKSNIGHTQAAAGIAGVMKMVLAMRHGHLPRTLHLDEPTGHVDWSE
GNARLLAEPEPWPSAGRPRRAAVSSFGISGTNAHVILEQAPAHEAEPAPEPAARPGALPW
ILSARTEAGLRAQADRLGRHLRDRADLEPAAVAHALADTRTLMEHRAVVVAGDREEFLRG
LDALAAGRTANGLVSGVAVKAASAFLFAGQGSQRPGMGRELHAAHPVFATAFDAVCAELD
PHLDRPLRDIVFAEEGSAEAALLDQTAYTQAALFALETALFRLVESWGVAPRFVAGHSIG
ELTAAHVSGVLTLHDAARLVAARGTLMQALPAGGAMVAVQATEDEIRERLAGHEDHVALA
AANGPDSTVISGDEQAVTEIAAHWEAQGRRTKRLRVSHAFHSPHMDDMLEDFRRVARGLT
FHAPRIPVVSTVTGALATEDELRSPDYWVRQVRETVRFCAAVRTLEAEGVTTFVEIGTGG
VLTPMVQDCLTTLEEPVLVPLLRTGRPETVALTEGVATAFVHGVPVDRSAFPGAPGTSRA
DLPTYAFQRQWYWLDPADHDEGEAAAAEAGEAGFWAAVEREDLQELSAVLAIDGSEADSL
GSFLPTLSSWRRQRRTQAAADRFSYRTHWAPRTASGGPTATGHWLVVLPEGGTDDPWTAR
LLDALNDQGLHTDVRELPADHEPDAWGRHPVDGVLCLLALDERPTRSCPPYRRGLAATTN
AAARPEGAGIQAPLWCVTRGAVAVDRHEALKSPLQAQTWGLGRVAALESPQSWGGLIDLP
DNLDGRAVSALLSTLAGEEDQVAVRPAGVFARRLERITPGGDTGDRWSTHGTVLVTGGTG
ALGAHLAHWLADAGAEHLVLTGRRGPQAPGAPELAAALTDRGVKVTLAACDAADRDALAA
VLADIPPHLPLTGVVHAAGVLDDGVLDALTPERFETVLRPKARAAQNLHELTQDLDLDHF
VLFSSIVGVLGNAGQANYAAANAYLDALAEHRLAQGLPATSVSWGPGQAAAWHDSDAADR
MSRDGLLPMAAAPRRRPAPALAQGMTQVTVADIDWSAYAPALTAVRPSPLIGDLPEARRA
LGPAEGPRRERSPLRDRIGALPPAEQEKAFLTMVREEAARVLGHPSPDTVDAQRAFREQG
FDSLMAVDLRNRLSAATGLRLPATLLFDHPTPLAAAACLRSEVLGAAGPATVVQASTAAL
DEPVAIIGMACRFPGGVHSPEALWRLLAEGGDAITPMPADRGWDLDRLYHPDPDHQGTSY
ARGGGFLDGAADFDADFFGISPREALAMDPQQRLLLETWEVLEQAGIDPESLRGSSTGVF
AGTNTQDYGTALDAAQDEAGGHRLTGNAMSVVSGRVSYTFGFEGPALTVDTACSSSLVAL
HMAAQALRQGECSLAVAGGVTVMATPSSFVEFARQRGLAPDGRCKPFAAAADGTGWSEGV
GLLLVERLSDARRNGHQVLAVVRGSAVNQDGASNGLSAPSGPSQQRVIRQALANARVAAS
EVDAVEAHGTGTTLGDPIEAQALLATYGQERPLLLGAVKSNLGHTQAAAGVAGVMKMVLA
MRHGMLPRTLHVDEPTGHVDWTAGAVELLTEHTDWPETGHPRRAAVSAFGISGTNAHVVL
ELPAAEQPLVEQPSAAEPDAPATAPDRTPTASDGTAPLLLSAKSESALRAQAARLHSHLE
RDPALRLTDAAYTLMTHRTAFAHRAAVRAADHEAALRALTALAAGEADPAVDTGTAHTGR
DAVLFSGQGSQRIGMGRELSGRYPVFAEAFDTVCAALDEHLDRPLRDVVRGEDEELLNRT
VYAQAGLFAIEVALFRLVESWGVRPHYVAGHSVGEIAAAHVAGVFSLADACALVAARGRL
MQALPAGGAMAAIRATEDEVLPHLADSVSIAAVNGPSSVVVSGAEHAVLSIAAHFEGAGR
KTTRLRVSHAFHSPLMDPMLADFRAVAEGLTYGEPELAVVSNVTGQLATPDQLRTPEYWV
THVRAAVRFADGIRALGAEGVTRFLELGPDGVLSALARESAPDDAVCTPVLRKDRSEAAT
LLAALTHLHVHGTEIDWTAFLAGRDAHAVDLPTYAFQHQRFWPTPDHTRTGDLGAVGLEA
TGHPLLSAAVELPDGEGLLFTTRLSLQTHPWLAGHVVMGSVLLPGTAFAELALRAADEVG
CDRVDELTLAAPLVLPEHGGVQLQLRVGPADASGRRTLTARSRAEGDGDRPWVQHATGVL
AEGESTPEPGYDFHTESWPPADAAPVELSGLYPDFAAHGFDYGPHFQGLRTAWRRGDEVF
AEVALPAEAEGEASAYGLHPALLDAALHVVAFNGVDRGVVPFSWESVALHATGASAVRIR
VVRHSGDTVSVDVADTTGEPVASIGTLVLRAVSADQLAGGADPAVRDALFRVQWNPVRLP
PAGAAVTVATLGSLAGAPFDGYPDLASLARSGRVAGAVLVPVEAGAGEVVADDVVGATHA
TAARALDLARSWLADDRFAASRLVFVTRGAVSGADLAGAAVWGLVRSALSEHPGRFGLVD
LDDDAELALVPRVLASDEPQLLVRGGEVLAARLARAQSSHAVTWDPSGTVLVTGGTGGLG
RVMARHLVVEHGVRNLLLVSRRGPAAEGAEELVTELRHSGAEVAVEACDVTDAAAVADLV
ARHRISAVVHTAGVLDDGVVESLTPERLSAVLRPKVDAAWNLHEATRDLDLDAFVVFSSV
AGTIGSPGQANYAAGNAFLDALAHHRRAAGLPAASLAWGPWSRDGGMTGTLTDVDSSASP
GRHARTHPRTGRGLFDAALAAGDAHLLPVRFDWASLRAQGEVPPLLRGLIRTRARRSAVG
GSAAAAGLVGRLSGRGTVERREVLLDLVRAQIAVVLGHANPETIESTRVFQDLGFDSLTA
VELRNRLNNATGLRLSATAVFDYPTADALVDFLLDELFGAQEEAELPAPVPSPAGAADDP
VVIVGMSCRYPGGVGSPEDLWRLVSEGVDAVSDFPTDRGWDVESLYSPDPEALGTSYTRS
GGFLHEAAEFDPDFFGMSPREALATDAQQRLLLETTWEAIERTGIDPASLRGSRTGVFAG
VMYTDYGDLLVGDQFEGYRSNGSAASIASGRVSYTFGFEGPAVTVDTACSSSLVALHWAA
QSLRSGECSLAVAGGVTVMSTPTTFVEFSRQRGLSADGRCKAFADAADGVGWGEGVGMLV
LERLSDARRNGHRVLAVVRGSAVNQDGASNGLTAPNGPAQQRVIRQALASAGLSAADVDA
VEAHGTGTTLGDPIEAQALLATYGQERPEDRPLLLGSVKSNIGHAQAASGVAGVIKMVLA
MRHGVLPRTLHVDEPSSHVDWSAGAVELLTSEAEWPQGEGPRRAGVSSFGVSGTNAHVIL
EQPGPDAADAAPDATVTDPGALAWVLSARNEAALRCQAARLLSLVAGSDALCARDIGHSL
VTGRSSFAHRAVVWGQDRDALVRALSALAVGEADAGLAEGASGAGRTAFLFSGQGSQRLG
MGWELYARYPVFADAFDAVCAALDEHLERPLRDVVWGEDAELLNQTAYAQAGLFAIEVAL
YRLAESWGMRPDFVAGHSIGEVAAAHVSGVFSLPDACALVAARGRLMQQLPSGGAMMAIR
ATEDEVLPHLAEGVSLAAVNGPSSVVISGAEDAVLAIAAHFAGEGRKTTRLRVSHAFHSP
LMEPMLEEFRAVVTRLSFGTPTIPVVSNLTGRLAEPEQLAHADYWVRHVREAVRFADGIQ
ALRAEGVTRFLELGPDGVLSAMARESASDDAVLAPVLRRDRPEETALLGALAQLYVRGAH
VDWTVPFAGSGARWADLPTYAFQHERFWPSGGVARPGDVRSAGLGSAGHPLLGAAVELAG
SGGLLFTGRLSVSSHPWLADHVVLGSVLVPGTALVELVLRAADEAGCDLLEELTLAAPLV
LPASGAAVQVQVAVGEPDEAGRRPVSVHAREGEGPWTLHASGAVTSGAEVPPFDATVWPP
KGAEPVDVADCYDVLADAGLTYGPAFHGLQAAWKLGGDVYAEAKLPESTDGDAYGLHPAL
FDAALHASALGGAEAGGVPFSWAGVSLHATGASHLRVRIREAGGALSVAIADTSGAPVAS
VESLVIRPLSAGQVQAADRDALFKADWVPVPLTDERVEPGTGPEGEPLRTYADLDSLEGA
AVPGTVLVAPPSGAAGTVESVHAATVWALEMVQAWLADDRFATSRLVFVTRGAAFGADLA
AAAVRGLVRSAQSENPGRFGLVDMDGDADTTVPAQALATDEPELLVRGGEVLAARLVRAQ
SSHTVTWDPSGTVLITGGTGGLGRSVARHLVSEHGVRSLLLVSRRGPAAEGAGELVAELR
GSGAEVVIEACDVTDAVAVADLVARHRISAVVHTAGVLDDGVVESLTPERLAVVLRPKVD
AAWNLHEATRGLDLDAFVVFSSVAGTFGSAGQANYAAGNAFLDALAYHRRAVGLPAVSLA
WGPWSQDGGMTGTLSDADVQRIARQGMPPLTVEEGLALFDAALGSAEPMALPVRLDLAAL
RAQGEPQPLLRGLIRTRTRRSGAAAASGIAQRLAGLSTAERREALLDVVRAQIATVLGHA
GPETIAPDRAFQDLGLDSLTAIELRNLLGKATGLRLPATTVFDYPTVDALAAHLLDELFG
AETGTATETPLPVPGLPSLADDPVVIVGMSCRFPGGVASPEDLWRLVADGVDAVSAFPTD
RGWEIDDTYDPEREGAIATRSGGFLHDAAEFDPEFFGMSPREALTTDAQQRLLLETTWEA
LERAGMDPATLRGSRTGVFAGVMYHDYSTLLSGREFEGYQGSGSAGSVASGRVSYTFGFE
GPAVTVDTACSSSLVALHLAAQSLRSGECSLALAGGVTVMSTPLTFVEFSRQGGLSADGR
CKAFADAADGVGWAEGAGILVLERLSDARRNGHRILATVRGSAVNQDGASNGLTAPNGPA
QQRVIRQALASAGLSAADVDAVEAHGTGTTLGDPIEAQALLATYGQERPEDRPLLLGSVK
SNIGHAQAASGVAGVIKMVLAMRHGVLPRTLHVDEPSSHVDWSAGAVELLTSEAEWPQGE
GPRRAGVSSFGISGTNAHVILEQPEPVAAETESITPDTAPDAAEDEAADSGTPVPALLSG
RSASALRAQAARLLSRLDGDPGPRITDVAYSLATGRSAFPHRAVILAANRADLLHSLSAL
AEGHTEAPAVVAQDRARSGKLAFLFSGQGSQRLGMGRELYGRYPAFAEALDAVCAALDAH
LDRPLRDVIWGEDAELLNRTGYAQTGLFAIEVALFRLLESWGVRPDHLLGHSIGEIAAAH
VAGVLSLPDACALVAARGRLMQQLPSGGAMMAIRATEDEVLPHLAEGVSLAAVNGPSSVV
VSGAEDEVLALAAHFEEEGRKTTRLRVSHAFHSPLMEPMLADFRAVADGMTYAAPRIPVV
SNVTGRPATAEELCCAEYWVGHVREAVRFADGVGALREQGVTTFLELGPDGSLSALAAES
AADDSVLAPVLRKNRPEAPALLTALARLHAQGTPVDWSAAFAGTGARWVDLPTYAFQHER
FWPSGGAARAGDVRSAGLGSAGHPLLGAAVELAGSGGRLLTGRLSLSSHPWLADHVVLGS
VLVPGTALMELVLRAADEVDCAAVDELTLAAPLVLPASGAAIQVQVWVGEPDEAGRRPVS
VHAREGEGPWTLHADGALAPAAETVPFDTAIWPPQGAEHLDAAGCYERFADAGFAYGPVF
QGLRAAWKLGEDIYAEVALPEGTDGNAYGLHPALFDAALHAALLGGEGTDEAAVPFSWNG
VTLHATGASRVRVRIRPTEGGTSIALVDTAGAPVASVRSLTARPITAGQLQTGDRDSLFQ
VDWTTLHLTDERANSLALLGKDTEGILDTLSLQPHADLDDLAATGVHDTVLAPLPTRTAG
TVESVHAATTGALALIRSWLADDRFAASRLVFVTRGAVSGTDLAGASVWGLVRSALLEHP
GRFGLVDVDVDQDAEVPLVPRALASDEPQVLVRGGEVLAARLVRAQSSDTVTWDPSGTVL
ITGGTGGLGRSVARHLVSEHGVRSLLLVSRRGPAAEGVDALVAELAECGAQVTVEACDVT
DAVAVADLVARHRISAVVHTAGVLDDGVVESLTPERLSAVLRPKVDAAWNLHEATRGLDL
DAFVVFSSVAGTFGSAGQANYAAGNAFLDALAYHRRAVGLPAVSLAWGPWSQDGGMTGTL
SDADVQRIARQGMPPLTVEEGLALFDAALGSAEPMALPVRLDLAALRAQGEPQPLLRGLI
RTPGRRTAAAATEGDTAAAFAGRLTGLSAAEGREVVLGAVRSQIAGVLGHAEATEIDQDR
AFLDLGFDSLTAVELRNRLGAVTGIRLPATLLFDYPTPAELVAHLHARIAPEPTVGPEAL
LGELERMEKSFGGLDLTEEMHEQIAGRLEVLRAKWDALRDTAAAAGHDGSPSDEDFDFES
ASDDEVFDLLDNELGLS
selected fasta
>polyketide synthase [PimS1 protein]
ATGTCGAACGAGGAGAAGCTGCGCGAGTACCTCAAGCGCGCGATCGCGGACCTTCACGAG
ACTCGTCAGCAATTGGACGAGACCGAGGCGAAGCAGCGAGAGCCCCTCGCGATCGTGTCG
ATGGCCTGCCGCTTCCCCGGCGGCGTCCGTTCGCCCGAGGAGCTGTGGGAGCTGCTGCGC
GACGGCGTCGACGCGGTTTCCTCCTTCCCCCGTAACCGCGGCTGGGACCTGGACGCGCTC
TACCACTCCGACCCGGCCCACCAGGGCACCAGCTATGCGCGCGAGGGCGGATTCCTGCAT
GACGCGGGCGAGTTCGACCCCGGCTTCTTCGGGATCTCCCCGCGCGAGGCGCTCGCCATG
GACCCCCAGCAGCGGCTGCTGCTGGAGACCGCATGGGAAGCCGTCGAGCGGGCCGGTATC
GACCCGGAGTCCCTCGCGGGCAGCCGAACGGGTGTCTTTGTCGGCACCGGGCACGGAGGG
TACGACGCCGAGGGCCGACGGCGTGCCGACGAGGTCGGCGGGCACTTGCTGACGGGCAAT
CACATCAGCATCGCCTCCGGCCGGATTTCGTATGTCCTGGGGCTGGAAGGCCCTGCCCTG
ACCGTGGACACGGCCTGCTCCTCGTCGCTGGTCGCCCTGCATCTGGCGATGCACGCGCTG
CGGCGCGACGAATGCGCCATGGCCCTGGTGGGCGGCGCGACCGTGATGTCCACGCCGCAG
ATGTTCGTGGAGTTCTCCCGCCAGCGCGGCCTTGCCCCCGACGGCCGCTGCAAGCCGTTC
GCGGCCGCCGCCGACGGCACCGGCTGGAGCGAGGGTGTCGGACTGCTGCTCGTCGAGCGG
CTCAGTGACGCCGTACGCAACGGCTATCCCGTCCTCGCCGTGCTGAAGGGCTCGGCCGTC
AACCAGGACGGCGCGTCCAACGGCCTGACCGCCCCCAACGGCCCCTCGCAGCAACGCGTC
ATCCGCCAGGCGCTGACCGGCGCGGGCCTCGCCGCCTCGGACATCGACGCCGTGGAGGCG
CACGGCACCGGCACCACCCTCGGCGACCCCGTCGAGGCGCACGCCCTGCTGGCCACCTAC
GGGCAGCAGCGCGCCGCCGACCGGCCGTGCGGACTCGGCTCCATGAAGTCCAACATCGGG
CACACCCAGGCCGCCGCCGGTATCGCCGGCGTGATGAAAATGGTCCTGGCGATGCGGCAC
GGGCACCTGCCCAGGACCCTGCACCTGGACGAGCCCACCGGGCACGTCGACTGGAGCGAG
GGCAACGCCAGGCTCCTCGCGGAGCCCGAGCCCTGGCCGAGCGCCGGCCGGCCCCGTCGC
GCCGCCGTCTCCTCCTTCGGCATCAGCGGCACCAACGCCCACGTCATCCTGGAGCAGGCG
CCCGCCCACGAGGCCGAACCGGCCCCCGAACCGGCCGCCCGGCCGGGCGCGCTGCCCTGG
ATCCTGTCCGCCCGCACCGAAGCGGGCCTGCGTGCCCAGGCCGACCGCCTCGGCCGCCAC
CTACGGGACCGCGCCGACCTCGAACCGGCCGCCGTCGCCCATGCGCTCGCGGACACCCGG
ACCCTCATGGAACACCGCGCGGTAGTCGTCGCGGGCGACCGGGAGGAGTTCCTGCGCGGC
CTGGACGCCCTCGCCGCGGGCCGCACCGCCAACGGCCTGGTCAGCGGCGTCGCCGTCAAG
GCCGCCAGCGCGTTCCTCTTCGCCGGGCAGGGCTCCCAGCGACCGGGCATGGGGCGCGAA
CTGCACGCCGCGCACCCCGTGTTCGCCACGGCCTTCGACGCGGTGTGCGCCGAACTGGAC
CCACACCTGGACCGGCCGCTGCGCGACATCGTCTTCGCCGAGGAAGGCAGCGCCGAGGCC
GCCCTGCTCGACCAGACCGCCTACACCCAGGCCGCGCTCTTCGCCCTGGAAACCGCCCTG
TTCCGGCTCGTCGAATCCTGGGGCGTGGCACCCCGGTTCGTCGCCGGACACTCCATCGGC
GAGCTGACCGCCGCCCACGTCAGTGGCGTGCTGACCCTCCACGACGCCGCACGGCTGGTC
GCCGCGCGCGGCACCCTCATGCAGGCGCTGCCCGCAGGCGGCGCCATGGTGGCGGTCCAG
GCCACCGAGGACGAGATCCGCGAGCGTCTCGCCGGCCACGAGGACCACGTCGCCCTCGCG
GCCGCCAACGGGCCCGATTCCACCGTCATTTCGGGCGACGAACAGGCCGTCACCGAGATC
GCGGCCCACTGGGAGGCACAGGGCCGCCGCACCAAGCGGCTGCGGGTCAGCCACGCCTTC
CACTCCCCGCACATGGACGACATGCTGGAGGACTTCCGGCGCGTCGCCCGCGGTCTGACC
TTCCACGCCCCCCGCATCCCCGTGGTGTCCACGGTGACCGGCGCGCTCGCCACCGAAGAC
GAACTGCGCTCGCCCGACTACTGGGTGCGGCAGGTCCGCGAAACCGTCCGCTTCTGTGCT
GCCGTGCGCACCCTTGAGGCCGAGGGCGTCACCACCTTCGTGGAGATCGGCACCGGCGGC
GTCCTCACCCCCATGGTCCAGGACTGTCTGACCACCCTCGAAGAGCCCGTTCTCGTCCCT
CTGCTGCGCACCGGCCGCCCCGAAACCGTCGCTCTCACCGAGGGCGTCGCCACCGCCTTC
GTGCACGGTGTCCCCGTCGACCGGTCCGCCTTCCCGGGCGCGCCCGGTACCTCCCGCGCG
GACCTGCCCACCTACGCCTTCCAGCGTCAGTGGTACTGGCTGGACCCGGCCGACCACGAC
GAGGGGGAGGCGGCCGCCGCCGAAGCGGGCGAGGCCGGATTCTGGGCGGCCGTCGAACGC
GAGGACCTCCAGGAGCTGTCGGCCGTCCTGGCCATCGACGGCAGCGAAGCGGACTCCCTC
GGCAGCTTCCTGCCCACCCTCTCCTCCTGGCGCAGGCAGCGCAGGACCCAGGCCGCCGCG
GACCGCTTCAGCTACCGCACCCACTGGGCCCCGCGCACCGCCTCGGGCGGCCCCACCGCC
ACCGGGCACTGGCTCGTCGTCCTGCCCGAAGGCGGCACCGACGACCCGTGGACCGCCCGC
CTCCTGGACGCGCTGAACGACCAGGGCCTGCACACCGACGTACGCGAACTGCCCGCCGAC
CACGAGCCCGACGCCTGGGGCCGACACCCCGTGGACGGCGTGCTCTGTCTGCTGGCACTC
GACGAGCGGCCCACCCGCTCCTGCCCTCCGTACCGGCGCGGGCTGGCCGCCACCACCAAC
GCTGCTGCGCGCCCTGAGGGCGCGGGCATCCAGGCACCGCTGTGGTGCGTGACCCGCGGC
GCCGTCGCCGTCGACCGGCACGAGGCGCTCAAGAGCCCCTTACAGGCACAGACATGGGGC
CTGGGCCGGGTGGCCGCCCTGGAGTCCCCGCAGAGCTGGGGCGGGCTCATCGACCTGCCC
GACAACCTGGACGGACGGGCCGTCTCCGCGCTGCTGAGCACCCTCGCCGGGGAGGAGGAC
CAGGTCGCCGTCCGCCCCGCCGGGGTCTTCGCCCGCCGCCTGGAACGGATCACACCCGGC
GGCGACACCGGCGACCGGTGGAGCACCCACGGCACCGTCCTGGTCACCGGCGGCACCGGT
GCCCTCGGCGCGCACCTCGCCCACTGGCTGGCCGACGCCGGAGCCGAACACCTCGTGCTC
ACCGGCCGCCGCGGCCCGCAGGCCCCCGGCGCACCGGAACTCGCGGCCGCCCTCACCGAC
CGGGGCGTCAAGGTCACCCTCGCCGCCTGCGACGCCGCCGACCGTGATGCGCTGGCGGCC
GTCCTCGCGGACATCCCGCCGCACCTGCCGCTGACCGGCGTCGTCCACGCCGCGGGCGTA
CTGGACGACGGCGTACTGGACGCGCTCACCCCCGAGCGCTTCGAGACCGTACTGCGCCCC
AAGGCGCGGGCCGCACAGAACCTGCACGAACTCACCCAGGACCTCGACCTGGACCACTTC
GTGCTGTTCTCCTCGATCGTCGGCGTCCTGGGCAACGCCGGACAGGCCAACTACGCCGCC
GCCAACGCCTACTTGGACGCCCTCGCCGAACACCGTCTCGCCCAGGGGCTCCCGGCCACC
TCCGTGTCCTGGGGCCCTGGGCAGGCGGCGGCATGGCACGACAGCGACGCCGCCGACCGG
ATGAGCCGCGACGGACTGCTGCCCATGGCCGCGGCCCCGCGTCGCCGCCCTGCGCCAGCC
CTCGCCCAGGGCATGACACAGGTGACCGTGGCCGACATCGACTGGAGCGCATACGCCCCC
GCCCTGACCGCCGTCCGCCCCAGCCCCCTCATCGGCGACCTGCCCGAGGCACGCCGCGCG
CTCGGCCCCGCAGAAGGCCCCCGCCGGGAACGCTCCCCCCTGCGCGACCGGATCGGCGCA
CTGCCGCCCGCCGAACAGGAAAAGGCATTCCTGACCATGGTCAGGGAAGAGGCCGCGAGG
GTACTGGGACACCCCTCGCCGGACACCGTCGATGCCCAACGCGCCTTCCGCGAGCAGGGG
TTCGACTCCCTGATGGCCGTCGACCTGCGCAACCGGCTCTCCGCCGCGACGGGCCTGCGG
CTGCCCGCCACCCTGCTGTTCGACCACCCCACCCCCCTTGCGGCCGCCGCCTGCCTGCGC
TCCGAAGTCCTGGGCGCCGCAGGACCCGCCACGGTCGTTCAGGCATCGACCGCCGCCCTC
GACGAACCGGTGGCGATCATCGGCATGGCCTGCCGCTTCCCCGGCGGCGTGCACTCACCC
GAGGCCCTGTGGCGGCTGCTGGCCGAGGGCGGCGACGCCATCACCCCCATGCCCGCCGAC
CGGGGCTGGGACCTGGACCGGCTCTACCACCCCGACCCCGACCACCAGGGCACCAGCTAC
GCCCGCGGCGGCGGCTTCCTGGACGGCGCGGCCGACTTCGACGCGGACTTCTTCGGCATC
TCGCCGCGCGAGGCCCTCGCCATGGACCCGCAGCAGCGGCTGCTCCTGGAAACATGGGAG
GTGCTCGAACAGGCGGGGATCGACCCGGAATCCCTGCGGGGCAGCAGCACCGGTGTCTTC
GCGGGCACCAACACCCAGGACTACGGCACGGCCCTGGACGCGGCACAGGACGAAGCCGGC
GGACACCGGCTCACCGGCAACGCGATGAGCGTCGTCTCCGGCCGGGTCTCCTACACCTTC
GGCTTCGAGGGACCGGCCCTCACCGTGGACACGGCGTGCTCCTCCTCGCTGGTGGCCCTG
CACATGGCCGCGCAGGCGCTGCGCCAGGGCGAATGCTCCCTGGCGGTCGCGGGCGGTGTG
ACGGTGATGGCCACCCCGTCCTCCTTCGTGGAGTTCGCCCGGCAGCGCGGGCTGGCCCCC
GACGGCCGCTGCAAGCCGTTCGCGGCGGCCGCCGACGGCACCGGCTGGAGCGAGGGCGTC
GGCCTGCTGCTCGTGGAACGGCTCAGCGACGCCCGCCGAAACGGCCACCAGGTGCTCGCC
GTCGTCCGCGGTTCGGCGGTCAACCAGGACGGCGCGTCCAACGGTCTGAGCGCACCCAGC
GGCCCGTCCCAGCAGCGGGTGATCCGGCAGGCCCTGGCGAACGCCCGGGTGGCCGCCTCC
GAGGTCGACGCCGTGGAGGCCCACGGCACGGGCACCACGCTCGGTGACCCGATCGAGGCC
CAGGCGCTGCTGGCCACCTACGGCCAGGAGCGGCCGCTGCTGCTCGGCGCGGTGAAGTCC
AACCTCGGCCACACCCAGGCCGCCGCCGGTGTGGCGGGCGTGATGAAGATGGTGCTGGCG
ATGCGGCACGGCATGCTGCCGCGCACCCTGCACGTCGACGAGCCCACCGGGCATGTCGAC
TGGACCGCGGGCGCGGTCGAGCTGCTCACCGAGCACACGGACTGGCCCGAGACCGGCCAC
CCCCGGCGCGCCGCGGTCTCCGCGTTCGGCATCAGCGGCACCAATGCGCACGTGGTGCTG
GAACTGCCCGCAGCCGAACAGCCCTTGGTCGAACAGCCCTCGGCCGCGGAGCCCGACGCG
CCGGCCACCGCTCCCGACCGGACGCCCACCGCCTCCGACGGGACGGCGCCGCTGCTGCTC
TCCGCCAAGAGCGAGAGCGCCCTGCGCGCCCAGGCGGCCCGGCTGCACTCCCACCTGGAG
CGCGACCCCGCGCTCCGGCTCACGGACGCCGCGTACACGCTGATGACGCACCGCACGGCC
TTCGCCCACCGCGCGGCCGTCCGCGCCGCCGACCACGAAGCCGCGCTGCGCGCCCTGACC
GCCCTGGCTGCGGGCGAGGCCGACCCCGCCGTGGACACCGGCACCGCCCACACCGGCCGG
GACGCCGTCCTCTTCTCCGGCCAGGGATCGCAACGCATCGGAATGGGCCGGGAGTTGTCC
GGCCGCTACCCGGTGTTCGCAGAGGCCTTCGACACCGTGTGCGCGGCCTTGGACGAGCAT
CTGGACCGCCCCCTGCGGGACGTGGTCCGGGGCGAGGACGAGGAGCTGCTGAACCGGACC
GTCTACGCCCAGGCGGGGCTGTTCGCCATCGAGGTGGCCCTCTTCCGGCTCGTGGAGTCC
TGGGGCGTACGGCCGCACTACGTGGCCGGGCATTCCGTCGGCGAGATCGCCGCCGCGCAC
GTCGCCGGGGTGTTCTCGCTGGCCGATGCCTGCGCGCTGGTGGCGGCACGCGGACGGCTG
ATGCAGGCGCTGCCCGCCGGCGGCGCGATGGCGGCGATCCGGGCGACGGAGGACGAAGTC
CTCCCGCACCTGGCGGACAGCGTCTCGATCGCGGCCGTCAACGGCCCGTCGTCGGTCGTC
GTCTCCGGCGCCGAGCACGCCGTGCTCTCCATCGCCGCGCACTTCGAGGGCGCGGGCCGC
AAGACCACCAGGCTGCGGGTCTCGCACGCCTTCCACTCCCCGCTCATGGACCCGATGCTG
GCCGACTTCCGCGCCGTCGCCGAGGGCCTGACCTACGGCGAGCCGGAGCTGGCCGTCGTA
TCGAACGTCACCGGCCAACTCGCCACCCCGGACCAGCTGCGCACCCCCGAGTACTGGGTG
ACCCATGTCCGCGCGGCGGTGCGCTTCGCGGACGGGATACGGGCTCTGGGGGCGGAAGGG
GTGACGCGGTTCCTCGAACTCGGCCCGGACGGCGTCCTGTCGGCCTTGGCCAGGGAGTCG
GCACCGGACGACGCCGTGTGCACTCCCGTGCTGCGCAAGGACCGCTCCGAGGCGGCGACC
CTCCTCGCGGCCCTGACGCACCTGCACGTACACGGAACCGAGATCGACTGGACCGCGTTC
CTCGCCGGCCGCGACGCGCACGCCGTCGACCTGCCCACGTACGCCTTCCAGCACCAGCGG
TTCTGGCCGACCCCCGACCACACCCGCACCGGTGACCTGGGCGCCGTCGGCCTCGAAGCG
ACCGGGCACCCGCTGCTGAGCGCCGCCGTGGAACTGCCGGACGGTGAGGGCCTGTTGTTC
ACCACCCGCCTCTCGCTCCAGACCCACCCCTGGCTGGCCGGGCACGTCGTCATGGGCTCG
GTCCTGCTGCCGGGGACGGCCTTCGCCGAACTCGCCCTCCGCGCCGCCGACGAGGTGGGC
TGCGACCGCGTCGACGAACTGACCCTGGCCGCCCCGCTCGTCCTGCCCGAGCACGGCGGC
GTACAGCTCCAGCTGCGGGTGGGCCCCGCCGACGCGTCCGGCCGCCGCACCCTGACCGCC
CGCTCCAGGGCGGAGGGCGACGGCGACCGCCCGTGGGTCCAGCACGCCACCGGCGTCCTC
GCGGAAGGGGAGTCGACGCCCGAACCCGGCTACGACTTCCACACCGAGTCCTGGCCGCCC
GCCGACGCCGCGCCCGTCGAACTGTCCGGCCTCTACCCGGACTTCGCCGCACACGGTTTC
GACTACGGTCCCCACTTCCAGGGGCTGCGGACCGCCTGGCGCCGAGGCGACGAGGTGTTC
GCCGAGGTCGCCCTGCCCGCCGAGGCCGAAGGCGAGGCATCCGCGTACGGACTCCATCCG
GCGCTGCTCGACGCCGCCCTGCACGTCGTCGCGTTCAACGGAGTGGACCGCGGCGTCGTG
CCGTTCTCCTGGGAGAGCGTCGCGCTGCACGCCACCGGCGCCTCGGCCGTACGGATCCGG
GTCGTCCGGCACAGCGGCGACACGGTCTCCGTGGATGTCGCCGACACCACCGGCGAGCCC
GTCGCCTCCATCGGCACGCTCGTCCTGCGGGCGGTCTCCGCCGACCAGTTGGCGGGCGGC
GCGGACCCGGCCGTCCGCGATGCGCTGTTCCGCGTGCAGTGGAACCCCGTACGCCTGCCC
CCGGCCGGGGCCGCGGTGACCGTGGCGACGCTCGGCTCCCTTGCCGGCGCACCGTTCGAC
GGCTACCCGGACCTGGCGTCCCTGGCCCGGTCCGGTCGTGTGGCGGGTGCGGTGCTGGTA
CCGGTGGAAGCCGGTGCCGGCGAGGTGGTGGCGGACGATGTCGTGGGGGCGACGCACGCA
ACGGCCGCCCGGGCGCTGGACCTGGCCCGGTCGTGGCTGGCCGATGACCGGTTCGCGGCC
TCGCGCCTGGTGTTCGTGACGCGTGGCGCGGTGTCCGGTGCGGATCTCGCGGGTGCGGCG
GTGTGGGGTCTGGTGCGGTCGGCGCTGTCGGAGCACCCGGGCCGCTTCGGTCTGGTGGAT
CTGGATGACGATGCCGAACTGGCGCTGGTGCCACGGGTGTTGGCGTCGGATGAGCCGCAG
CTGCTGGTGCGCGGTGGTGAGGTGCTGGCGGCGCGGCTGGCCCGGGCGCAGTCCTCGCAC
GCGGTGACCTGGGATCCGTCCGGCACGGTGCTCGTCACCGGTGGCACGGGTGGTCTGGGC
CGTGTGATGGCACGTCACTTGGTGGTGGAACACGGGGTACGGAACCTGCTGCTGGTCAGC
CGCCGTGGGCCCGCCGCCGAAGGTGCCGAAGAGCTGGTGACGGAGCTCCGGCACAGCGGT
GCCGAAGTGGCCGTCGAAGCCTGTGATGTCACCGACGCGGCCGCCGTGGCCGACCTGGTG
GCCCGGCACCGGATCAGCGCTGTGGTGCATACGGCCGGTGTCCTGGATGACGGTGTGGTG
GAGTCGCTGACACCGGAGCGGCTGTCGGCGGTGTTGCGTCCGAAGGTGGATGCGGCCTGG
AACCTGCACGAGGCGACCAGGGATCTGGACCTGGACGCGTTCGTGGTCTTCTCCTCAGTG
GCAGGCACGATCGGGAGCCCCGGTCAGGCCAACTACGCGGCGGGCAACGCCTTCCTGGAT
GCCCTGGCCCACCACCGTCGGGCGGCGGGTCTTCCGGCGGCGTCGCTGGCATGGGGCCCC
TGGTCCCGGGACGGCGGCATGACCGGCACCCTGACCGACGTCGACTCCAGCGCATCGCCC
GGCAGGCATGCCCGAACTCACCCCCGCACAGGGCGTGGCCTCTTCGACGCCGCGCTGGCG
GCCGGTGACGCCCACCTGCTCCCCGTACGCTTCGACTGGGCGTCCCTGCGCGCCCAGGGC
GAGGTGCCACCGCTGTTGCGCGGCCTGATCAGGACCCGTGCCCGGCGCTCGGCGGTCGGC
GGCTCGGCCGCGGCAGCCGGCCTGGTGGGACGCCTGAGCGGACGGGGAACGGTGGAGCGG
CGCGAGGTGCTCCTGGACCTGGTACGGGCCCAGATCGCGGTCGTCCTGGGCCACGCGAAC
CCGGAGACGATCGAGTCCACCCGTGTCTTCCAGGACCTCGGCTTCGACTCCCTGACCGCG
GTCGAACTCCGCAACCGCCTCAACAACGCGACCGGCCTGCGCCTTTCGGCCACCGCCGTC
TTCGACTACCCCACGGCGGACGCGCTCGTCGACTTCCTGCTGGACGAGCTGTTCGGCGCG
CAGGAGGAGGCCGAGCTGCCGGCGCCGGTGCCGTCACCGGCGGGGGCCGCCGACGACCCG
GTCGTGATCGTCGGCATGAGCTGCCGCTACCCGGGCGGCGTCGGCTCGCCCGAGGACCTG
TGGCGCCTGGTGTCGGAGGGCGTGGACGCGGTGTCCGACTTCCCCACCGACCGTGGATGG
GACGTGGAGAGCCTCTACAGCCCCGACCCCGAGGCGCTCGGCACCTCGTACACCCGCTCC
GGTGGATTCCTCCACGAGGCGGCGGAGTTCGACCCCGATTTCTTCGGGATGAGCCCGCGC
GAGGCGCTGGCGACCGACGCCCAGCAGCGGCTGCTGCTGGAGACGACCTGGGAGGCCATC
GAGCGCACGGGCATCGACCCGGCGTCGCTGCGGGGCAGCCGTACGGGCGTCTTCGCGGGC
GTGATGTACACCGACTACGGCGACCTCCTCGTCGGCGACCAGTTCGAGGGCTACCGCAGC
AACGGCAGCGCGGCCAGCATCGCCTCCGGCCGGGTCTCGTACACCTTCGGTTTCGAGGGT
CCGGCGGTCACGGTGGACACGGCATGCTCGTCGTCCCTGGTCGCCCTGCACTGGGCGGCG
CAGTCGCTGCGCTCGGGCGAGTGCTCGCTCGCGGTCGCGGGCGGTGTGACGGTGATGTCC
ACACCGACGACGTTCGTCGAGTTCTCGCGGCAACGCGGACTGTCGGCGGACGGCCGCTGC
AAGGCGTTCGCCGATGCGGCCGACGGCGTCGGCTGGGGCGAGGGCGTCGGCATGCTCGTA
CTGGAGCGTCTGTCGGACGCGCGCCGCAACGGGCACCGGGTGCTCGCGGTGGTGCGCGGC
AGTGCGGTGAACCAGGACGGTGCGTCCAATGGTCTGACGGCGCCGAACGGCCCCGCCCAG
CAGCGGGTGATCCGGCAGGCGCTGGCGAGTGCGGGGCTGTCGGCGGCGGATGTGGACGCG
GTGGAGGCGCACGGTACGGGTACGACGCTGGGCGATCCGATCGAGGCCCAGGCGCTGCTC
GCCACGTATGGCCAGGAGCGACCTGAGGACCGGCCGTTGCTGCTGGGGTCGGTCAAATCC
AACATCGGTCATGCGCAGGCGGCTTCGGGTGTGGCGGGTGTCATCAAGATGGTGCTGGCG
ATGCGGCACGGTGTGCTGCCTCGGACGCTGCATGTGGATGAACCGTCGTCGCATGTCGAC
TGGAGTGCCGGTGCCGTCGAGCTGCTGACCTCCGAGGCCGAGTGGCCGCAGGGCGAGGGG
CCGCGCCGCGCGGGCGTCTCCTCCTTCGGCGTCAGCGGGACGAACGCGCATGTGATCCTG
GAGCAGCCCGGACCGGACGCGGCCGACGCCGCACCGGACGCCACGGTGACCGATCCCGGC
GCGCTGGCATGGGTGCTCTCCGCACGGAACGAAGCGGCCCTGCGCTGCCAGGCGGCGCGC
CTGCTGTCCCTGGTCGCCGGCAGTGACGCGCTGTGCGCGCGGGACATCGGCCACTCGCTG
GTGACCGGGCGGTCGAGCTTCGCCCACCGTGCGGTGGTGTGGGGCCAGGACCGCGACGCA
CTGGTGCGTGCCCTGTCCGCACTCGCGGTGGGCGAGGCCGACGCCGGTCTGGCGGAGGGC
GCGTCCGGCGCGGGGAGGACGGCCTTCCTGTTCTCGGGCCAGGGATCACAACGGCTGGGA
ATGGGATGGGAGTTGTACGCTCGCTACCCGGTGTTCGCGGACGCATTCGACGCCGTGTGC
GCGGCCTTGGACGAGCACCTGGAGCGCCCCCTGCGGGACGTGGTCTGGGGCGAGGACGCG
GAGCTGCTGAACCAGACCGCGTACGCCCAGGCCGGGCTGTTCGCGATCGAGGTGGCGCTG
TACCGGCTGGCGGAATCGTGGGGCATGCGCCCGGACTTCGTGGCGGGGCATTCGATCGGT
GAGGTCGCCGCGGCCCATGTGTCGGGTGTCTTCTCGCTCCCGGATGCCTGTGCGCTGGTG
GCGGCCCGAGGCCGACTGATGCAGCAACTGCCCTCCGGCGGCGCGATGATGGCGATCCGG
GCGACCGAGGACGAGGTCCTTCCGCATCTGGCGGAAGGCGTCTCGCTCGCGGCGGTCAAT
GGCCCGTCGTCGGTCGTGATCTCGGGCGCCGAGGACGCGGTGCTGGCCATCGCGGCGCAC
TTCGCGGGGGAGGGGCGCAAAACCACCCGACTGCGGGTCTCGCATGCCTTCCACTCGCCG
CTCATGGAACCGATGCTGGAGGAATTCCGCGCGGTGGTGACACGGCTGTCCTTCGGCACG
CCGACGATCCCCGTCGTCTCCAACCTGACGGGCCGCCTCGCCGAACCCGAACAGCTCGCG
CACGCCGACTACTGGGTCCGGCACGTCCGCGAGGCAGTGCGCTTCGCGGACGGGATACAG
GCGCTGCGGGCGGAAGGGGTGACGCGGTTCCTGGAGCTCGGCCCGGACGGTGTGCTGTCG
GCGATGGCCCGCGAGTCGGCATCGGACGACGCCGTGCTCGCGCCCGTACTGCGCAGGGAC
CGGCCCGAGGAGACGGCGCTGCTGGGCGCCCTGGCGCAGCTGTACGTCCGGGGTGCGCAC
GTGGACTGGACGGTGCCGTTCGCCGGTTCGGGTGCGCGCTGGGCGGATCTGCCGACGTAC
GCGTTCCAGCACGAGCGGTTCTGGCCGTCGGGCGGTGTGGCACGTCCGGGCGATGTGCGG
TCCGCGGGCCTGGGCTCGGCCGGGCATCCGCTGCTGGGCGCGGCGGTGGAACTGGCGGGC
TCGGGCGGCCTGTTGTTCACGGGCCGGCTGTCGGTGTCCTCGCACCCGTGGCTGGCGGAC
CATGTGGTGCTGGGCTCCGTCCTCGTGCCCGGCACCGCGCTGGTGGAACTGGTGCTGCGG
GCGGCCGACGAGGCCGGCTGTGACCTCCTGGAGGAGCTGACGCTCGCCGCACCGCTGGTG
CTGCCCGCCTCGGGCGCCGCGGTCCAGGTTCAGGTAGCGGTGGGCGAGCCCGATGAGGCG
GGCCGCCGGCCGGTCTCGGTCCATGCACGTGAGGGCGAGGGCCCATGGACGCTGCACGCC
AGTGGTGCGGTGACCTCGGGCGCCGAAGTGCCCCCCTTCGACGCCACCGTATGGCCGCCC
AAGGGCGCGGAGCCCGTGGACGTGGCGGACTGCTACGACGTACTCGCCGATGCCGGGCTC
ACCTACGGCCCGGCCTTCCACGGCCTGCAAGCGGCCTGGAAGCTCGGTGGGGACGTCTAC
GCCGAGGCGAAGCTCCCCGAGAGCACCGACGGCGACGCATACGGTCTGCACCCCGCGCTC
TTCGACGCCGCGCTGCACGCGTCGGCGCTGGGCGGCGCGGAAGCGGGCGGAGTCCCGTTC
TCCTGGGCCGGAGTGTCGCTGCACGCGACCGGCGCCTCGCACCTCCGCGTCCGCATCCGC
GAAGCGGGCGGCGCGCTGTCGGTCGCGATCGCGGACACGTCCGGCGCGCCGGTCGCCTCG
GTGGAGTCGCTGGTGATACGTCCGCTCTCGGCCGGGCAGGTGCAGGCCGCCGACCGTGAC
GCCCTCTTCAAGGCCGACTGGGTCCCCGTACCGCTCACGGACGAACGCGTCGAGCCGGGC
ACCGGCCCGGAGGGCGAGCCGCTGCGGACGTACGCGGATCTGGATTCCCTGGAGGGCGCG
GCCGTGCCCGGGACGGTCCTGGTCGCGCCGCCTTCCGGCGCTGCCGGGACGGTGGAGTCC
GTACACGCCGCGACCGTCTGGGCGCTGGAGATGGTGCAGGCGTGGCTGGCCGACGACCGG
TTCGCCACCTCGCGACTGGTGTTCGTCACCCGCGGCGCGGCCTTCGGCGCGGATCTTGCG
GCGGCCGCCGTCCGGGGCCTGGTGCGCTCGGCACAGTCGGAGAACCCGGGCCGCTTCGGC
CTGGTGGACATGGACGGCGACGCCGATACGACCGTACCGGCGCAAGCGCTCGCGACCGAC
GAGCCCGAACTGCTGGTGCGTGGTGGTGAGGTGCTGGCGGCCCGGCTGGTCCGGGCGCAG
TCCTCGCACACGGTGACGTGGGATCCGTCCGGTACGGTCCTGATCACCGGCGGGACCGGT
GGGCTGGGCCGTAGTGTCGCCCGGCACTTGGTGAGCGAGCACGGGGTGCGCAGTCTGCTG
CTGGTCAGCCGCCGTGGTCCCGCGGCCGAGGGTGCCGGGGAGTTGGTGGCCGAACTCAGG
GGCAGTGGCGCCGAGGTGGTCATCGAGGCTTGTGATGTGACCGATGCGGTGGCGGTGGCC
GATCTGGTGGCTCGGCATCGGATCAGTGCTGTGGTGCATACGGCCGGTGTTCTGGATGAC
GGTGTGGTGGAGTCGCTGACGCCGGAGCGGCTTGCGGTGGTGTTGCGTCCGAAGGTGGAT
GCGGCCTGGAACCTGCACGAGGCGACCAGGGGTCTGGATCTGGATGCGTTTGTGGTGTTC
TCGTCCGTGGCAGGCACTTTCGGCAGTGCGGGTCAGGCCAATTACGCGGCGGGTAATGCT
TTCCTGGACGCGCTGGCGTATCACCGTCGGGCGGTGGGTCTGCCGGCGGTGTCGCTGGCG
TGGGGCCCTTGGTCGCAGGACGGTGGTATGACCGGCACCTTGAGCGACGCCGATGTCCAG
CGCATCGCCCGGCAGGGCATGCCGCCGCTGACCGTCGAGGAGGGTCTGGCCCTCTTCGAC
GCCGCGCTCGGCAGCGCCGAACCCATGGCACTCCCGGTCCGCCTGGACCTGGCGGCGCTG
CGGGCACAAGGCGAGCCCCAGCCACTGCTGCGCGGCCTCATCCGGACGAGGACCCGCCGG
TCCGGCGCCGCCGCGGCATCCGGCATCGCGCAGCGCCTTGCCGGGCTGTCCACGGCGGAG
CGGCGCGAGGCGCTGCTCGATGTCGTACGGGCCCAGATCGCGACGGTCCTGGGCCACGCC
GGCCCGGAAACGATCGCCCCTGACCGGGCCTTCCAGGACCTCGGCCTCGACTCCCTGACG
GCGATCGAACTCCGTAACCTGCTCGGCAAGGCCACCGGGCTGCGGCTCCCGGCAACGACC
GTGTTCGACTACCCGACGGTGGATGCCCTGGCCGCCCACCTCTTGGACGAACTGTTCGGC
GCGGAGACGGGGACCGCGACGGAGACGCCCCTCCCGGTGCCCGGCCTGCCGTCCCTGGCG
GACGATCCGGTCGTGATCGTCGGCATGAGCTGCCGCTTCCCCGGCGGCGTCGCCTCGCCG
GAGGACCTGTGGCGCCTGGTGGCGGACGGCGTGGACGCCGTCTCCGCCTTCCCGACCGAC
CGGGGCTGGGAGATCGACGACACCTACGACCCCGAGCGGGAGGGCGCCATCGCCACCCGT
TCCGGTGGATTCCTCCACGACGCGGCGGAGTTCGACCCCGAGTTCTTCGGGATGAGCCCG
CGCGAGGCCCTGACCACCGACGCCCAGCAGCGGCTGTTGCTGGAGACGACCTGGGAGGCG
CTGGAGCGCGCCGGTATGGACCCGGCCACGCTCCGCGGCAGCCGCACGGGTGTCTTCGCC
GGCGTGATGTACCACGACTACTCGACGCTGCTCTCCGGGCGCGAGTTCGAGGGCTACCAG
GGCAGCGGCAGCGCAGGCAGTGTGGCCTCGGGCCGGGTCTCGTACACCTTCGGTTTCGAG
GGTCCGGCGGTCACGGTGGACACGGCGTGCTCGTCGTCCCTGGTCGCCCTGCACCTGGCA
GCACAGTCGCTGCGCTCGGGCGAGTGCTCGCTGGCGCTCGCGGGCGGTGTGACGGTGATG
TCCACACCGCTGACCTTCGTGGAGTTCTCCCGCCAGGGCGGACTGTCGGCGGACGGCCGC
TGCAAGGCGTTCGCCGATGCGGCCGACGGCGTCGGCTGGGCCGAAGGCGCCGGAATCCTG
GTGCTGGAGCGTCTGTCGGACGCCCGCCGCAACGGGCACCGCATCCTCGCGACGGTGCGC
GGCAGTGCGGTGAACCAGGACGGTGCGTCCAATGGTCTGACGGCGCCGAACGGTCCCGCC
CAGCAGCGGGTGATCCGGCAGGCGCTGGCGAGTGCGGGGCTGTCGGCGGCGGATGTGGAC
GCGGTGGAGGCGCACGGTACGGGTACGACGCTGGGCGATCCGATCGAGGCCCAGGCGCTG
CTCGCGACGTATGGCCAGGAGCGGCCGGAGGACCGGCCGTTGCTGCTCGGCTCCGTGAAG
TCCAACATCGGTCACGCGCAAGCGGCTTCGGGTGTTGCCGGTGTCATCAAGATGGTGCTG
GCGATGCGGCACGGTGTGCTGCCTCGGACGCTGCATGTCGACGAGCCGTCGTCGCATGTC
GACTGGAGCGCCGGTGCCGTCGAGCTGCTGACCTCCGAGGCCGAGTGGCCGCAGGGCGAG
GGGCCGCGCCGCGCGGGCGTCTCCTCCTTCGGCATCAGTGGGACGAACGCGCATGTGATC
CTGGAGCAGCCCGAACCGGTCGCGGCGGAAACGGAATCGATCACGCCCGACACCGCACCG
GACGCCGCCGAGGACGAGGCGGCCGATTCCGGGACGCCGGTGCCGGCACTGCTGTCCGGC
AGGAGCGCATCGGCGCTGCGGGCCCAGGCAGCACGACTGCTGTCCCGACTCGACGGCGAT
CCGGGGCCGCGGATCACTGACGTCGCCTACTCCCTCGCGACCGGCCGTTCGGCCTTCCCG
CACCGCGCGGTGATCCTCGCCGCGAACCGAGCGGACCTGCTGCACTCGCTGTCCGCCCTG
GCCGAGGGCCACACCGAGGCGCCGGCCGTAGTCGCACAGGACCGAGCCCGCTCGGGCAAG
CTGGCCTTCCTGTTCTCGGGGCAGGGATCGCAACGCCTGGGCATGGGACGGGAGTTGTAC
GGTCGCTACCCGGCGTTCGCCGAGGCCCTCGACGCGGTGTGCGCCGCCCTGGACGCCCAC
CTGGACCGTCCCCTGCGGGACGTCATCTGGGGCGAGGACGCGGAACTGCTGAACCGGACC
GGGTACGCCCAGACAGGGCTGTTCGCCATCGAGGTGGCCCTGTTCCGCCTGCTGGAGTCG
TGGGGCGTACGCCCGGACCACCTGCTGGGGCACTCCATCGGAGAAATCGCCGCGGCCCAT
GTGGCGGGCGTCCTCTCCCTCCCGGACGCCTGTGCGCTGGTGGCGGCCCGAGGTCGGCTG
ATGCAGCAACTGCCGTCCGGCGGCGCGATGATGGCGATCCGGGCGACCGAGGACGAGGTC
CTTCCGCATCTGGCGGAAGGCGTCTCGCTCGCGGCGGTCAACGGGCCGTCGTCGGTCGTG
GTCTCCGGCGCCGAGGACGAGGTACTCGCCCTCGCGGCGCACTTCGAGGAAGAGGGACGC
AAGACCACCCGACTGCGGGTCTCGCACGCCTTCCACTCCCCGCTCATGGAACCGATGCTG
GCCGACTTCCGGGCCGTCGCCGACGGCATGACCTACGCCGCGCCGCGCATCCCCGTGGTC
TCCAACGTCACCGGCCGGCCCGCCACCGCGGAAGAGCTGTGCTGCGCCGAGTACTGGGTC
GGCCACGTACGCGAGGCCGTACGGTTCGCCGACGGGGTCGGCGCGCTCCGCGAGCAGGGT
GTGACGACGTTCCTGGAACTCGGCCCCGACGGCTCTCTCTCCGCGCTCGCCGCCGAATCC
GCCGCCGACGACTCCGTACTGGCCCCCGTACTGCGCAAGAACCGCCCCGAGGCACCGGCA
CTGCTCACGGCCCTGGCACGACTGCACGCCCAGGGCACGCCGGTCGACTGGTCCGCCGCC
TTCGCCGGTACGGGTGCGCGGTGGGTGGACCTGCCGACGTACGCATTCCAGCACGAGCGG
TTCTGGCCGTCGGGCGGGGCGGCGCGCGCAGGCGATGTGCGGTCCGCGGGCCTGGGCTCG
GCCGGGCACCCGCTGCTGGGTGCTGCGGTGGAACTGGCGGGCTCCGGCGGGCGGTTGCTC
ACCGGGCGGCTGTCCCTGTCCTCGCACCCGTGGCTGGCGGATCACGTGGTGCTGGGCTCC
GTACTGGTGCCCGGCACGGCGCTCATGGAACTGGTGCTGCGGGCGGCCGACGAGGTGGAC
TGCGCCGCGGTGGACGAACTCACGCTCGCCGCGCCACTGGTCCTGCCCGCCTCGGGCGCC
GCGATCCAGGTACAGGTATGGGTGGGCGAGCCCGATGAGGCGGGCCGCCGGCCGGTCTCG
GTCCATGCACGCGAGGGCGAGGGCCCATGGACGCTGCACGCCGACGGCGCCCTGGCCCCG
GCGGCCGAGACGGTGCCGTTCGATACCGCGATATGGCCCCCGCAGGGTGCCGAGCACCTG
GACGCGGCGGGCTGTTACGAGCGGTTCGCGGACGCCGGATTCGCGTACGGCCCGGTGTTC
CAGGGCCTGCGGGCGGCCTGGAAGCTCGGCGAGGACATCTACGCCGAGGTCGCACTCCCC
GAAGGCACGGACGGCAACGCCTACGGCCTGCACCCCGCACTCTTCGACGCCGCGCTGCAC
GCAGCGCTCCTGGGCGGCGAGGGAACGGACGAAGCCGCGGTCCCCTTCTCCTGGAACGGG
GTGACGCTCCACGCCACCGGCGCTTCCCGGGTGAGGGTACGCATCCGTCCCACCGAAGGC
GGTACGTCGATAGCCCTCGTGGACACCGCCGGTGCGCCGGTCGCCTCGGTGCGATCCCTG
ACCGCACGTCCGATCACCGCCGGGCAGTTGCAGACCGGTGACCGCGATTCCCTTTTCCAG
GTCGACTGGACCACCCTCCACCTCACGGACGAGCGCGCGAACTCCCTCGCCCTGCTCGGC
AAGGACACCGAGGGCATCCTCGACACACTCTCCCTCCAGCCCCACGCGGACCTCGACGAC
CTCGCGGCGACGGGCGTCCACGACACCGTGCTCGCCCCGCTGCCCACCCGGACCGCCGGA
ACGGTGGAATCCGTCCATGCCGCCACGACAGGGGCACTGGCCCTGATCCGGTCCTGGCTG
GCCGACGACCGGTTCGCCGCCTCGCGCCTGGTGTTCGTGACGCGTGGCGCGGTGTCCGGC
ACGGATCTCGCGGGTGCGTCGGTGTGGGGCCTGGTGCGGTCGGCGTTGTTGGAGCACCCG
GGCCGCTTCGGTCTGGTGGACGTGGACGTGGACCAAGACGCTGAAGTGCCGCTTGTGCCA
AGGGCGTTGGCGTCGGATGAACCGCAGGTGTTGGTGCGTGGTGGTGAGGTGCTGGCGGCC
CGGCTGGTCCGGGCGCAGTCCTCGGACACGGTGACGTGGGATCCGTCCGGTACGGTCCTG
ATCACCGGCGGGACCGGTGGGCTGGGTCGTAGTGTCGCCCGGCACTTGGTGAGCGAGCAC
GGGGTGCGCAGTCTGCTGCTGGTCAGCCGCCGTGGTCCCGCCGCCGAGGGTGTCGATGCA
CTCGTTGCCGAACTTGCCGAGTGCGGCGCGCAGGTCACCGTCGAGGCTTGTGATGTGACT
GACGCGGTGGCGGTGGCCGATCTGGTGGCTCGGCATCGGATCAGTGCTGTGGTGCATACG
GCCGGTGTTCTGGATGACGGTGTGGTGGAGTCGCTGACGCCGGAGCGGCTGTCGGCGGTG
CTGCGTCCGAAGGTGGATGCGGCCTGGAACCTGCACGAGGCGACCAGGGGTCTGGATCTG
GATGCGTTTGTGGTGTTCTCGTCCGTGGCAGGCACCTTCGGCAGTGCGGGTCAGGCCAAT
TACGCGGCGGGTAATGCTTTCCTGGACGCGCTGGCGTATCACCGTCGGGCGGTGGGTTTG
CCGGCGGTGTCGCTGGCGTGGGGCCCTTGGTCGCAGGACGGTGGTATGACCGGCACCTTG
AGCGACGCCGATGTCCAGCGCATCGCCCGGCAGGGCATGCCGCCGCTGACCGTCGAGGAG
GGTCTGGCCCTCTTCGACGCCGCGCTCGGCAGCGCCGAACCCATGGCACTCCCCGTCCGC
CTGGACCTCGCGGCCCTACGGGCACAAGGCGAGCCCCAGCCACTGCTGCGCGGCCTCATC
CGGACCCCGGGTCGACGCACGGCGGCGGCCGCGACGGAGGGCGACACCGCTGCCGCCTTC
GCCGGGCGCCTGACCGGGCTGTCGGCGGCAGAAGGACGCGAGGTCGTACTGGGCGCCGTA
CGCAGCCAGATCGCGGGGGTCCTCGGACACGCCGAAGCCACGGAAATCGACCAGGACCGC
GCCTTCCTGGACCTCGGATTCGACTCCCTCACCGCGGTCGAACTCCGCAACCGCCTGGGC
GCCGTCACCGGAATCCGCCTGCCGGCGACCCTGCTCTTCGACTACCCGACGCCGGCAGAA
CTCGTCGCCCACCTCCATGCCCGGATCGCACCGGAGCCGACCGTCGGCCCGGAGGCGCTC
CTGGGCGAACTCGAAAGGATGGAGAAGTCCTTCGGCGGACTCGACCTCACGGAGGAGATG
CACGAACAGATAGCCGGCCGTCTGGAAGTCCTCCGGGCCAAGTGGGACGCCCTGCGGGAC
ACGGCAGCGGCAGCCGGGCACGACGGTTCCCCGTCCGACGAGGACTTCGACTTCGAGTCC
GCCTCCGACGACGAGGTCTTCGACCTCCTCGACAACGAACTCGGCCTGTCCTGA
[1] KS34..410
[1] AT565..885
[1] malonyl-CoA758..762
[1] KR1191..1375
[1] ACP1470..1540
[2] KS1561..1933
[2] AT2104..2414
[2] malonyl-CoA2289..2293
[2] DH2463..2629
[2] KR2928..3104
[2] ACP3204..3274
[3] KS3299..3673
[3] AT3829..4140
[3] malonyl-CoA4015..4019
[3] DH4189..4349
[3] KR4631..4807
[3] ACP4905..4975
[4] KS5002..5374
[4] AT5543..5854
[4] malonyl-CoA5729..5733
[4] DH5903..6063
[4] KR6357..6533
[4] ACP6636..6706
[1] KS100..1230
[1] AT1693..2655
[1] malonyl-CoA2272..2286
[1] KR3571..4125
[1] ACP4408..4620
[2] KS4681..5799
[2] AT6310..7242
[2] malonyl-CoA6865..6879
[2] DH7387..7887
[2] KR8782..9312
[2] ACP9610..9822
[3] KS9895..11019
[3] AT11485..12420
[3] malonyl-CoA12043..12057
[3] DH12565..13047
[3] KR13891..14421
[3] ACP14713..14925
[4] KS15004..16122
[4] AT16627..17562
[4] malonyl-CoA17185..17199
[4] DH17707..18189
[4] KR19069..19599
[4] ACP19906..20118

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [564-687]  G3DSA:3.40.366.10 [757-866]  G3DSA:3.40.366.10 [2100-2224]  G3DSA:3.40.366.10 [2289-2404]  G3DSA:3.40.366.10 [3822-3950]  G3DSA:3.40.366.10 [4015-4129]  G3DSA:3.40.366.10 [5532-5664]  G3DSA:3.40.366.10 [5729-5844]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR002198 Short-chain dehydrogenase/reductase SDR (Family)
 [1191-1357]  5.00000000000001e-63 PF00106 [4631-4793]  1.80000000000002e-59 PF00106
PF00106   adh_short
IPR006162 Phosphopantetheine attachment site (PTM)
 [3232-3247]  PS00012 [4933-4948]  PS00012 [6664-6679]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [1470-1543]  4.89999999999996e-93 G3DSA:1.10.1200.10 [3201-3277]  4.89999999999996e-93 G3DSA:1.10.1200.10 [4902-4978]  4.89999999999996e-93 G3DSA:1.10.1200.10 [6635-6710]  4.89999999999996e-93 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [1474-1538]  7.7e-12 PF00550 [3207-3273]  1.2e-11 PF00550 [4908-4974]  2.3e-14 PF00550 [6640-6705]  5e-14 PF00550
PF00550   PP-binding
 [1463-1578]  3.39999972437717e-28 SSF47336 [3197-3315]  1.39999892049878e-29 SSF47336 [4898-5018]  1.20000117458134e-30 SSF47336 [6627-6713]  8.40001703163685e-25 SSF47336
SSF47336   ACP_like
 [1470-1540]  PS50075 [3204-3274]  PS50075 [4905-4975]  PS50075 [6636-6706]  PS50075
PS50075   ACP_DOMAIN
IPR013968 Polyketide synthase, KR (Domain)
 [2928-3102]  2.30000000000004e-65 PF08659 [6357-6532]  3.29999999999997e-63 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [34-284]  3.99999999999998e-96 PF00109 [1561-1811]  4.59999999999996e-98 PF00109 [3299-3547]  1.69999999999998e-97 PF00109 [5002-5248]  5.40000000000002e-95 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [292-410]  3.29999999999997e-46 PF02801 [1819-1933]  1.6e-45 PF02801 [3555-3673]  6.29999999999998e-49 PF02801 [5256-5374]  1.70000000000001e-48 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [565-885]  2.1e-64 PF00698 [2104-2414]  6.59999999999996e-62 PF00698 [3829-4140]  4.39999999999997e-64 PF00698 [5543-5854]  2.20000000000002e-66 PF00698
PF00698   Acyl_transf_1
IPR015083 Polyketide synthase, docking (Domain)
 [2-34]  9.89999630839717e-08 SSF101173
SSF101173   Polyketide_synth_docking
 [1-27]  2.7e-13 PF08990
PF08990   Docking
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [562-858]  4.89996723796399e-70 SSF52151 [2102-2406]  7.90003079443025e-74 SSF52151 [3825-4132]  8.10000734323267e-76 SSF52151 [5540-5828]  4.60000044330866e-76 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [692-757]  4.80000830876748e-20 SSF55048 [2226-2288]  5.59999989417023e-17 SSF55048 [3952-4014]  3.69999438558106e-17 SSF55048 [5666-5728]  2.90000354677981e-17 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [37-296]  G3DSA:3.40.47.10 [297-462]  G3DSA:3.40.47.10 [1561-1822]  G3DSA:3.40.47.10 [1823-1982]  G3DSA:3.40.47.10 [3301-3559]  G3DSA:3.40.47.10 [3560-3723]  G3DSA:3.40.47.10 [5001-5258]  G3DSA:3.40.47.10 [5260-5425]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [26-408]  1.09999184471405e-102 SSF53901 [1554-1982]  3.19999899046355e-98 SSF53901 [3298-3725]  5.89998809405682e-105 SSF53901 [4994-5426]  3.99998544139406e-106 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1191-1376]  G3DSA:3.40.50.720 [2929-3110]  G3DSA:3.40.50.720 [4632-4813]  G3DSA:3.40.50.720 [6358-6539]  G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [197-213]  PS00606 [1724-1740]  PS00606 [3460-3476]  PS00606 [5161-5177]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [566-866]  SM00827 [2104-2395]  5.00005301775164e-129 SM00827 [3830-4121]  2.60002801425225e-129 SM00827 [5544-5835]  SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [1471-1543]  5.00000909915354e-33 SM00823 [3205-3277]  4.30000170645869e-35 SM00823 [4906-4978]  3.70001063537244e-35 SM00823 [6637-6709]  7.60000011115808e-35 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [2463-2629]  1.3000049540733e-82 SM00826 [4189-4349]  4.99996518094122e-86 SM00826 [5903-6063]  1.39999277195148e-83 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [36-462]  SM00825 [1564-1985]  SM00825 [3301-3725]  SM00825 [5004-5426]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [1191-1375]  1.3000049540733e-64 SM00822 [2928-3104]  6.99997659850661e-64 SM00822 [4631-4807]  1e-61 SM00822 [6357-6533]  8.59998776239544e-64 SM00822
SM00822   PKS_KR
SignalP No significant hit
TMHMM No significant hit