
CDS information : MUP_00320


Location

Organism:
Strain: Agy99
Entry name: Mycolactone
Contig:
Start / Stop / Direction: 72,446 / 30,054 / - [in whole cluster]
                          72,446 / 30,054 / - [in contig]
Location: complement(30054..72446) [in whole cluster]
          complement(30054..72446) [in contig]
Type: CDS
Length: 42,393 bp (14,130 aa)
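
The stated length can be cross-checked against the coordinates above. The following is a minimal sketch, assuming the coordinates are inclusive and that the final codon is a stop codon that is not counted in the amino-acid length; the function name `cds_length` is hypothetical and not part of the database.

```python
def cds_length(start: int, stop: int) -> tuple:
    """Return (length in bp, length in aa) for an inclusive coordinate pair."""
    lo, hi = sorted((start, stop))
    bp = hi - lo + 1              # both endpoints are counted
    if bp % 3:
        raise ValueError("CDS length is not a multiple of 3")
    aa = bp // 3 - 1              # assumption: the last codon is a stop codon
    return bp, aa


# Coordinates from this record: complement(30054..72446)
print(cds_length(72446, 30054))   # (42393, 14130)
```

The strand ("-") does not affect the length, which is why the start and stop are simply sorted before subtracting.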

Annotation

Category: 1.1 PKS
Product: polyketide synthase
Product (GenBank): Type I modular polyketide synthase
Gene:
Gene (GenBank): mlsB
EC number:
Keyword:
Note:
Note (GenBank):
  • MUP032c, mlsB, len: 14130 aa. Type I modular polyketide synthase, composed of a loading module and seven extender modules; required for the synthesis of the mycolactone side-chain in M. ulcerans (see Stinear et al., 2003). Similar to the modules in several other bacterial polyketide synthases e.g. Q93NX9 AmphI from Streptomyces nodosus (9510 aa), fasta scores: opt: 10163, E(): 0, (41.418% identity in 10394 aa overlap); Q9ALM3 Polyketide synthase extender modules 5-7 from Saccharopolyspora spinosa (4928 aa), fasta scores: opt: 8980, E(): 0, (46.330% identity in 4945 aa overlap); Q93HJ2 Modular polyketide synthase from Streptomyces avermitilis (4685 aa), fasta scores: opt: 5558, E(): 0, (44.649% identity in 4934 aa overlap). Contains 8 x Pfam matches to entry PF00109 ketoacyl-synt: Beta-ketoacyl synthase, N-terminal domain and 8 x Pfam matches to entry PF02801 ketoacyl-synt_C: Beta-ketoacyl synthase, C-terminal domain. Contains 8 x Pfam matches to entry PF00550 pp-binding: Phosphopantetheine attachment site and 8 x Pfam matches to entry PF00698 Acyl_transf: Acyl transferase domain.
Reference
  ACC:
  PmId: [14736915] Giant plasmid-encoded polyketide synthases produce the macrolide toxin of Mycobacterium ulcerans. (Proc Natl Acad Sci U S A, 2004)

PKS/NRPS Module

B0 malonyl-CoA
1 malonyl-CoA
2 malonyl-CoA
3 methylmalonyl-CoA
not conserved YASHS(A->P)
4 malonyl-CoA
5 methylmalonyl-CoA
not conserved YASHS(A->P)
6 methylmalonyl-CoA
not conserved YASHS(A->P)
7 malonyl-CoA
KSQ 19..390
AT 567..889
dh 932..1105
KR 1444..1624
ACP 1724..1794
KS 1817..2190
AT 2359..2672
KR 2985..3161
ACP 3263..3333
KS 3356..3729
AT 3898..4211
KR 4524..4700
ACP 4802..4872
KS 4895..5268
AT 5444..5766
DH 5809..5982
KR 6321..6501
ACP 6601..6671
KS 6694..7067
AT 7243..7565
DH 7608..7781
KR 8120..8300
ACP 8400..8470
KS 8493..8866
AT 9042..9364
DH 9407..9580
KR 9919..10099
ACP 10199..10269
KS 10292..10665
AT 10841..11163
DH 11206..11379
KR 11718..11898
ACP 11998..12068
KS 12091..12464
AT 12640..12962
DH 13005..13178
KR 13517..13697
ACP 13797..13867
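
The domain coordinates listed above can be grouped into modules programmatically. The sketch below is illustrative only: it assumes that each module begins at a ketosynthase domain (a name starting with "KS"), which is a convention chosen here and not something the record states. The names `DOMAINS`, `parse_domains` and `group_modules` are hypothetical.

```python
import re

# Domain entries copied from this record (amino-acid positions).
DOMAINS = """
KSQ19..390 AT567..889 dh932..1105 KR1444..1624 ACP1724..1794
KS1817..2190 AT2359..2672 KR2985..3161 ACP3263..3333
KS3356..3729 AT3898..4211 KR4524..4700 ACP4802..4872
KS4895..5268 AT5444..5766 DH5809..5982 KR6321..6501 ACP6601..6671
KS6694..7067 AT7243..7565 DH7608..7781 KR8120..8300 ACP8400..8470
KS8493..8866 AT9042..9364 DH9407..9580 KR9919..10099 ACP10199..10269
KS10292..10665 AT10841..11163 DH11206..11379 KR11718..11898 ACP11998..12068
KS12091..12464 AT12640..12962 DH13005..13178 KR13517..13697 ACP13797..13867
""".split()


def parse_domains(tokens):
    """Turn 'KS1817..2190' (or 'KS 1817..2190') into (name, start, end)."""
    parsed = []
    for token in tokens:
        m = re.fullmatch(r"([A-Za-z]+)\s*(\d+)\.\.(\d+)", token)
        if m is None:
            raise ValueError(f"unrecognised domain entry: {token!r}")
        parsed.append((m.group(1), int(m.group(2)), int(m.group(3))))
    return parsed


def group_modules(domains):
    """Start a new module at every KS-type domain (assumption, see lead-in)."""
    modules = []
    for name, start, end in domains:
        if name.upper().startswith("KS") or not modules:
            modules.append([])
        modules[-1].append((name, start, end))
    return modules


modules = group_modules(parse_domains(DOMAINS))
for index, module in enumerate(modules):
    print(index, "-".join(name for name, _, _ in module))
```

Under this grouping the listing yields eight modules, which agrees with the GenBank note describing a loading module followed by seven extender modules.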

Sequence

>polyketide synthase [Type I modular polyketide synthase]
MIFGDAHQNCRGGRVLGDAVAVVGMSCRVPGASDPDALWALLRDGISVVDEIPSARWNLD
GLVAHRLTDEQRSALRHGAFLDDVEGFDAAFFGINPSEAGSMDPQQRLMLELTWAALEDA
RIVPEHLSGSSSGVFTGAMSDDYTTAVTYRAAMTAHTFAGTHRSLIANRVSYTLGLRGPS
LVIDTGQSSSLVAVHVAMESLRREETSLAIAGGIHLNLSLAAALSAAHFGALSPDGRCYT
FDARANGYVRGEGGGVVVLKRLNDALADGNHIYCVIRGSSVNNDGATQDLTAPGVDGQRQ
ALLQAYERAEIDPSEVQYVELHGTGTRLGDPTEAHSLHSVFGTSTVPRSPLLVGSIKTNI
GHLEGAAGILGLIKTALAVHHRQLPPSLNYTVPNPKIPLEQLGLRVQTTLSEWPDLDKPL
TAGVSSFSMGGTNAHLILQQPPTPDTTQTPNPTTGSDPAVGSDPAVGVLVWPLSARSAPG
LSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAA
LHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHALDEVAA
ALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLFTHAGIHPDYLLGH
SIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTMLALQASEAEVQPLLEGLDHAV
SIAAINGATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFHSPHMDPILEQFRQIAA
QLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELS
PHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVLYCQAR
PLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPW
LNEHAVESAVLFPNTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTD
DMGRQSLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPPPGTA
AIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALF
DAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTRTGADA
ITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDA
LRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSS
RIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRSAQN
EHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVV
DPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVT
ITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAW
QLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGY
WQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINTHTLARH
ARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHT
PESISPATAFKDLGIDSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLEQIPGI
GALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDV
EGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALET
AGIPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGATSVMSGRIAYTLGLEGPAI
TVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEFSRQRGLAPDGRCKAF
AATADGTGWGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRV
INQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIG
HTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRT
AAVSSFGISGTNAHLILQQPPTPNPTQTPEDCSPAQSPCATITDAGTGLSFVPWVISAKS
AEALSAQASRLLTRLDDDPVVDAIDLGWSLIATRSMFEHRAVVVGADRHQLQRGLAELAS
GNLGADVVVGRARAAGETVMVFPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRL
PLRQVMWGDDEGLLNSTEFAQPSLFAVEVALFALLRFWGVVPDYVIGHSVGELAAAQVAG
VLSLQDAAKLVSARGRLMQALPAGGAMVAVAASQHEVEPLLVEGVDIAALNAPGSVVISG
DQAAVRLIANRLADRGYRAHELAVSHAFHSSLMEPMLEEFARLASEIVVEQPQIPLISNV
TGQLANADYGSAGYWVDHIRRPVRFADSVASLEAMGASCFIEVGPASGLGAAIEQSLKSA
EPTVSVSALSTDKPESVAVLRAAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFW
LDANRIGQGDPASQPQAQNVESRFWEAVEREDVDGLADSIGVTASAMQTVLPALSSWRRA
ERTQSELDSWRYQVTWLSSPATPSSITLSGIWLLIVPSELAKTDPVIGCAAALEAHGALV
TIITIFEPDFNRSLMGASLKDIGSHISGVISFLGIHGSEFSDSGAVKTLNLVQAMGDVHL
DVPLWCLTQGAVSISADDLIRCSSAALVWGLGRVVALEHPGSWGGLVDLPESPDDAAWER
LCALLAQPTDEDQFAIRPSGVFLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHV
ARWLAHKYESVDLLLTSRRGMAADGATELVDDLRTAGASVTVHACDVTDRTSVEAAIAGK
SLDAVFHLAGRHQPTLLTELEDESFSDELAPKVHGAQVLSDITSNLTLSAFVMFSSVAGI
WGGKSQGAYAAANAFLDSLAEKRRTLGLPATSVAWGLWAGGGMGDRPSASGLNLIGLKSM
SADLAVQALSDAIDRPQATLTVASVNWDRFYPTFALARPRPFLHEITEVMAYRESMRSSS
ASTATLLTSKLAGLTATEQRAVTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTAL
ELRDHLQTATGLNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAV
VGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAF
LDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAWA
QSYGATNSDDAEGYAMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLR
NNESQLALAGGVTVMSTPAVFTEFSRQRGLAPDGRCKAFAATADGTGWGEGAAVLVLERL
SEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAH
GTGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHA
TLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPP
TPNPTQTPEDCSPAQSPCATITDAGTGLSFVPWVISAKSAEALSAQASRLLTRLDDDPVV
DAIDLGWSLIATRSMFEHRAVVVGADRHQLQRGLAELASGNLGADVVVGRARAAGETVMV
FPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRLPLRQVMWGDDEGLLNSTEFAQ
PSLFAVEVALFALLRFWGVVPDYVIGHSVGELAAAQVAGVLSLQDAAKLVSARGRLMQAL
PAGGAMVAVAASQHEVEPLLVEGVDIAALNAPGSVVISGDQAAVRLIANRLADRGYRAHE
LAVSHAFHSSLMEPMLEEFARLASEIVVEQPQIPLISNVTGQLANADYGSAGYWVDHIRR
PVRFADSVASLEAMGASCFIEVGPASGLGAAIEQSLKSAEPTVSVSALSTDKPESVAVLR
AAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFWLDANRIGQGDPASQPQAQNVE
SRFWEAVEREDVDGLADSIGVTASAMQTVLPALSSWRRAERTQSELDSWRYQVTWLSSPA
TPSSITLSGIWLLIVPSELAKTDPVIGCAAALEAHGALVTIITIFEPDFNRSLMGASLKD
IGSHISGVISFLGIHGSEFSDSGAVKTLNLVQAMGDVHLDVPLWCLTQGAVSISADDLIR
CSSAALVWGLGRVVALEHPGSWGGLVDLPESPDDAAWERLCALLAQPTDEDQFAIRPSGV
FLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHVARWLAHKYESVDLLLTSRRGM
AADGATELVDDLRTAGASVTVHACDVTDRTSVEAAIAGKSLDAVFHLAGRHQPTLLTELE
DESFSDELAPKVHGAQVLSDITSNLTLSAFVMFSSVAGIWGGKSQGAYAAANAFLDSLAE
KRRTLGLPATSVAWGLWAGGGMGDRPSASGLNLIGLKSMSADLAVQALSDAIDRPQATLT
VASVNWDRFYPTFALARPRPFLHEITEVMAYRESMRSSSASTATLLTSKLAGLTATEQRA
VTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTALELRDHLQTATGLNLSSTLIFD
HPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLV
IAGRDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARA
MDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGAI
SVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALTGGVTVMSTPAIF
TEFSRQRGLAPDGRCKAFAATADGTGWGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQ
DGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGH
HHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGT
VRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSD
PAVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTS
IEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGAD
LYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLERVDVVQPVLFSVMVSLAAL
WRWAGITPDAVIGHSQGEIAAAHVAGALTLPEAAAVVALRSRVLTDLAGAGAMASVLSPE
EPLTQLLARWDGKITVAAVNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSP
YMEHIRHQFLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHD
TVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQL
HCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAE
NRGWVFTGRISPRTQPWLNEHAVESAVLFPGTGFVELALHVADRAGYSSVNELIVHTPLL
LAGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTD
HNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVE
LPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHA
THATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLS
WPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVP
SNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPV
PDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIP
RLTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAH
GATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTEL
TGDQLDQVLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALA
DYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATG
QPVSIPAPINTHTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQT
LATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLPPTLIFDH
PTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVI
AGRDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAM
DPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGSTS
VMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFT
EFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQD
GASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGHH
HTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTV
RLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNPTTGSDPAVGSDP
AVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSI
EHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADL
YRQFPVFAHALDEVAAALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALH
RLFTHAGIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTMLALQA
SEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFH
SPHMDPILEQFRQIAAQLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDT
VAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLH
CHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAEN
RGWVFTGRISPRTQPWLNEHAVESAVLFPGTGFVELALHVADRAGYSSVNELIVHTPLLL
AGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDH
NHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVEL
PEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHAT
HATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSW
PPHPDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPS
NEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVP
DLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPR
LTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHG
ATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELT
GDQLDQVLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALAD
YRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQ
PVSIPAPINTHTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTL
ATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLPPTLIFDHP
TPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIA
GRDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMD
PQQRLLLEVCWEALETAGIPAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSV
MSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTE
FSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQDG
ASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGHHH
TPDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVR
LLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPA
VGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIE
HHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLY
RQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLERVDVVQPVLFSVMVSLAALWR
WAGITPDAVIGHSQGEIAAAHVAGALTLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEP
LTQLLARWDGKITVAAVNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYM
EHIRHQFLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTV
AALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHC
HGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENR
GWVFTGRISPRTQPWLNEHAVESAVLFPGTGFVELALHVADRAGYSSVNELIVHTPLLLA
GHDTADLQITVTDTDDMGRQSLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHN
HLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELP
EDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATH
ATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWP
PHPDTTTDTDTDTDALRYRVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSN
EELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPD
LAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRL
TRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGA
TDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTG
DQLDQVLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADY
RHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQP
VSIPAPINTHTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLA
TLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLPPTLIFDHPT
PHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAG
RDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDP
QQRLLLEVCWEALETAGIPAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVM
SGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEF
SRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGA
SNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGHHHT
PDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRL
LTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAV
GVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEH
HSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYR
QFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLERVDVVQPVLFSVMVSLAALWRW
AGITPDAVIGHSQGEIAAAHVAGALTLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEPL
TQLLARWDGKITVAAVNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYME
HIRHQFLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTVA
ALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCH
GISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRG
WVFTGRISPRTQPWLNEHAVESAVLFPGTGFVELALHVADRAGYSSVNELIVHTPLLLAG
HDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNH
LPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPE
DTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHA
TRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPP
HPDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNE
ELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDL
AHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLT
RHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGAT
DLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGD
QLDQVLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYR
HRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPV
SIPAPINTHTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLAT
LVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLPPTLIFDHPTP
HAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGR
DVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQ
QRLLLEVCWEALETAGIPAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVMS
GRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEFS
RQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGAS
NGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGHHHTP
DQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLL
TEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVG
VLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEHH
SENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQ
FPVFAHALDEVAAALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLF
THAGIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTMLALQASEA
EVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFHSPH
MDPILEQFRQIAAQLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDTVAA
LLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHG
ISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGW
VFTGRISPRTQPWLNEHAVESAVLFPGTGFVELALHVADRAGYSSVNELIVHTPLLLAGH
DTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHL
PLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPED
TDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHAT
RLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPH
PDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEE
LQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLA
HAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLTR
HSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATD
LQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQ
LDQVLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYRH
RLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVS
IPAPINTHTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLATL
VAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLPPTLIFDHPTPH
ALTQHLHTRLTQSHTPVGPIASLLSHAIDEGKFRAGADLLMAASNLNQSFSNMAELNQLP
AVTDIADASPDGLLTLICISTSENEYARLAAANIHSLTFAEIAAPGFYDAQLPNSIETSA
EALATAITGAYANTSIVLVAHSIVCELAQATMTRLQDADIDLVGLVLLDPLEGTNSTEDY
VETVLTRIEHINAPRVGVDGYLAALGRYLQFHEDRRIPIPETRHMTLHSDTKIDRAQTPM
NLLQDEAALTALKIGNWMNDVGVALSVNLE
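
The protein record above and the nucleotide record that follows can be checked against each other by translating the first codons. This is a minimal sketch, assuming the standard codon assignments and the bacterial convention that an initiating GTG or TTG codon is read as methionine; `translate` and `FIRST_60_NT` are hypothetical names, and only the first 60 nucleotides of the record are used.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order (third base varies fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# First line of the nucleotide FASTA record for this CDS.
FIRST_60_NT = "GTGATCTTCGGAGATGCTCACCAAAACTGCAGGGGAGGTCGGGTGTTGGGTGATGCAGTC"


def translate(cds: str) -> str:
    """Translate a coding sequence; trailing partial codons are ignored."""
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # Assumption: an initiating ATG, GTG or TTG codon encodes methionine.
    if protein and codons[0] in {"ATG", "GTG", "TTG"}:
        protein[0] = "M"
    return "".join(protein)


print(translate(FIRST_60_NT))  # MIFGDAHQNCRGGRVLGDAV
```

The result matches the first 20 residues of the protein record, including the leading M that arises from the GTG start codon.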
>polyketide synthase [Type I modular polyketide synthase]
GTGATCTTCGGAGATGCTCACCAAAACTGCAGGGGAGGTCGGGTGTTGGGTGATGCAGTC
GCAGTGGTCGGAATGTCTTGCCGGGTTCCTGGCGCATCTGATCCGGACGCTCTGTGGGCG
CTGCTGCGAGACGGGATCAGTGTGGTCGATGAGATACCTTCTGCACGTTGGAATTTAGAC
GGCCTCGTTGCTCACCGACTGACCGATGAGCAACGATCAGCGCTTCGGCATGGCGCCTTT
CTTGATGACGTCGAAGGGTTTGACGCCGCGTTCTTCGGAATTAACCCCTCCGAAGCTGGG
TCGATGGATCCGCAGCAACGATTGATGCTTGAACTGACCTGGGCAGCACTCGAAGATGCT
CGAATCGTGCCAGAACATCTTTCCGGTAGCAGTAGCGGGGTGTTTACCGGCGCCATGAGC
GATGATTACACGACCGCGGTGACCTACCGCGCAGCGATGACTGCACATACCTTTGCGGGG
ACTCACCGCAGCCTCATAGCCAACCGTGTCTCCTACACACTCGGTCTACGCGGACCTAGT
TTGGTCATCGATACCGGGCAATCGTCCTCACTGGTGGCTGTGCACGTGGCAATGGAAAGC
TTGCGCAGAGAAGAAACTTCACTTGCTATCGCGGGTGGTATTCACCTTAACCTCAGCCTC
GCCGCCGCACTGAGCGCAGCACACTTTGGAGCCCTTTCACCTGACGGACGCTGCTACACC
TTCGACGCACGTGCCAACGGATACGTTCGTGGCGAAGGCGGCGGCGTCGTCGTCCTCAAA
CGTCTCAACGACGCCCTAGCCGACGGCAACCATATTTACTGTGTGATCCGCGGCAGCTCA
GTCAACAACGACGGCGCCACTCAAGACTTGACAGCGCCCGGAGTCGACGGCCAGCGTCAA
GCGCTCCTTCAAGCTTATGAGCGAGCCGAAATCGACCCCTCAGAAGTCCAATACGTCGAG
CTACATGGCACCGGCACCCGACTCGGCGATCCCACCGAAGCCCACTCGCTTCACTCCGTC
TTCGGCACATCCACGGTCCCGCGCAGCCCGCTGCTAGTCGGGTCAATCAAAACCAATATC
GGTCACCTCGAAGGCGCCGCAGGAATCCTCGGCCTAATCAAGACTGCCCTTGCCGTTCAT
CATCGCCAGCTTCCCCCCAGCCTCAACTACACGGTTCCTAACCCAAAAATCCCGCTAGAG
CAGCTAGGGCTCCGCGTCCAAACCACTCTCAGTGAATGGCCGGACTTAGACAAACCGCTA
ACGGCGGGCGTGTCATCTTTTTCCATGGGTGGCACCAACGCCCACCTCATCCTCCAACAA
CCCCCCACCCCCGACACCACACAAACCCCCAACCCCACAACAGGTTCTGATCCCGCAGTG
GGTTCTGATCCCGCAGTGGGTGTACTGGTGTGGCCGTTGTCAGCGCGTTCAGCGCCGGGG
TTAAGCGCACAAGCGGCCCGTCTGTACCAGCATCTCAGCGCCCACCCCGATCTGGATCCG
ATCGATGTAGCCCACAGCCTGGCTACCACACGCAGCCACCACCCCCACCGCGCCACCATC
ACCACCAGCATTGAGCACCACAGCGAAAACAACCACGACACAACCGATGCGCTGGCCGCA
CTGCACGCCCTGGCCAACAACGGCACACACCCCCTGCTGAGCAGAGGCCTGCTGACCCCA
CAGGGCCCCGGCAAAACAGTGTTCGTGTTCCCCGGACAGGGCAGTCAATACCCCGGCATG
GGCGCAGATCTCTACCGCCAATTCCCCGTGTTCGCCCACGCCCTCGACGAGGTCGCTGCG
GCGCTGAACCCGCATCTCGATGTTGCGTTGCTTGAGGTGATGTTCAGCCAACAAGACACT
GCCATGGCGCAACTGCTGGACCAGACCTTCTATGCACAACCGGCGTTGTTCGCGCTGGGA
ACCGCTCTACATCGATTGTTCACCCACGCCGGTATCCACCCGGACTACCTGCTAGGCCAC
TCCATCGGAGAACTCACCGCGGCATACGCCGCCGGTGTGCTGTCACTGCAAGACGCAGCC
ACCTTGGTCACAAGCCGAGGACGACTGATGCAATCCTGCACGCCCGGCGGGACGATGCTC
GCACTACAAGCCAGCGAAGCAGAAGTACAACCGCTGCTTGAAGGCCTAGACCACGCCGTG
TCCATCGCCGCGATCAACGGAGCAACGTCGATCGTACTGTCAGGAGATCACGACAGCCTC
GAACAAATCGGCGAGCACTTCATTACCCAAGATCGACGTACCACCCGACTGCAGGTCAGT
CACGCTTTCCACTCTCCACATATGGACCCCATCCTCGAACAATTCCGCCAGATCGCGGCC
CAACTCACCTTCAGCGCACCCACCCTGCCCATCTTGTCCAACCTCACCGGGCAGATCGCC
CGCCACGACCAACTCGCCTCACCTGACTATTGGACCCAACAGCTACGTAACACTGTCCGG
TTCCATGACACTGTCGCTGCCCTGCTCGGGGCGGGTGAGCAGGTTTTCCTGGAACTTTCA
CCTCACCCGGTGTTGACACAAGCGATCACCGACACCGTCGAACAAGCCGGCGGCGGCGGC
GCAGCAGTGCCAGCTCTACGCAAGGATCGCCCTGATGCTGTCGCGTTCGCTGCAGCACTC
GGCCAGCTGCACTGCCATGGCATCAGCCCATCCTGGAATGTTCTTTACTGCCAGGCCCGC
CCCCTCACACTGCCCACCTACGCTTTCCAGCATCAGCGTTACTGGCTGCTGCCCACCGCT
GGTGATTTCAGCGGGGCCAATACCCACGCCATGCATCCGCTGCTAGACACCGCCACCGAA
CTGGCCGAAAACCGCGGATGGGTGTTCACCGGCCGGATCAGCCCACGCACCCAACCATGG
CTAAACGAACACGCCGTCGAATCAGCCGTGCTGTTCCCGAACACCGGATTTGTCGAGCTA
GCGCTGCATGTCGCTGACCGTGCCGGATATTCCTCGGTCAACGAACTGATCGTGCACACC
CCCCTGCTGCTCGCTGGCCACGACACCGCGGATCTACAGATCACCGTCACCGACACCGAT
GACATGGGCCGGCAGTCTCTTAACATCCACTCGCACCCACATATCGGCCATGACAACACC
ACCACCGGCGATGAACAACCCGAGTGGGTCCTGCATGCCAGCGCAGTCCTGACCGCACAA
ACCACCGACCACAACCACCTCCCCCTAACGCCTGTGCCGTGGCCTCCACCCGGCACAGCC
GCGATCGAGGTGGATGACTTCTACGACGACCTGGCTGCACAGGGCTACAACTACGGCCCG
ACATTCCAAGGTGTGCAACGGATATGGCGTGACCACGCCACACCCGATGTCATCTACGCC
GAAGTTGAACTACCCGAAGACACCGACATCGACGGCTACGGCATCCACCCCGCCCTATTC
GACGCCGCTTTACACCCCCTACTCGCCCTGACCCAACCCCCCACCAACGACACCGATGAC
ACCAACACCGCAGACACCGGGGACCAGGTGCGGCTGCCCTACGCCTTTACCGGCATCAGT
TTGCACGCCACCCACGCCACCCGATTGCGGGTACGGCTGACCCGTACCGGCGCCGATGCC
ATCACCGTGCACACCAGTGACACCACCGGAGCCCCGGTGGCGATCATCGACTCATTGATC
ACCCGCCCCCTCACCACCGCCACAGGGTCTGCTCCGGCAACCACAGCAGCTGGCCTACTA
CACCTGAGCTGGCCACCACACCCTGACACCACGACCGACACCGACACCGACACCGATGCC
CTGCGGTATCAGGTGATCGCCGAACCCACTCAACAACTGCCCCGCTACCTGCACGACCTA
CACACCAGCACCGACCTGCACACCAGCACCACCGAAGCAGACGTGGTTGTGTGGCCGGTA
CCGGTGCCCAGCAACGAAGAGCTCCAGGCACACCAAGCATCCGACACCGCGGTGTCTTCT
CGGATACACACCCTGACCCGCCAAACACTTACCGTGGTGCAGGACTGGCTCACTCACCCC
GACACCACCGGCACCCGACTGGTCATCGTGACCCGCCACGGCGTCAGCACCAGTGCCCAC
GACCCGGTCCCCGACCTAGCCCACGCCGCAGTGTGGGGCCTGATCCGCAGCGCCCAAAAC
GAACACCCCGGACGCTTCACACTGCTCGACACCGACGACAACACCAACAGCGACACCCTC
ACCACCGCCCTAACCCTGCCAACCCGCGAAAACCAACTGGCCATACGCCGCGACACCATC
CACATCCCCCGCCTGACCCGACACAGCAGTGACGGTGCGCTCACTGCGCCGGTGGTGGTA
GATCCTGAGGGCACGGTGTTGATCACCGGGGGGACCGGGACGCTGGGTGCCTTGTTCGCC
GAGCATCTGGTTTCTGCCCATGGTGTCCGGCATCTGTTGTTGACCTCGCGGCGCGGACCT
CAGGCCCACGGTGCCACCGATCTGCAGCAGCGGCTCACCGATCTAGGTGCTCATGTCACC
ATCACGGCCTGCGATATCAGCGACCCCGAAGCACTGGCCGCCCTGGTCAATTCAGTGCCC
ACACAACACCGTTTAACCGCGGTAGTGCACACCGCCGCGGTATTGGCCGACACCCCGGTC
ACCGAGTTGACCGGCGATCAACTCGACCAGGTGCTGGCCCCCAAAATCGACGCGGCATGG
CAGCTGCACCAACTCACCTACGAACACAACCTGTCTGCATTCATCATGTTCTCGTCCATG
GCCGGAATGATAGGCAGTCCCGGTCAGGGTAACTACGCGGCAGCCAACACCGCGTTAGAT
GCTCTCGCCGACTACCGCCACCGCCTGGGCTTGCCCGCGACCAGCCTGGCCTGGGGCTAC
TGGCAGACTCACACCGGTCTCACCGCGCATCTAACCGATGTAGATCTAGCCCGCATGACC
CGCCTGGGTTTGATGCCCATCGCCACCAGCCACGGACTGGCCCTGTTCGATGCCGCCCTC
GCCACCGGACAGCCCGTTTCGATACCCGCCCCGATCAACACCCACACCCTGGCCCGACAC
GCCCGCGACAACACCCTGGCCCCGATCCTGTCTGCGCTGATCACCACACCACGGCGCCGG
GCGGCCTCTGCCGCAACCGATCTCGCTGCCCGCCTCAACGGACTTAGCCCCCAACAGCAA
CAACAAACACTGGCCACCCTCGTGGCCGCGGCCACCGCCACCGTGCTGGGCCACCACACC
CCCGAAAGCATCAGCCCAGCCACCGCGTTCAAAGACCTCGGAATCGATTCGCTGACCGCC
CTTGAACTGCGCAACACCCTCACCCACAACACCGGCCTCAACCTTTCGTCCACTCTTATC
TTCGATCACCCCACACCCCATGCGGTGGCCGAGCATCTGCTTGAACAGATCCCTGGCATC
GGTGCCCTGGTGCCGGCTCCGGTGGTGATCGCAGCTGGTCGTACCGAGGAGCCGGTGGCG
GTGGTGGGGATGGCGTGTCGTTTCCCCGGTGGTGTCGCATCAGCGGATCAGTTGTGGGAC
TTGGTGATCGCTGGCCGTGATGTGGTGGGTAATTTTCCGGCCGATCGGGGTTGGGATGTG
GAGGGACTGTTTGATCCCGATCCGGACGCGGTCGGCAAAACCTACACCCGTTACGGCGCG
TTCCTTGACGATGCGGCAGGTTTTGATGCCGGGTTCTTTGGGATCTCTCCACGGGAGGCA
CGCGCGATGGACCCCCAGCAGCGGCTGCTGCTGGAGGTGTGCTGGGAAGCGCTAGAAACC
GCGGGTATTCCCGCGCACACCTTGGCCGGCACCTCCACCGGGGTATTCGTCGGAGCCTGG
GCCCAGTCCTACGGCGCCACCAACTCCGATGACGCTGAGGGGTATGCGATGACCGGCGGC
GCGACTAGCGTCATGTCCGGCCGTATCGCCTACACCTTGGGCCTAGAAGGTCCAGCGATC
ACCGTTGACACCGCCTGCTCGTCATCGCTGGTGGCAATTCACCTGGCCTGCCAATCCTTA
CGCAACAACGAATCCCAGCTAGCACTGGCCGGCGGCGTCACCGTGATGAGCACACCTGCG
GTTTTCACCGAGTTCTCCCGCCAACGCGGCCTGGCCCCAGATGGACGCTGCAAAGCCTTC
GCCGCTACCGCCGATGGCACCGGCTGGGGTGAAGGCGCCGCGGTCTTGGTCCTTGAACGG
CTCTCCGAGGCCCGCCGCAACAACCACCCGGTCCTTGCGATCGTCGCTGGATCGGCGATC
AACCAAGACGGCGCATCCAACGGACTGACCGCACCCCACGGCCCGTCACAACAACGCGTC
ATCAACCAAGCACTAGCCAACGCCGGCCTCACCCACGACCAGGTCGACGCCGTCGAAGCC
CACGGCACCGGCACCACACTGGGTGACCCCATCGAAGCCAGCGCCCTACACGCCACCTAC
GGCCACCACCACACGCCCGATCAACCGCTTTGGCTGGGATCCATCAAATCCAACATCGGC
CACACCCAAGCCGCCGCCGGCGCCGCCGGTGTGGTCAAGATGATCCAAGCCATCACCCAC
GCCACCTTGCCCGCCACCTTGCACGTCGACCAACCCAGCCCCCACATCGACTGGTCCAGC
GGCACAGTCCGACTCCTAACCGAGCCCATCCAATGGCCCAACACCGACCACCCCCGCACC
GCGGCGGTGTCCTCATTCGGCATCAGCGGCACCAACGCCCACCTCATCCTCCAACAACCC
CCCACCCCTAACCCCACACAAACCCCCGAGGACTGCAGCCCCGCACAATCTCCCTGCGCA
ACAATCACCGATGCAGGCACGGGATTATCGTTTGTGCCCTGGGTGATTTCAGCGAAGTCG
GCTGAGGCGTTGTCTGCGCAGGCGAGCCGATTGTTGACGCGCCTTGACGATGATCCAGTT
GTCGATGCAATCGACCTGGGGTGGTCATTGATAGCCACTCGATCGATGTTTGAGCATCGC
GCAGTAGTTGTGGGTGCGGATCGTCACCAGTTGCAGCGCGGGTTGGCCGAGTTGGCTTCT
GGTAACTTGGGCGCCGATGTAGTGGTGGGCCGGGCCCGCGCAGCGGGCGAGACTGTAATG
GTGTTTCCCGGTCAGGGATCACAGCGGTTGGGCATGGGCGCGCAGCTTTATGAACAATTC
CCGGTATTCGCGGCGGCGTTTGATGACGTTGTTGATGCGCTGGACCAGTATCTGCGGTTG
CCGCTACGCCAAGTTATGTGGGGTGACGATGAAGGCCTGCTCAATTCAACGGAGTTCGCC
CAGCCGTCGTTGTTTGCTGTCGAGGTCGCACTGTTTGCGTTGCTGCGCTTCTGGGGTGTC
GTTCCGGATTACGTGATAGGCCATTCGGTAGGAGAGCTGGCCGCTGCACAAGTGGCTGGC
GTTTTGAGCCTGCAGGACGCGGCTAAATTAGTTTCAGCGCGGGGCCGACTGATGCAGGCC
CTGCCCGCCGGTGGAGCGATGGTCGCGGTAGCCGCCAGCCAGCATGAAGTCGAGCCTTTG
CTGGTTGAAGGGGTCGATATCGCGGCGCTCAATGCGCCAGGGTCAGTTGTGATCTCTGGT
GATCAGGCGGCAGTCCGTTTGATCGCTAATCGATTGGCGGATAGGGGCTACAGGGCGCAC
GAACTTGCGGTTTCGCATGCCTTTCATTCATCGTTGATGGAGCCGATGTTGGAGGAGTTC
GCTCGGCTCGCTTCTGAAATCGTTGTGGAGCAACCGCAGATTCCACTGATTTCGAACGTG
ACTGGTCAGCTGGCCAACGCCGACTACGGGTCGGCAGGTTACTGGGTGGACCACATCCGC
CGTCCAGTCCGTTTCGCCGATAGTGTCGCTTCGTTGGAAGCCATGGGGGCTAGCTGCTTC
ATTGAAGTCGGTCCAGCCAGCGGGTTGGGCGCAGCTATCGAGCAATCCTTGAAATCTGCC
GAGCCGACCGTGTCAGTGTCGGCACTGTCCACCGATAAACCTGAATCCGTCGCCGTATTG
CGCGCTGCAGCACGACTTTCCACCTCCGGCATTCCTGTGGATTGGCAGTCGGTGTTCGAC
GGCCGCAGCACCCAGACAGTTAACCTGCCCACCTACGCCTTCCAGCGGCAACGGTTCTGG
CTCGACGCCAACCGTATCGGTCAAGGCGATCCCGCCAGTCAACCACAGGCCCAGAACGTT
GAATCCCGTTTTTGGGAGGCGGTCGAGCGGGAAGACGTTGATGGCTTGGCTGATTCTATA
GGTGTCACCGCCAGTGCCATGCAGACCGTGCTACCTGCATTGTCTTCATGGCGTCGCGCG
GAGCGCACACAGTCCGAGCTTGATTCCTGGCGCTATCAGGTGACATGGCTGTCTTCCCCA
GCAACGCCGAGTTCGATCACGCTGTCCGGCATTTGGTTGCTGATAGTTCCAAGCGAACTT
GCAAAGACTGACCCAGTAATTGGATGTGCTGCAGCGCTCGAAGCGCACGGCGCCTTAGTC
ACGATTATCACAATTTTCGAGCCGGACTTCAATCGCTCATTGATGGGCGCTTCCCTAAAA
GATATCGGTTCACACATATCTGGTGTCATATCGTTCTTAGGGATTCACGGGTCCGAATTC
TCCGATAGCGGCGCGGTCAAGACATTAAATCTTGTGCAAGCAATGGGCGATGTCCACTTA
GACGTTCCTTTGTGGTGCCTAACGCAGGGCGCGGTATCGATCAGCGCCGACGATTTGATC
CGATGCTCGTCAGCAGCCCTGGTGTGGGGTCTGGGGAGAGTCGTCGCATTAGAGCACCCG
GGATCGTGGGGTGGCTTAGTAGACCTCCCCGAGTCACCCGACGATGCAGCATGGGAGCGC
TTGTGCGCCCTCCTCGCGCAGCCGACGGATGAAGATCAGTTTGCGATCAGGCCGTCTGGG
GTTTTCCTACGGAGATTGATCCACGCCCCGGCAACCACGACATCCAAATCCTCGACCGCG
TGGGCTCCGAGGGGGACCGTGTTAATCACAGGCGGCACAGGCGCGTTAGGCGCACACGTC
GCAAGGTGGTTGGCCCACAAATATGAATCGGTAGATTTGCTCTTAACCAGCCGTCGCGGG
ATGGCAGCCGATGGAGCTACAGAGCTAGTGGATGACCTCCGCACGGCTGGCGCCAGTGTG
ACAGTGCACGCCTGCGACGTGACAGACCGCACTTCAGTCGAGGCTGCAATAGCAGGTAAA
TCCCTTGATGCGGTCTTTCATCTTGCAGGACGACACCAGCCAACTCTGCTAACAGAACTC
GAGGACGAATCCTTTAGTGACGAATTGGCGCCGAAGGTTCACGGTGCCCAAGTATTGAGT
GACATCACGTCTAACCTCACACTATCAGCGTTTGTCATGTTCTCGTCAGTAGCCGGAATC
TGGGGCGGCAAAAGTCAAGGCGCATATGCTGCCGCTAACGCATTCTTAGATTCGCTCGCC
GAGAAACGGCGCACGTTGGGGTTACCAGCAACATCGGTCGCTTGGGGACTGTGGGCTGGC
GGCGGCATGGGAGACCGGCCATCCGCTTCGGGACTAAACCTTATTGGCTTGAAATCGATG
TCAGCAGATTTAGCTGTGCAGGCGCTAAGCGACGCCATTGACAGACCGCAAGCAACATTG
ACTGTTGCGAGCGTCAACTGGGATCGGTTCTACCCCACATTCGCTTTGGCGCGACCGAGG
CCCTTCCTACACGAAATCACAGAGGTAATGGCTTACCGCGAGTCGATGCGCTCAAGCTCT
GCATCGACGGCGACGCTCCTGACGAGCAAATTAGCCGGACTAACGGCGACAGAACAGCGT
GCAGTCACCCGGAAGTTGGTCCTTGATCAAGCCGCATCCGTTCTCGGGTACGCCTCAACT
GAGAGTCTCGATACTCATGAGTCATTCAAAGACCTCGGATTTGATTCGCTGACCGCCCTT
GAACTGCGCGACCACCTCCAAACTGCGACCGGCCTCAACCTTTCGTCCACTCTTATCTTC
GATCACCCCACACCCCATGCGGTGGCCGAGCATCTGCTTGAACAGATCCCTGGCATCGGT
GCCCTGGTGCCGGCTCCGGTGGTGATCGCAGCTGGTCGTACCGAGGAGCCGGTGGCGGTG
GTGGGGATGGCGTGTCGTTTCCCCGGTGGTGTCGCATCAGCGGATCAGTTGTGGGACTTG
GTGATCGCTGGCCGTGATGTGGTGGGTAATTTTCCGGCCGATCGGGGTTGGGATGTGGAG
GGACTGTTTGATCCCGATCCGGACGCGGTCGGCAAAACCTACACCCGTTACGGCGCGTTC
CTTGACGATGCGGCAGGTTTTGATGCCGGGTTCTTTGGGATCTCTCCACGGGAGGCACGC
GCGATGGACCCCCAGCAGCGGCTGCTGCTGGAGGTGTGCTGGGAAGCGCTAGAAACCGCG
GGTATTCCCGCGCACACCTTGGCCGGCACCTCCACCGGGGTATTCGTCGGAGCCTGGGCC
CAGTCCTACGGCGCCACCAACTCCGATGACGCTGAGGGGTATGCGATGACCGGCGGCGCG
ACTAGCGTCATGTCCGGCCGTATCGCCTACACCTTGGGCCTAGAAGGTCCAGCGATCACC
GTTGACACCGCCTGCTCGTCATCGCTGGTGGCAATTCACCTGGCCTGCCAATCCTTACGC
AACAACGAATCCCAGCTAGCACTGGCCGGCGGCGTCACCGTGATGAGCACACCTGCGGTT
TTCACCGAGTTCTCCCGCCAACGCGGCCTGGCCCCAGATGGACGCTGCAAAGCCTTCGCC
GCTACCGCCGATGGCACCGGCTGGGGTGAAGGCGCCGCGGTCTTGGTCCTTGAACGGCTC
TCCGAGGCCCGCCGCAACAACCACCCGGTCCTTGCGATCGTCGCTGGATCGGCGATCAAC
CAAGACGGCGCATCCAACGGACTGACCGCACCCCACGGCCCGTCACAACAACGCGTCATC
AACCAAGCACTAGCCAACGCCGGCCTCACCCACGACCAGGTCGACGCCGTCGAAGCCCAC
GGCACCGGCACCACACTGGGTGACCCCATCGAAGCCAGCGCCCTACACGCCACCTACGGC
CACCACCACACGCCCGATCAACCGCTTTGGCTGGGATCCATCAAATCCAACATCGGCCAC
ACCCAAGCCGCCGCCGGCGCCGCCGGTGTGGTCAAGATGATCCAAGCCATCACCCACGCC
ACCTTGCCCGCCACCTTGCACGTCGACCAACCCAGCCCCCACATCGACTGGTCCAGCGGC
ACAGTCCGACTCCTAACCGAGCCCATCCAATGGCCCAACACCGACCACCCCCGCACCGCG
GCGGTGTCCTCATTCGGCATCAGCGGCACCAACGCCCACCTCATCCTCCAACAACCCCCC
ACCCCTAACCCCACACAAACCCCCGAGGACTGCAGCCCCGCACAATCTCCCTGCGCAACA
ATCACCGATGCAGGCACGGGATTATCGTTTGTGCCCTGGGTGATTTCAGCGAAGTCGGCT
GAGGCGTTGTCTGCGCAGGCGAGCCGATTGTTGACGCGCCTTGACGATGATCCAGTTGTC
GATGCAATCGACCTGGGGTGGTCATTGATAGCCACTCGATCGATGTTTGAGCATCGCGCA
GTAGTTGTGGGTGCGGATCGTCACCAGTTGCAGCGCGGGTTGGCCGAGTTGGCTTCTGGT
AACTTGGGCGCCGATGTAGTGGTGGGCCGGGCCCGCGCAGCGGGCGAGACTGTAATGGTG
TTTCCCGGTCAGGGATCACAGCGGTTGGGCATGGGCGCGCAGCTTTATGAACAATTCCCG
GTATTCGCGGCGGCGTTTGATGACGTTGTTGATGCGCTGGACCAGTATCTGCGGTTGCCG
CTACGCCAAGTTATGTGGGGTGACGATGAAGGCCTGCTCAATTCAACGGAGTTCGCCCAG
CCGTCGTTGTTTGCTGTCGAGGTCGCACTGTTTGCGTTGCTGCGCTTCTGGGGTGTCGTT
CCGGATTACGTGATAGGCCATTCGGTAGGAGAGCTGGCCGCTGCACAAGTGGCTGGCGTT
TTGAGCCTGCAGGACGCGGCTAAATTAGTTTCAGCGCGGGGCCGACTGATGCAGGCCCTG
CCCGCCGGTGGAGCGATGGTCGCGGTAGCCGCCAGCCAGCATGAAGTCGAGCCTTTGCTG
GTTGAAGGGGTCGATATCGCGGCGCTCAATGCGCCAGGGTCAGTTGTGATCTCTGGTGAT
CAGGCGGCAGTCCGTTTGATCGCTAATCGATTGGCGGATAGGGGCTACAGGGCGCACGAA
CTTGCGGTTTCGCATGCCTTTCATTCATCGTTGATGGAGCCGATGTTGGAGGAGTTCGCT
CGGCTCGCTTCTGAAATCGTTGTGGAGCAACCGCAGATTCCACTGATTTCGAACGTGACT
GGTCAGCTGGCCAACGCCGACTACGGGTCGGCAGGTTACTGGGTGGACCACATCCGCCGT
CCAGTCCGTTTCGCCGATAGTGTCGCTTCGTTGGAAGCCATGGGGGCTAGCTGCTTCATT
GAAGTCGGTCCAGCCAGCGGGTTGGGCGCAGCTATCGAGCAATCCTTGAAATCTGCCGAG
CCGACCGTGTCAGTGTCGGCACTGTCCACCGATAAACCTGAATCCGTCGCCGTATTGCGC
GCTGCAGCACGACTTTCCACCTCCGGCATTCCTGTGGATTGGCAGTCGGTGTTCGACGGC
CGCAGCACCCAGACAGTTAACCTGCCCACCTACGCCTTCCAGCGGCAACGGTTCTGGCTC
GACGCCAACCGTATCGGTCAAGGCGATCCCGCCAGTCAACCACAGGCCCAGAACGTTGAA
TCCCGTTTTTGGGAGGCGGTCGAGCGGGAAGACGTTGATGGCTTGGCTGATTCTATAGGT
GTCACCGCCAGTGCCATGCAGACCGTGCTACCTGCATTGTCTTCATGGCGTCGCGCGGAG
CGCACACAGTCCGAGCTTGATTCCTGGCGCTATCAGGTGACATGGCTGTCTTCCCCAGCA
ACGCCGAGTTCGATCACGCTGTCCGGCATTTGGTTGCTGATAGTTCCAAGCGAACTTGCA
AAGACTGACCCAGTAATTGGATGTGCTGCAGCGCTCGAAGCGCACGGCGCCTTAGTCACG
ATTATCACAATTTTCGAGCCGGACTTCAATCGCTCATTGATGGGCGCTTCCCTAAAAGAT
ATCGGTTCACACATATCTGGTGTCATATCGTTCTTAGGGATTCACGGGTCCGAATTCTCC
GATAGCGGCGCGGTCAAGACATTAAATCTTGTGCAAGCAATGGGCGATGTCCACTTAGAC
GTTCCTTTGTGGTGCCTAACGCAGGGCGCGGTATCGATCAGCGCCGACGATTTGATCCGA
TGCTCGTCAGCAGCCCTGGTGTGGGGTCTGGGGAGAGTCGTCGCATTAGAGCACCCGGGA
TCGTGGGGTGGCTTAGTAGACCTCCCCGAGTCACCCGACGATGCAGCATGGGAGCGCTTG
TGCGCCCTCCTCGCGCAGCCGACGGATGAAGATCAGTTTGCGATCAGGCCGTCTGGGGTT
TTCCTACGGAGATTGATCCACGCCCCGGCAACCACGACATCCAAATCCTCGACCGCGTGG
GCTCCGAGGGGGACCGTGTTAATCACAGGCGGCACAGGCGCGTTAGGCGCACACGTCGCA
AGGTGGTTGGCCCACAAATATGAATCGGTAGATTTGCTCTTAACCAGCCGTCGCGGGATG
GCAGCCGATGGAGCTACAGAGCTAGTGGATGACCTCCGCACGGCTGGCGCCAGTGTGACA
GTGCACGCCTGCGACGTGACAGACCGCACTTCAGTCGAGGCTGCAATAGCAGGTAAATCC
CTTGATGCGGTCTTTCATCTTGCAGGACGACACCAGCCAACTCTGCTAACAGAACTCGAG
GACGAATCCTTTAGTGACGAATTGGCGCCGAAGGTTCACGGTGCCCAAGTATTGAGTGAC
ATCACGTCTAACCTCACACTATCAGCGTTTGTCATGTTCTCGTCAGTAGCCGGAATCTGG
GGCGGCAAAAGTCAAGGCGCATATGCTGCCGCTAACGCATTCTTAGATTCGCTCGCCGAG
AAACGGCGCACGTTGGGGTTACCAGCAACATCGGTCGCTTGGGGACTGTGGGCTGGCGGC
GGCATGGGAGACCGGCCATCCGCTTCGGGACTAAACCTTATTGGCTTGAAATCGATGTCA
GCAGATTTAGCTGTGCAGGCGCTAAGCGACGCCATTGACAGACCGCAAGCAACATTGACT
GTTGCGAGCGTCAACTGGGATCGGTTCTACCCCACATTCGCTTTGGCGCGACCGAGGCCC
TTCCTACACGAAATCACAGAGGTAATGGCTTACCGCGAGTCGATGCGCTCGAGCTCTGCA
TCGACGGCGACGCTCCTGACGAGCAAATTAGCCGGACTAACGGCGACAGAACAGCGTGCA
GTCACCCGGAAGTTGGTCCTTGATCAAGCCGCATCCGTTCTCGGGTACGCCTCAACTGAG
AGTCTCGATACTCATGAGTCATTCAAAGACCTCGGATTTGATTCGCTGACCGCCCTTGAA
CTGCGCGACCACCTCCAAACTGCGACCGGCCTCAACCTTTCGTCCACTCTTATCTTCGAT
CACCCCACACCCCATGCGGTGGCCGAGCATCTGCTTGAACAGATCCCTGGCATCGGTGCC
CTGGTGCCGGCTCCGGTGGTGATCGCAGCTGGTCGTACCGAGGAGCCGGTGGCGGTGGTG
GGGATGGCGTGTCGTTTCCCCGGTGGTGTCGCATCAGCGGATCAGTTGTGGGACTTGGTG
ATCGCTGGCCGTGATGTGGTGGGTAATTTTCCGGCCGATCGGGGTTGGGATGTGGAGGGA
CTGTTTGATCCCGATCCGGACGCGGTCGGCAAAACCTACACCCGTTACGGCGCGTTCCTT
GACGATGCGGCAGGTTTTGATGCCGGGTTCTTTGGGATCTCTCCACGGGAGGCACGCGCG
ATGGACCCCCAGCAGCGGCTGCTGCTGGAGGTGTGCTGGGAAGCGCTAGAAACCGCGGGT
ATTCCCGCGCACACCTTGGCCGGCACCTCCACCGGGGTATTCGTCGGAGCCTGGGCCCAG
TCCTACGGCGCCACCAACTCCGATGACGCTGAGGGGTATGCGATGACCGGCGGCGCGATC
AGCGTCATGTCCGGCCGTATCGCCTACACCTTGGGCCTAGAAGGTCCAGCGATCACCGTT
GACACCGCCTGCTCGTCATCGCTGGTGGCAATTCACCTGGCCTGCCAATCCTTACGCAAC
AACGAATCCCAGCTAGCACTGACCGGCGGCGTCACCGTGATGAGCACACCTGCGATTTTC
ACCGAGTTCTCCCGCCAACGCGGCCTGGCCCCAGATGGACGCTGCAAAGCCTTCGCCGCT
ACCGCCGATGGCACCGGCTGGGGTGAAGGCGCCGCGGTCTTGGTCCTTGAACGGCTCTCC
GAGGCCCGCCGCAACAACCACCCGGTCCTTGCGATCGTCGCTGGATCGGCGATCAACCAA
GACGGCGCATCCAACGGACTGACCGCACCCCACGGCCCGTCACAACAACGCGTCATCAAC
CAAGCACTAGCCAACGCCGGCCTCACCCACGACCAGGTCGACGCCGTCGAAGCCCACGGC
ACCGGCACCACACTGGGTGACCCCATCGAAGCCAGCGCCCTACACGCCACCTACGGCCAC
CACCACACGCCCGATCAACCGCTTTGGCTGGGATCCATCAAATCCAACATCGGCCACACC
CAAGCCGCCGCCGGCGCCGCCGGTGTGGTCAAGATGATCCAAGCCATCACCCACGCCACC
TTGCCCGCCACCTTGCACGTCGACCAACCCAGCCCCCACATCGACTGGTCCAGCGGCACA
GTCCGACTCCTAACCGAGCCCATCCAATGGCCCAACACCGACCACCCCCGCACCGCGGCG
GTGTCCTCATTCGGCATCAGCGGCACCAACGCCCACCTCATCCTCCAACAACCCCCCACC
CCCGACACCACACAAACCCCCAACACCACAACAGGTTCTGATCCCGCAGTGGGTTCTGAT
CCCGCAGTGGGTGTACTGGTGTGGCCGTTGTCAGCGCGTTCAGCGCCGGGGTTAAGCGCA
CAAGCGGCCCGTCTGTACCAGCATCTCAGCGCCCACCCCGATCTGGATCCGATCGATGTA
GCCCACAGCCTGGCTACCACACGCAGCCACCACCCCCACCGCGCCACCATCACCACCAGC
ATTGAGCACCACAGCGAAAACAACCACGACACAACCGATGCGCTGGCCGCACTGCACGCC
CTGGCCAACAACGGCACACACCCCCTGCTGAGCAGAGGCCTGCTGACCCCACAGGGCCCC
GGCAAAACAGTGTTCGTGTTCCCCGGACAGGGCAGTCAATACCCCGGCATGGGCGCAGAT
CTCTACCGCCAATTCCCCGTGTTCGCCCACGCCCTCGACGCATGCGACGCAGCGTTACAG
CCTTTCACTGGATGGTCGGTGCTAGCTGTGTTACACGACGAACCCGAGGCCCCGTCGTTG
GAGCGAGTCGATGTGGTCCAGCCTGTGTTGTTCTCGGTGATGGTGTCGTTAGCCGCACTC
TGGCGGTGGGCCGGAATCACCCCCGATGCAGTCATCGGCCACTCCCAGGGCGAGATCGCC
GCGGCACATGTGGCCGGAGCCCTGACCTTGCCCGAAGCAGCTGCGGTAGTGGCTTTGCGC
AGCCGTGTCTTGACCGACCTGGCCGGTGCCGGTGCCATGGCTTCAGTGCTATCGCCCGAG
GAACCACTGACCCAGCTGCTGGCACGGTGGGACGGCAAGATCACTGTCGCCGCAGTTAAC
GGCCCCGCTAGCGCTGTGGTCTCCGGCGATACCACAGCGATCACCGAATTGCTGATTACC
TGCGAACACGAAAACATCGACGCTCGCGCTATCCCGGTGGACTACCCCTCTCATTCCCCC
TATATGGAACACATCCGCCATCAGTTCCTCGACGAGCTACCCGAGCTGACACCGCGGCCA
TCAACCATCGCGATGTATTCCACCGTCGACGGCGAACCTCACGACACCGCCTACGACACC
ACCACAATGACCGCGGACTACTGGTACCGCAACATCCGTAACACTGTCCGGTTCCATGAC
ACTGTCGCTGCCCTGCTCGGGGCGGGTGAGCAGGTTTTCCTGGAACTTTCACCTCACCCG
GTGTTGACACAAGCGATCACCGACACCGTCGAACAAGCCGGCGGCGGCGGCGCAGCAGTG
CCAGCTCTACGCAAGGATCGCCCTGATGCTGTCGCGTTCGCTGCAGCACTCGGCCAGCTG
CACTGCCATGGCATCAGCCCATCCTGGAATGTTCTTTACTGCCAGGCCCGCCCCCTCACA
CTGCCCACCTACGCTTTCCAGCATCAGCGTTACTGGCTGCTGCCCACCGCTGGTGATTTC
AGCGGGGCCAATACCCACGCCATGCATCCGCTGCTAGACACCGCCACCGAACTGGCCGAA
AACCGCGGATGGGTGTTCACCGGCCGGATCAGCCCACGCACCCAACCATGGCTAAACGAA
CACGCCGTCGAATCAGCCGTGCTGTTCCCAGGCACCGGATTTGTCGAGCTAGCGCTGCAT
GTCGCTGACCGTGCCGGATATTCCTCGGTCAACGAACTGATCGTGCACACCCCCCTGCTG
CTCGCTGGCCACGACACCGCGGATCTACAGATCACCGTCACCGACACCGATGACATGGGC
CGGCAGTCTCTTAACATCCACTCGCGCCCACATATCGGCCATGACAACACCACCACCGGC
GATGAACAACCCGAGTGGGTCCTGCATGCCAGCGCAGTCCTGACCGCACAAACCACCGAC
CACAACCACCTCCCCCTAACGCCTGTGCCGTGGCCTCCACCCGGCACAGCCGCGATCGAG
GTGGATGACTTCTACGACGACCTGGCTGCACAGGGCTACAACTACGGCCCGACATTCCAA
GGTGTGCAACGGATATGGCGTGACCACGCCACACCCGATGTCATCTACGCCGAAGTTGAA
CTACCCGAAGACACCGACATCGACGGCTACGGCATCCACCCCGCCCTATTCGACGCCGCT
TTACACCCCCTACTCGCCCTGACCCAACCCCCCACCAACGACACCGATGACACCAACACC
GCAGACACCGGTGACCAGGTGCGGCTGCCCTACGCCTTTACCGGCATCAGTTTGCACGCC
ACCCACGCCACCCGATTACGGGTACGGCTGACCCGTACCGGCGCCGATGCCATCACCGTG
CACACCAGTGACACCACCGGAGCCCCGGTGGCGATCATCGACTCATTGATCACCCGCCCC
CTCACCACCGCCACAGGGTCTGCTCCGGCAACCACAGCAGCTGGCCTACTACACCTGAGC
TGGCCACCACACCCTGACACCACGACCGACACCGACACCGACACCGATGCCCTGCGGTAT
CAGGTGATCGCCGAACCCACTCAACAACTGCCCCGCTACCTGCACGACCTACACACCAGC
ACCGACCTGCACACCAGCACCACCGAAGCAGACGTGGTTGTGTGGCCGGTACCGGTGCCC
AGCAACGAAGAGCTCCAGGCACACCAAGCATCCGACACCGCGGTGTCTTCTCGGATACAC
ACCCTGACCCGCCAAACACTTACCGTGGTGCAGGACTGGCTCACTCACCCCGACACCACC
GGCACCCGACTGGTCATCGTGACCCGCCACGGCGTCAGCACCAGTGCCCACGACCCGGTC
CCCGACCTAGCCCACGCCGCAGTGTGGGGCCTGATCCGCAGCGCCCAAAACGAACACCCC
GGACGCTTCACACTGCTCGACACCGACGACAACACCAACAGCGACACCCTCACCACCGCC
CTAACCCTGCCAACCCGCGAAAACCAACTGGCCATACGCCGCGACACCATCCACATCCCC
CGCCTGACCCGACACAGCAGTGACGGTGCGCTCACTGCGCCGGTGGTGGTAGATCCTGAG
GGCACGGTGTTGATCACCGGGGGGACCGGGACGCTGGGTGCCTTGTTCGCCGAGCATCTG
GTTTCTGCCCATGGTGTCCGGCATCTGTTGTTGACCTCGCGGCGCGGACCTCAGGCCCAC
GGTGCCACCGATCTGCAGCAGCGGCTCACCGATCTAGGTGCTCATGTCACCATCACGGCC
TGCGATATCAGCGACCCCGAAGCACTGGCCGCCCTGGTCAATTCAGTGCCCACACAACAC
CGTTTAACCGCGGTAGTGCACACCGCCGCGGTATTGGCCGACACCCCGGTCACCGAGTTG
ACCGGCGATCAACTCGACCAGGTGCTGGCCCCCAAAATCGACGCGGCATGGCAGCTGCAC
CAACTCACCTACGAACACAACCTGTCTGCATTCATCATGTTCTCGTCCATGGCCGGAATG
ATAGGCAGTCCCGGTCAGGGTAACTACGCGGCAGCCAACACCGCGTTAGATGCTCTCGCC
GACTACCGCCACCGCCTGGGCTTGCCCGCGACCAGCCTGGCCTGGGGCTACTGGCAGACT
CACACCGGTCTCACCGCGCATCTAACCGATGTAGATCTAGCCCGCATGACCCGCCTGGGT
TTGATGCCCATCGCCACCAGCCACGGACTGGCCCTGTTCGATGCCGCCCTCGCCACCGGA
CAGCCCGTTTCGATACCCGCCCCGATCAACACCCACACCCTGGCCCGACACGCCCGCGAC
AACACCCTGGCCCCGATCCTGTCTGCGCTGATCACCACACCACGGCGCCGGGCGGCCTCT
GCCGCAACCGATCTCGCTGCCCGCCTCAACGGACTTAGCCCCCAACAGCAACAACAAACA
CTGGCCACCCTCGTGGCCGCGGCCACCGCCACCGTGCTGGGCCACCACACCCCCGAAAGC
ATCAGCCCAGCCACCGCGTTCAAAGACCTCGGAATCGATTCGCTGACCGCCCTTGAACTG
CGCAACACCCTCACCCACAACACCGGCCTGGATCTGCCCCCCACCCTCATCTTCGATCAC
CCCACACCCCATGCGGTGGCCGAGCATCTGCTTGAACAGATCCCTGGCATCGGTGCCCTG
GTGCCGGCTCCGGTGGTGATCGCAGCTGGTCGTACCGAGGAGCCGGTGGCGGTGGTGGGG
ATGGCGTGTCGTTTCCCCGGTGGTGTCGCATCAGCGGATCAGTTGTGGGACTTGGTGATC
GCTGGCCGTGATGTGGTGGGTAATTTTCCGGCCGATCGGGGTTGGGATGTGGAGGGACTG
TTTGATCCCGATCCGGACGCGGTCGGCAAAACCTACACCCGTTACGGCGCGTTCCTTGAC
GATGCGGCAGGTTTTGATGCCGGGTTCTTTGGGATCTCTCCACGGGAGGCACGCGCGATG
GACCCCCAGCAGCGGCTGCTGCTGGAGGTGTGCTGGGAAGCGCTAGAAACCGCGGGTATT
CCCGCGCACACCTTGGCCGGCACCTCCACCGGGGTATTCGCCGGAGCCTGGGCCCAGTCC
TACGGCGCCACCAACTCCGATGACGCTGAGGGGTATGCGATGACCGGCGGCTCGACTAGC
GTCATGTCCGGCCGTATCGCCTACACCTTGGGCCTAGAAGGTCCAGCGATCACCGTTGAC
ACCGCCTGCTCGTCATCGCTGGTGGCAATTCACCTGGCCTGCCAATCCTTACGCAACAAC
GAATCCCAGCTAGCACTGGCCGGCGGCGTCACCGTGATGAGCACACCTGCGGTTTTCACC
GAGTTCTCCCGCCAACGCGGCCTGGCCCCAGATGGACGCTGCAAAGCCTTCGCCGCTACC
GCCGATGGCACCGGCTTTGGTGAAGGCGCCGCGGTCTTGGTCCTTGAACGGCTCTCCGAG
GCCCGCCGCAACAACCACCCGGTCCTTGCGATCGTCGCTGGATCGGCGATCAACCAAGAC
GGCGCATCCAACGGACTGACCGCACCCCACGGCCCGTCACAACAACGCGTCATCAACCAA
GCACTAGCCAACGCCGGCCTCACCCACGACCAGGTCGACGCCGTCGAAGCCCACGGCACC
GGCACCACACTGGGTGACCCCATCGAAGCCAGCGCCCTACACGCCACCTACGGCCACCAC
CACACGCCCGATCAACCGCTTTGGCTGGGATCCATCAAATCCAACATCGGCCACACCCAA
GCCGCCGCCGGCGCCGCCGGTGTGGTCAAGATGATCCAAGCCATCACCCACGCCACCTTG
CCCGCCACCTTGCACGTCGACCAACCCAGCCCCCACATCGACTGGTCCAGCGGCACAGTC
CGACTCCTAACCGAGCCCATCCAATGGCCCAACACCGACCACCCCCGCACCGCGGCGGTG
TCCTCATTCGGCATCAGCGGCACCAACGCCCACCTCATCCTCCAACAACCCCCCACCCCC
GACACCACACAAACCCCCAACCCCACAACAGGTTCTGATCCCGCAGTGGGTTCTGATCCC
GCAGTGGGTGTACTGGTGTGGCCGTTGTCAGCGCGTTCAGCGCCGGGGTTAAGCGCACAA
GCGGCCCGTCTGTACCAGCATCTCAGCGCCCACCCCGATCTGGATCCGATCGATGTAGCC
CACAGCCTGGCTACCACACGCAGCCACCACCCCCACCGCGCCACCATCACCACCAGCATT
GAGCACCACAGCGAAAACAACCACGACACAACCGATGCGCTGGCCGCACTGCACGCCCTG
GCCAACAACGGCACACACCCCCTGCTGAGCAGAGGCCTGCTGACCCCACAGGGCCCCGGC
AAAACAGTGTTCGTGTTCCCCGGACAGGGCAGTCAATACCCCGGCATGGGCGCAGATCTC
TACCGCCAATTCCCCGTGTTCGCCCACGCCCTCGACGAGGTCGCTGCGGCGCTGAACCCG
CATCTCGATGTTGCGTTGCTTGAGGTGATGTTCAGCCAACAAGACACTGCCATGGCGCAA
CTGCTGGACCAGACCTTCTATGCACAACCGGCGTTGTTCGCGCTGGGAACCGCTCTACAT
CGATTGTTCACCCACGCCGGTATCCACCCGGACTACCTGCTAGGCCACTCCATCGGAGAA
CTCACCGCGGCATACGCCGCCGGTGTGCTGTCACTGCAAGACGCAGCCACCTTGGTCACA
AGCCGAGGACGACTGATGCAATCCTGCACGCCCGGCGGGACGATGCTCGCACTACAAGCC
AGCGAAGCAGAAGTACAACCGCTGCTTGAAGGCCTAGACCACGCCGTGTCCATCGCCGCG
ATCAACGGAGCAACGTCGATCGTACTGTCAGGAGATCACGACAGCCTCGAACAAATCGGC
GAGCACTTCATTACCCAAGATCGACGTACCACCCGACTGCAGGTCAGTCACGCTTTCCAC
TCTCCACATATGGACCCCATCCTCGAACAATTCCGCCAGATCGCGGCCCAACTCACCTTC
AGCGCACCCACCCTGCCCATCTTGTCCAACCTCACCGGGCAGATCGCCCGCCACGACCAA
CTCGCCTCACCTGACTATTGGACCCAACAGCTACGTAACACTGTCCGGTTCCATGACACT
GTCGCTGCCCTGCTCGGGGCGGGTGAGCAGGTTTTCCTGGAACTTTCACCTCACCCGGTG
TTGACACAAGCGATCACCGACACCGTCGAACAAGCCGGCGGCGGCGGCGCAGCAGTGCCA
GCTCTACGCAAGGATCGCCCTGATGCTGTCGCGTTCGCTGCAGCACTCGGCCAGCTGCAC
TGCCATGGCATCAGCCCATCCTGGAATGTTCTTTACTGCCAGGCCCGCCCCCTCACACTG
CCCACCTACGCTTTCCAGCATCAGCGTTACTGGCTGCTGCCCACCGCTGGTGATTTCAGC
GGGGCCAATACCCACGCCATGCATCCGCTGCTAGACACCGCCACCGAACTGGCCGAAAAC
CGCGGATGGGTGTTCACCGGCCGGATCAGCCCACGCACCCAACCATGGCTAAACGAACAC
GCCGTCGAATCAGCCGTGCTGTTCCCAGGCACCGGATTTGTCGAGCTAGCGCTGCATGTC
GCTGACCGTGCCGGATATTCCTCGGTCAACGAACTGATCGTGCACACCCCCCTGCTGCTC
GCTGGCCACGACACCGCGGATCTACAGATCACCGTCACCGACACCGATGACATGGGCCGG
CAGTCTCTTAACATCCACTCGCGCCCACATATCGGCCATGACAACACCACCACCGGCGAT
GAACAACCCGAGTGGGTCCTGCATGCCAGCGCAGTCCTGACCGCACAAACCACCGACCAC
AACCACCTCCCCCTAACGCCTGTGCCGTGGCCTCCACCCGGCACAGCCGCGATCGAGGTG
GATGACTTCTACGACGACCTGGCTGCACAGGGCTACAACTACGGCCCGACATTCCAAGGT
GTGCAACGGATATGGCGTGACCACGCCACACCCGATGTCATCTACGCCGAAGTTGAACTA
CCCGAAGACACCGACATCGACGGCTACGGCATCCACCCCGCCCTATTCGACGCCGCTTTA
CACCCCCTACTCGCCCTGACCCAACCCCCCACCAACGACACCGATGACACCAACACCGCA
GACACCGGTGACCAGGTGCGGCTGCCCTACGCCTTTACCGGCATCAGTTTGCACGCCACC
CACGCCACCCGATTGCGGGTACGGCTGACCCGTACCGGCGCCGATGCCATCACCGTGCAC
ACCAGTGACACCACCGGAGCCCCGGTGGCGATCATCGACTCATTGATCACCCGCCCCCTC
ACCACCGCCACAGGGTCTGCTCCGGCAACCACAGCAGCTGGCCTACTACACCTGAGCTGG
CCACCACACCCTGACACCACGACCGACACCGACACCGACACCGATGCCCTGCGGTATCAG
GTGATCGCCGAACCCACTCAACAACTGCCCCGCTACCTGCACGACCTACACACCAGCACC
GACCTGCACACCAGCACCACCGAAGCAGACGTGGTTGTGTGGCCGGTACCGGTGCCCAGC
AACGAAGAGCTCCAGGCACACCAAGCATCCGACACCGCGGTGTCTTCTCGGATACACACC
CTGACCCGCCAAACACTTACCGTGGTGCAGGACTGGCTCACTCACCCCGACACCACCGGC
ACCCGACTGGTCATCGTGACCCGCCACGGCGTCAGCACCAGTGCCCACGACCCGGTCCCC
GACCTAGCCCACGCCGCAGTGTGGGGCCTGATCCGCAGCGCCCAAAACGAACACCCCGGA
CGCTTCACACTGCTCGACACCGACGACAACACCAACAGCGACACCCTCACCACCGCCCTA
ACCCTGCCAACCCGCGAAAACCAACTGGCCATACGCCGCGACACCATCCACATCCCCCGC
CTGACCCGACACAGCAGTGACGGTGCGCTCACTGCGCCGGTGGTGGTAGATCCTGAGGGC
ACGGTGTTGATCACCGGGGGGACCGGGACGCTGGGTGCCTTGTTCGCCGAGCATCTGGTT
TCTGCCCATGGTGTCCGGCATCTGTTGTTGACCTCGCGGCGCGGACCTCAGGCCCACGGT
GCCACCGATCTGCAGCAGCGGCTCACCGATCTAGGTGCTCATGTCACCATCACGGCCTGC
GATATCAGCGACCCCGAAGCACTGGCCGCCCTGGTCAATTCAGTGCCCACACAACACCGT
TTAACCGCGGTAGTGCACACCGCCGCGGTATTGGCCGACACCCCGGTCACCGAGTTGACC
GGCGATCAACTCGACCAGGTGCTGGCCCCCAAAATCGACGCGGCATGGCAGCTGCACCAA
CTCACCTACGAACACAACCTGTCTGCATTCATCATGTTCTCGTCCATGGCCGGAATGATA
GGCAGTCCCGGTCAGGGTAACTACGCGGCAGCCAACACCGCGTTAGATGCTCTCGCCGAC
TACCGCCACCGCCTGGGCTTGCCCGCGACCAGCCTGGCCTGGGGCTACTGGCAGACTCAC
ACCGGTCTCACCGCGCATCTAACCGATGTAGATCTAGCCCGCATGACCCGCCTGGGTTTG
ATGCCCATCGCCACCAGCCACGGACTGGCCCTGTTCGATGCCGCCCTCGCCACCGGACAG
CCCGTTTCGATACCCGCCCCGATCAACACCCACACCCTGGCCCGACACGCCCGCGACAAC
ACCCTGGCCCCGATCCTGTCTGCGCTGATCACCACACCACGGCGCCGGGCGGCCTCTGCC
GCAACCGATCTCGCTGCCCGCCTCAACGGACTTAGCCCCCAACAGCAACAACAAACACTG
GCCACCCTCGTGGCCGCGGCCACCGCCACCGTGCTGGGCCACCACACCCCCGAAAGCATC
AGCCCAGCCACCGCGTTCAAAGACCTCGGAATCGATTCGCTGACCGCCCTTGAACTGCGC
AACACCCTCACCCACAACACCGGCCTGGATCTGCCCCCCACCCTCATCTTCGATCACCCC
ACACCCCATGCGGTGGCCGAGCATCTGCTTGAACAGATCCCTGGCATCGGTGCCCTGGTG
CCGGCTCCGGTGGTGATCGCAGCTGGTCGTACCGAGGAGCCGGTGGCGGTGGTGGGGATG
GCGTGTCGTTTCCCCGGTGGTGTCGCATCAGCGGATCAGTTGTGGGACTTGGTGATCGCT
GGCCGTGATGTGGTGGGTAATTTTCCGGCCGATCGGGGTTGGGATGTGGAGGGACTGTTT
GATCCCGATCCGGACGCGGTCGGCAAAACCTACACCCGTTACGGCGCGTTCCTTGACGAT
GCGGCAGGTTTTGATGCCGGGTTCTTTGGGATCTCTCCACGGGAGGCACGCGCGATGGAC
CCCCAGCAGCGGCTGCTGCTGGAGGTGTGCTGGGAAGCGCTAGAAACCGCGGGTATTCCC
GCGCACACCTTGGCCGGCACCTCCACCGGGGTATTCGCCGGAGCCTGGGCCCAGTCCTAC
GGCGCCACCAACTCCGATGACGCTGAGGGGTATGCGATGACCGGCGGCGCGACTAGCGTC
ATGTCCGGCCGTATCGCCTACACCTTGGGCCTAGAAGGTCCAGCGATCACCGTTGACACC
GCCTGCTCGTCATCGCTGGTGGCAATTCACCTGGCCTGCCAATCCTTACGCAACAACGAA
TCCCAGCTAGCACTGGCCGGCGGCGTCACCGTGATGAGCACACCTGCGGTTTTCACCGAG
TTCTCCCGCCAACGCGGCCTGGCCCCAGATGGACGCTGCAAAGCCTTCGCCGCTACCGCC
GATGGCACCGGCTTTGGTGAAGGCGCCGCGGTCTTGGTCCTTGAACGGCTCTCCGAGGCC
CGCCGCAACAACCACCCGGTCCTTGCGATCGTCGCTGGATCGGCGATCAACCAAGACGGC
GCATCCAACGGACTGACCGCACCCCACGGCCCGTCACAACAACGCGTCATCAACCAAGCA
CTAGCCAACGCCGGCCTCACCCACGACCAGGTCGACGCCGTCGAAGCCCACGGCACCGGC
ACCACACTGGGTGACCCCATCGAAGCCAGCGCCCTACACGCCACCTACGGCCACCACCAC
ACGCCCGATCAACCGCTTTGGCTGGGATCCATCAAATCCAACATCGGCCACACCCAAGCC
GCCGCCGGCGCCGCCGGTGTGGTCAAGATGATCCAAGCCATCACCCACGCCACCTTGCCC
GCCACCTTGCACGTCGACCAACCCAGCCCCCACATCGACTGGTCCAGCGGCACAGTCCGA
CTCCTAACCGAGCCCATCCAATGGCCCAACACCGACCACCCCCGCACCGCGGCGGTGTCC
TCATTCGGCATCAGCGGCACCAACGCCCACCTCATCCTCCAACAACCCCCCACCCCCGAC
ACCACACAAACCCCCAACACCACAACAGGTTCTGATCCCGCAGTGGGTTCTGATCCCGCA
GTGGGTGTACTGGTGTGGCCGTTGTCAGCGCGTTCAGCGCCGGGGTTAAGCGCACAAGCG
GCCCGTCTGTACCAGCATCTCAGCGCCCACCCCGATCTGGATCCGATCGATGTAGCCCAC
AGCCTGGCTACCACACGCAGCCACCACCCCCACCGCGCCACCATCACCACCAGCATTGAG
CACCACAGCGAAAACAACCACGACACAACCGATGCGCTGGCCGCACTGCACGCCCTGGCC
AACAACGGCACACACCCCCTGCTGAGCAGAGGCCTGCTGACCCCACAGGGCCCCGGCAAA
ACAGTGTTCGTGTTCCCCGGACAGGGCAGTCAATACCCCGGCATGGGCGCAGATCTCTAC
CGCCAATTCCCCGTGTTCGCCCACGCCCTCGACGCATGCGACGCAGCGTTACAGCCTTTC
ACTGGATGGTCGGTGCTAGCTGTGTTACACGACGAACCCGAGGCCCCGTCGTTGGAGCGG
GTCGATGTGGTCCAGCCTGTGTTGTTCTCGGTGATGGTGTCGTTAGCCGCACTCTGGCGG
TGGGCCGGAATCACCCCCGATGCAGTCATCGGCCACTCCCAGGGCGAGATCGCCGCGGCA
CATGTGGCCGGAGCCCTGACCTTGCCCGAAGCAGCTGCGGTAGTGGCTTTGCGCAGCCGT
GTCTTGACCGACCTGGCCGGTGCCGGTGCCATGGCTTCAGTGCTATCGCCCGAGGAACCA
CTGACCCAGCTGCTGGCACGGTGGGACGGCAAGATCACTGTCGCCGCAGTTAACGGCCCC
GCTAGCGCTGTGGTCTCCGGCGATACCACAGCGATCACCGAATTGCTGATTACCTGCGAA
CACGAAAACATCGACGCTCGCGCTATCCCGGTGGACTACCCCTCTCATTCCCCCTATATG
GAACACATCCGCCATCAGTTCCTCGACGAGCTACCCGAGCTGACACCGCGGCCATCAACC
ATCGCGATGTATTCCACCGTCGACGGCGAACCTCACGACACCGCCTACGACACCACCACA
ATGACCGCGGACTACTGGTACCGCAACATCCGTAACACTGTCCGGTTCCATGACACTGTC
GCTGCCCTGCTCGGGGCGGGTGAGCAGGTTTTCCTGGAACTTTCACCTCACCCGGTGTTG
ACACAAGCGATCACCGACACCGTCGAACAAGCCGGCGGCGGCGGCGCAGCAGTGCCAGCT
CTACGCAAGGATCGCCCTGATGCTGTCGCGTTCGCTGCAGCACTCGGCCAGCTGCACTGC
CATGGCATCAGCCCATCCTGGAATGTTCTTTACTGCCAGGCCCGCCCCCTCACACTGCCC
ACCTACGCTTTCCAGCATCAGCGTTACTGGCTGCTGCCCACCGCTGGTGATTTCAGCGGG
GCCAATACCCACGCCATGCATCCGCTGCTAGACACCGCCACCGAACTGGCCGAAAACCGC
GGATGGGTGTTCACCGGCCGGATCAGCCCACGCACCCAACCATGGCTAAACGAACACGCC
GTCGAATCAGCCGTGCTGTTCCCAGGCACCGGATTTGTCGAGCTAGCGCTGCATGTCGCT
GACCGTGCCGGATATTCCTCGGTCAACGAACTGATCGTGCACACCCCCCTGCTACTCGCT
GGCCACGACACCGCGGATCTACAGATCACCGTCACCGACACCGATGACATGGGCCGGCAG
TCTCTTAACATCCACTCGCACCCACATATCGGCCATGACAACACCACCACCGGCGATGAA
CAACCCGAGTGGGTCCTGCATGCCAGCGCAGTCCTGACCGCACAAACCACCGACCACAAC
CACCTCCCCCTAACGCCTGTGCCGTGGCCTCCACCCGGCACAGCCGCGATCGAGGTGGAT
GACTTCTACGACGACCTGGCTGCACAGGGCTACAACTACGGCCCGACATTCCAAGGTGTG
CAACGGATATGGCGTGACCACGCCACACCCGATGTCATCTACGCCGAAGTTGAACTACCC
GAAGACACCGACATCGACGGCTACGGCATCCACCCCGCCCTATTCGACGCCGCTTTACAC
CCCCTACTCGCCCTGACCCAACCCCCCACCAACGACACCGATGACACCAACACCGCAGAC
ACCGGTGACCAGGTGCGGCTGCCCTACGCCTTTACCGGCATCAGTTTGCACGCCACCCAC
GCCACCCGATTGCGGGTACGGCTGACCCGTACCGGCGCCGATGCCATCACCGTGCACACC
AGTGACACCACCGGAGCCCCGGTGGCGATCATCGACTCATTGATCACCCGCCCCCTCACC
ACCGCCACAGGGTCTGCTCCGGCAACCACAGCAGCTGGCCTACTACACCTGAGCTGGCCA
CCACACCCTGACACCACGACCGACACCGACACCGACACCGATGCCCTGCGGTATCGGGTG
ATCGCCGAACCCACTCAACAACTGCCCCGCTACCTGCACGACCTACACACCAGCACCGAC
CTGCACACCAGCACCACCGAAGCAGACGTGGTTGTGTGGCCGGTACCGGTGCCCAGCAAC
GAAGAGCTCCAGGCACACCAAGCATCCGACACCGCGGTGTCTTCTCGGATACACACCCTG
ACCCGCCAAACACTTACCGTGGTGCAGGACTGGCTCACTCACCCCGACACCACCGGCACC
CGACTGGTCATCGTGACCCGCCACGGCGTCAGCACCAGTGCCCACGACCCGGTCCCCGAC
CTAGCCCACGCCGCAGTGTGGGGCCTGATCCGCAGCGCCCAAAACGAACACCCCGGACGC
TTCACACTGCTCGACACCGACGACAACACCAACAGCGACACCCTCACCACCGCCCTAACC
CTGCCAACCCGCGAAAACCAACTGGCCATACGCCGCGACACCATCCACATCCCCCGCCTG
ACCCGACACAGCAGTGACGGTGCGCTCACTGCGCCGGTGGTGGTAGATCCTGAGGGCACG
GTGTTGATCACCGGGGGGACCGGGACGCTGGGTGCCTTGTTCGCCGAGCATCTGGTTTCT
GCCCATGGTGTCCGGCATCTGTTGTTGACCTCGCGGCGCGGACCTCAGGCCCACGGTGCC
ACCGATCTGCAGCAGCGGCTCACCGATCTAGGTGCTCATGTCACCATCACGGCCTGCGAT
ATCAGCGACCCCGAAGCACTGGCCGCCCTGGTCAATTCAGTGCCCACACAACACCGTTTA
ACCGCGGTAGTGCACACCGCCGCGGTATTGGCCGACACCCCGGTCACCGAGTTGACCGGC
GATCAACTCGACCAGGTGCTGGCCCCCAAAATCGACGCGGCATGGCAGCTGCACCAACTC
ACCTACGAACACAACCTGTCTGCATTCATCATGTTCTCGTCCATGGCCGGAATGATAGGC
AGTCCCGGTCAGGGTAACTACGCGGCAGCCAACACCGCGTTAGATGCTCTCGCCGACTAC
CGCCACCGCCTGGGCTTGCCCGCGACCAGCCTGGCCTGGGGCTACTGGCAGACTCACACC
GGTCTCACCGCGCATCTAACCGATGTAGATCTAGCCCGCATGACCCGCCTGGGTTTGATG
CCCATCGCCACCAGCCACGGACTGGCCCTGTTCGATGCCGCCCTCGCCACCGGACAGCCC
GTTTCGATACCCGCCCCGATCAACACCCACACCCTGGCCCGACACGCCCGCGACAACACC
CTGGCCCCGATCCTGTCTGCGCTGATCACCACACCACGGCGCCGGGCGGCCTCTGCCGCA
ACCGATCTCGCTGCCCGCCTCAACGGACTTAGCCCCCAACAGCAACAACAAACACTGGCC
ACCCTCGTGGCCGCGGCCACCGCCACCGTGCTGGGCCACCACACCCCCGAAAGCATCAGC
CCAGCCACCGCGTTCAAAGACCTCGGAATCGATTCGCTGACCGCCCTTGAACTGCGCAAC
ACCCTCACCCACAACACCGGCCTGGATCTGCCCCCCACCCTCATCTTCGATCACCCCACA
CCCCATGCGGTGGCCGAGCATCTGCTTGAACAGATCCCTGGCATCGGTGCCCTGGTGCCG
GCTCCGGTGGTGATCGCAGCTGGTCGTACCGAGGAGCCGGTGGCGGTGGTGGGGATGGCG
TGTCGTTTCCCCGGTGGTGTCGCATCAGCGGATCAGTTGTGGGACTTGGTGATCGCTGGC
CGTGATGTGGTGGGTAATTTTCCGGCCGATCGGGGTTGGGATGTGGAGGGACTGTTTGAT
CCCGATCCGGACGCGGTCGGCAAAACCTACACCCGTTACGGCGCGTTCCTTGACGATGCG
GCAGGTTTTGATGCCGGGTTCTTTGGGATCTCTCCACGGGAGGCACGCGCGATGGACCCC
CAGCAGCGGCTGCTGCTGGAGGTGTGCTGGGAAGCGCTAGAAACCGCGGGTATTCCCGCG
CACACCTTGGCCGGCACCTCCACCGGGGTATTCGCCGGAGCCTGGGCCCAGTCCTACGGC
GCCACCAACTCCGATGACGCTGAGGGGTATGCGATGACCGGCGGCGCGACTAGCGTCATG
TCCGGCCGTATCGCCTACACCTTGGGCCTAGAAGGTCCAGCGATCACCGTTGACACCGCC
TGCTCGTCATCGCTGGTGGCAATTCACCTGGCCTGCCAATCCTTACGCAACAACGAATCC
CAGCTAGCACTGGCCGGCGGCGTCACCGTGATGAGCACACCTGCGGTTTTCACCGAGTTC
TCCCGCCAACGCGGCCTGGCCCCAGATGGACGCTGCAAAGCCTTCGCCGCTACCGCCGAT
GGCACCGGCTTTGGTGAAGGCGCCGCGGTCTTGGTCCTTGAACGGCTCTCCGAGGCCCGC
CGCAACAACCACCCGGTCCTTGCGATCGTCGCTGGATCGGCGATCAACCAAGACGGCGCA
TCCAACGGACTGACCGCACCCCACGGCCCGTCACAACAACGCGTCATCAACCAAGCACTA
GCCAACGCCGGCCTCACCCACGACCAGGTCGACGCCGTCGAAGCCCACGGCACCGGCACC
ACACTGGGTGACCCCATCGAAGCCAGCGCCCTACACGCCACCTACGGCCACCACCACACG
CCCGATCAACCGCTTTGGCTGGGATCCATCAAATCCAACATCGGCCACACCCAAGCCGCC
GCCGGCGCCGCCGGTGTGGTCAAGATGATCCAAGCCATCACCCACGCCACCTTGCCCGCC
ACCTTGCACGTCGACCAACCCAGCCCCCACATCGACTGGTCCAGCGGCACAGTCCGACTC
CTAACCGAGCCCATCCAATGGCCCAACACCGACCACCCCCGCACCGCGGCGGTGTCCTCA
TTCGGCATCAGCGGCACCAACGCCCACCTCATCCTCCAACAACCCCCCACCCCCGACACC
ACACAAACCCCCAACACCACAACAGGTTCTGATCCCGCAGTGGGTTCTGATCCCGCAGTG
GGTGTACTGGTGTGGCCGTTGTCAGCGCGTTCAGCGCCGGGGTTAAGCGCACAAGCGGCC
CGTCTGTACCAGCATCTCAGCGCCCACCCCGATCTGGATCCGATCGATGTAGCCCACAGC
CTGGCTACCACACGCAGCCACCACCCCCACCGCGCCACCATCACCACCAGCATTGAGCAC
CACAGCGAAAACAACCACGACACAACCGATGCGCTGGCCGCACTGCACGCCCTGGCCAAC
AACGGCACACACCCCCTGCTGAGCAGAGGCCTGCTGACCCCACAGGGCCCCGGCAAAACA
GTGTTCGTGTTCCCCGGACAGGGCAGTCAATACCCCGGCATGGGCGCAGATCTCTACCGC
CAATTCCCCGTGTTCGCCCACGCCCTCGACGCATGCGACGCAGCGTTACAGCCTTTCACT
GGATGGTCGGTGCTAGCTGTGTTACACGACGAACCCGAGGCCCCGTCGTTGGAGCGAGTC
GATGTGGTCCAGCCTGTGTTGTTCTCGGTGATGGTGTCGTTAGCCGCACTCTGGCGGTGG
GCCGGAATCACCCCCGATGCAGTCATCGGCCACTCCCAGGGCGAGATCGCCGCGGCACAT
GTGGCCGGAGCCCTGACCTTGCCCGAAGCAGCTGCGGTAGTGGCTTTGCGCAGCCGTGTC
TTGACCGACCTGGCCGGTGCCGGTGCCATGGCTTCAGTGCTATCGCCCGAGGAACCACTG
ACCCAGCTGCTGGCACGGTGGGACGGCAAGATCACTGTCGCCGCAGTTAACGGCCCCGCT
AGCGCTGTGGTCTCCGGCGATACCACAGCGATCACCGAATTGCTGATTACCTGCGAACAC
GAAAACATCGACGCTCGCGCTATCCCGGTGGACTACCCCTCTCATTCCCCCTATATGGAA
CACATCCGCCATCAGTTCCTCGACGAGCTACCCGAGCTGACACCGCGGCCATCAACCATC
GCGATGTATTCCACCGTCGACGGCGAACCTCACGACACCGCCTACGACACCACCACAATG
ACCGCGGACTACTGGTACCGCAACATCCGTAACACTGTCCGGTTCCATGACACTGTCGCT
GCCCTGCTCGGGGCGGGTGAGCAGGTTTTCCTGGAACTTTCACCTCACCCGGTGTTGACA
CAAGCGATCACCGACACCGTCGAACAAGCCGGCGGCGGCGGCGCAGCAGTGCCAGCTCTA
CGCAAGGATCGCCCTGATGCTGTCGCGTTCGCTGCAGCACTCGGCCAGCTGCACTGCCAT
GGCATCAGCCCATCCTGGAATGTTCTTTACTGCCAGGCCCGCCCCCTCACACTGCCCACC
TACGCTTTCCAGCATCAGCGTTACTGGCTGCTGCCCACCGCTGGTGATTTCAGCGGGGCC
AATACCCACGCCATGCATCCGCTGCTAGACACCGCCACCGAACTGGCCGAAAACCGCGGA
TGGGTGTTCACCGGCCGGATCAGCCCACGCACCCAACCATGGCTAAACGAACACGCCGTC
GAATCAGCCGTGCTGTTCCCAGGCACCGGATTCGTCGAGCTAGCGCTGCATGTCGCTGAC
CGTGCCGGATATTCCTCGGTCAACGAACTGATCGTGCACACCCCCCTGCTACTCGCTGGC
CACGACACCGCGGATCTACAGATCACCGTCACCGACACCGATGACATGGGCCGGCAGTCT
CTTAACATCCACTCGCGCCCACATATCGGCCATGACAACACCACCACCGGCGATGAACAA
CCCGAGTGGGTCCTGCATGCCAGCGCAGTCCTGACCGCACAAACCACCGACCACAACCAC
CTCCCCCTAACGCCTGTGCCGTGGCCTCCACCCGGCACAGCCGCGATCGAGGTGGATGAC
TTCTACGACGACCTGGCTGCACAGGGCTACAACTACGGCCCGACATTCCAAGGTGTGCAA
CGGATATGGCGTGACCACGCCACACCCGATGTCATCTACGCCGAAGTTGAACTACCCGAA
GACACCGACATCGACGGCTACGGCATCCACCCCGCCCTATTCGACGCCGCTTTACACCCC
CTACTCGCCCTGACCCAACCCCCCACCAACGACACCGATGACACCAACACCGCAGACACC
GGTGACCAGGTGCGGCTGCCCTACGCCTTTACCGGCATCAGTTTGCACGCCACCCACGCC
ACCCGATTACGGGTACGGCTGACCCGTACCGGCGCCGATGCCATCACCGTGCACACCAGT
GACACCACCGGAGCCCCGGTGGCGATCATCGACTCATTGATCACCCGCCCCCTCACCACC
GCCACAGGGTCTGCTCCGGCAACCACAGCAGCTGGCCTACTACACCTGAGCTGGCCACCA
CACCCTGACACCACGACCGACACCGACACCGACACCGATGCCCTGCGGTATCAGGTGATC
GCCGAACCCACTCAACAACTGCCCCGCTACCTGCACGACCTACACACCAGCACCGACCTG
CACACCAGCACCACCGAAGCAGACGTGGTTGTGTGGCCGGTACCGGTGCCCAGCAACGAA
GAGCTCCAGGCACACCAAGCATCCGACACCGCGGTGTCTTCTCGGATACACACCCTGACC
CGCCAAACACTTACCGTGGTGCAGGACTGGCTCACTCACCCCGACACCACCGGCACCCGA
CTGGTCATCGTGACCCGCCACGGCGTCAGCACCAGTGCCCACGACCCGGTCCCCGACCTA
GCCCACGCCGCAGTGTGGGGCCTGATCCGCAGCGCCCAAAACGAACACCCCGGACGCTTC
ACACTGCTCGACACCGACGACAACACCAACAGCGACACCCTCACCACCGCCCTAACCCTG
CCAACCCGCGAAAACCAACTGGCCATACGCCGCGACACCATCCACATCCCCCGCCTGACC
CGACACAGCAGTGACGGTGCGCTCACTGCGCCGGTGGTGGTAGATCCTGAGGGCACGGTG
TTGATCACCGGGGGGACCGGGACGCTGGGTGCCTTGTTCGCCGAGCATCTGGTTTCTGCC
CATGGTGTCCGGCATCTGTTGTTGACCTCGCGGCGCGGACCTCAGGCCCACGGTGCCACC
GATCTGCAGCAGCGGCTCACCGATCTAGGTGCTCATGTCACCATCACGGCCTGCGATATC
AGCGACCCCGAAGCACTGGCCGCCCTGGTCAATTCAGTGCCCACACAACACCGTTTAACC
GCGGTAGTGCACACCGCCGCGGTATTGGCCGACACCCCGGTCACCGAGTTGACCGGCGAT
CAACTCGACCAGGTGCTGGCCCCCAAAATCGACGCGGCATGGCAGCTGCACCAACTCACC
TACGAACACAACCTGTCTGCATTCATCATGTTCTCGTCCATGGCCGGAATGATAGGCAGT
CCCGGTCAGGGTAACTACGCGGCAGCCAACACCGCGTTAGATGCTCTCGCCGACTACCGC
CACCGCCTGGGCTTGCCCGCGACCAGCCTGGCCTGGGGCTACTGGCAGACTCACACCGGT
CTCACCGCGCATCTAACCGATGTAGATCTAGCCCGCATGACCCGCCTGGGTTTGATGCCC
ATCGCCACCAGCCACGGACTGGCCCTGTTCGATGCCGCCCTCGCCACCGGACAGCCCGTT
TCGATACCCGCCCCGATCAACACCCACACCCTGGCCCGACACGCCCGCGACAACACCCTG
GCCCCGATCCTGTCTGCGCTGATCACCACACCACGGCGCCGGGCGGCCTCTGCCGCAACC
GATCTCGCTGCCCGCCTCAACGGACTTAGCCCCCAACAGCAACAACAAACACTGGCCACC
CTCGTGGCCGCGGCCACCGCCACCGTGCTGGGCCACCACACCCCCGAAAGCATCAGCCCA
GCCACCGCGTTCAAAGACCTCGGAATCGATTCGCTGACCGCCCTTGAACTGCGCAACACC
CTCACCCACAACACCGGCCTGGATCTGCCCCCCACCCTCATCTTCGATCACCCCACACCC
CATGCGGTGGCCGAGCATCTGCTTGAACAGATCCCTGGCATCGGTGCCCTGGTGCCGGCT
CCGGTGGTGATCGCAGCTGGTCGTACCGAGGAGCCGGTGGCGGTGGTGGGGATGGCGTGT
CGTTTCCCCGGTGGTGTCGCATCAGCGGATCAGTTGTGGGACTTGGTGATCGCTGGCCGT
GATGTGGTGGGTAATTTTCCGGCCGATCGGGGTTGGGATGTGGAGGGACTGTTTGATCCC
GATCCGGACGCGGTCGGCAAAACCTACACCCGTTACGGCGCGTTCCTTGACGATGCGGCA
GGTTTTGATGCCGGGTTCTTTGGGATCTCTCCACGGGAGGCACGCGCGATGGACCCCCAG
CAGCGGCTGCTGCTGGAGGTGTGCTGGGAAGCGCTAGAAACCGCGGGTATTCCCGCGCAC
ACCTTGGCCGGCACCTCCACCGGGGTATTCGCCGGAGCCTGGGCCCAGTCCTACGGCGCC
ACCAACTCCGATGACGCTGAGGGGTATGCGATGACCGGCGGCGCGACTAGCGTCATGTCC
GGCCGTATCGCCTACACCTTGGGCCTAGAAGGTCCAGCGATCACCGTTGACACCGCCTGC
TCGTCATCGCTGGTGGCAATTCACCTGGCCTGCCAATCCTTACGCAACAACGAATCCCAG
CTAGCACTGGCCGGCGGCGTCACCGTGATGAGCACACCTGCGGTTTTCACCGAGTTCTCC
CGCCAACGCGGCCTGGCCCCAGATGGACGCTGCAAAGCCTTCGCCGCTACCGCCGATGGC
ACCGGCTTTGGTGAAGGCGCCGCGGTCTTGGTCCTTGAACGGCTCTCCGAGGCCCGCCGC
AACAACCACCCGGTCCTTGCGATCGTCGCTGGATCGGCGATCAACCAAGACGGCGCATCC
AACGGACTGACCGCACCCCACGGCCCGTCACAACAACGCGTCATCAACCAAGCACTAGCC
AACGCCGGCCTCACCCACGACCAGGTCGACGCCGTCGAAGCCCACGGCACCGGCACCACA
CTGGGTGACCCCATCGAAGCCAGCGCCCTACACGCCACCTACGGCCACCACCACACGCCC
GATCAACCGCTTTGGCTGGGATCCATCAAATCCAACATCGGCCACACCCAAGCCGCCGCC
GGCGCCGCCGGTGTGGTCAAGATGATCCAAGCCATCACCCACGCCACCTTGCCCGCCACC
TTGCACGTCGACCAACCCAGCCCCCACATCGACTGGTCCAGCGGCACAGTCCGACTCCTA
ACCGAGCCCATCCAATGGCCCAACACCGACCACCCCCGCACCGCGGCGGTGTCCTCATTC
GGCATCAGCGGCACCAACGCCCACCTCATCCTCCAACAACCCCCCACCCCCGACACCACA
CAAACCCCCAACACCACAACAGGTTCTGATCCCGCAGTGGGTTCTGATCCCGCAGTGGGT
GTACTGGTGTGGCCGTTGTCAGCGCGTTCAGCGCCGGGGTTAAGCGCACAAGCGGCCCGT
CTGTACCAGCATCTCAGCGCCCACCCCGATCTGGATCCGATCGATGTAGCCCACAGCCTG
GCTACCACACGCAGCCACCACCCCCACCGCGCCACCATCACCACCAGCATTGAGCACCAC
AGCGAAAACAACCACGACACAACCGATGCGCTGGCCGCACTGCACGCCCTGGCCAACAAC
GGCACACACCCCCTGCTGAGCAGAGGCCTGCTGACCCCACAGGGCCCCGGCAAAACAGTG
TTCGTGTTCCCCGGACAGGGCAGTCAATACCCCGGCATGGGCGCAGATCTCTACCGCCAA
TTCCCCGTGTTCGCCCACGCCCTCGACGAGGTCGCTGCGGCGCTGAACCCGCATCTCGAT
GTTGCGTTGCTTGAGGTGATGTTCAGCCAACAAGACACTGCCATGGCGCAACTGCTGGAC
CAGACCTTCTATGCACAACCGGCGTTGTTCGCGCTGGGAACCGCTCTACATCGATTGTTC
ACCCACGCCGGTATCCACCCGGACTACCTGCTAGGCCACTCCATCGGAGAACTCACCGCG
GCATACGCCGCCGGTGTGCTGTCACTGCAAGACGCAGCCACCTTGGTCACAAGCCGAGGA
CGACTGATGCAATCCTGCACGCCCGGCGGGACGATGCTCGCACTACAAGCCAGCGAAGCA
GAAGTACAACCGCTGCTTGAAGGCCTAGACCACGCCGTGTCCATCGCCGCGATCAACGGA
GCAACGTCGATCGTACTGTCAGGAGATCACGACAGCCTCGAACAAATCGGCGAGCACTTC
ATTACCCAAGATCGACGTACCACCCGACTGCAGGTCAGTCACGCTTTCCACTCTCCACAT
ATGGACCCCATCCTCGAACAATTCCGCCAGATCGCGGCCCAACTCACCTTCAGCGCACCC
ACCCTGCCCATCTTGTCCAACCTCACCGGGCAGATCGCCCGCCACGACCAACTCGCCTCA
CCTGACTATTGGACCCAACAGCTACGTAACACTGTCCGGTTCCATGACACTGTCGCTGCC
CTGCTCGGGGCGGGTGAGCAGGTTTTCCTGGAACTTTCACCTCACCCGGTGTTGACACAA
GCGATCACCGACACCGTCGAACAAGCCGGCGGCGGCGGCGCAGCAGTGCCAGCTCTACGC
AAGGATCGCCCTGATGCTGTCGCGTTCGCTGCAGCACTCGGCCAGCTGCACTGCCATGGC
ATCAGCCCATCCTGGAATGTTCTTTACTGCCAGGCCCGCCCCCTCACACTGCCCACCTAC
GCTTTCCAGCATCAGCGTTACTGGCTGCTGCCCACCGCTGGTGATTTCAGCGGGGCCAAT
ACCCACGCCATGCATCCGCTGCTAGACACCGCCACCGAACTGGCCGAAAACCGCGGATGG
GTGTTCACCGGCCGGATCAGCCCACGCACCCAACCATGGCTAAACGAACACGCCGTCGAA
TCAGCCGTGCTGTTCCCAGGCACCGGATTTGTCGAGCTAGCGCTGCATGTCGCTGACCGT
GCCGGATATTCCTCGGTCAACGAACTGATCGTGCACACCCCCCTGCTACTCGCTGGCCAC
GACACCGCGGATCTACAGATCACCGTCACCGACACCGATGACATGGGCCGGCAGTCTCTT
AACATCCACTCGCGCCCACATATCGGCCATGACAACACCACCACCGGCGATGAACAACCC
GAGTGGGTCCTGCATGCCAGCGCAGTCCTGACCGCACAAACCACCGACCACAACCACCTC
CCCCTAACGCCTGTGCCGTGGCCTCCACCCGGCACAGCCGCGATCGAGGTGGATGACTTC
TACGACGACCTGGCTGCACAGGGCTACAACTACGGCCCGACATTCCAAGGTGTGCAACGG
ATATGGCGTGACCACGCCACACCCGATGTCATCTACGCCGAAGTTGAACTACCCGAAGAC
ACCGACATCGACGGCTACGGCATCCACCCCGCCCTATTCGACGCCGCTTTACACCCCCTA
CTCGCCCTGACCCAACCCCCCACCAACGACACCGATGACACCAACACCGCAGACACCGGG
GACCAGGTGCGGCTGCCCTACGCCTTTACCGGCATCAGTTTGCACGCCACCCACGCCACC
CGATTACGGGTACGGCTGACCCGTACCGGCGCCGATGCCATCACCGTGCACACCAGTGAC
ACCACCGGAGCCCCGGTGGCGATCATCGACTCATTGATCACCCGCCCCCTCACCACCGCC
ACAGGGTCTGCTCCGGCAACCACAGCAGCTGGCCTACTACACCTGAGCTGGCCACCACAC
CCTGACACCACGACCGACACCGACACCGACACCGATGCCCTGCGGTATCAGGTGATCGCC
GAACCCACTCAACAACTGCCCCGCTACCTGCACGACCTACACACCAGCACCGACCTGCAC
ACCAGCACCACCGAAGCAGACGTGGTTGTGTGGCCGGTACCGGTGCCCAGCAACGAAGAG
CTCCAGGCACACCAAGCATCCGACACCGCGGTGTCTTCTCGGATACACACCCTGACCCGC
CAAACACTTACCGTGGTGCAGGACTGGCTCACTCACCCCGACACCACCGGCACCCGACTG
GTCATCGTGACCCGCCACGGCGTCAGCACCAGTGCCCACGACCCGGTCCCCGACCTAGCC
CACGCCGCAGTGTGGGGCCTGATCCGCAGCGCCCAAAACGAACACCCCGGACGCTTCACA
CTGCTCGACACCGACGACAACACCAACAGCGACACCCTCACCACCGCCCTAACCCTGCCA
ACCCGCGAAAACCAACTGGCCATACGCCGCGACACCATCCACATCCCCCGCCTGACCCGA
CACAGCAGTGACGGTGCGCTCACTGCGCCGGTGGTGGTAGATCCTGAGGGCACGGTGTTG
ATCACCGGGGGGACCGGGACGCTGGGTGCCTTGTTCGCCGAGCATCTGGTTTCTGCCCAT
GGTGTCCGGCATCTGTTGTTGACCTCGCGGCGCGGACCTCAGGCCCACGGTGCCACCGAT
CTGCAGCAGCGGCTCACCGATCTAGGTGCTCATGTCACCATCACGGCCTGCGATATCAGC
GACCCCGAAGCACTGGCCGCCCTGGTCAATTCAGTGCCCACACAACACCGTTTAACCGCG
GTAGTGCACACCGCCGCGGTATTGGCCGACACCCCGGTCACCGAGTTGACCGGCGATCAA
CTCGACCAGGTGCTGGCCCCCAAAATCGACGCGGCATGGCAGCTGCACCAACTCACCTAC
GAACACAACCTGTCTGCATTCATCATGTTCTCGTCCATGGCCGGAATGATAGGCAGTCCC
GGTCAGGGTAACTACGCGGCAGCCAACACCGCGTTAGATGCTCTCGCCGACTACCGCCAC
CGCCTGGGCTTGCCCGCGACCAGCCTGGCCTGGGGCTACTGGCAGACTCACACCGGTCTC
ACCGCGCATCTAACCGATGTAGATCTAGCCCGCATGACCCGCCTGGGTTTGATGCCCATC
GCCACCAGCCACGGACTGGCCCTGTTCGATGCCGCCCTCGCCACCGGACAGCCCGTTTCG
ATACCCGCCCCGATCAACACCCACACCCTGGCCCGACACGCCCGCGACAACACCCTGGCC
CCGATCCTGTCTGCGCTGATCACCACACCACGGCGCCGGGCGGCCTCTGCCGCAACCGAT
CTCGCTGCCCGCCTCAACGGACTTAGCCCCCAACAGCAACAACAAACACTGGCCACCCTC
GTGGCCGCGGCCACCGCCACCGTGCTGGGCCACCACACCCCCGAAAGCATCAGCCCAGCC
ACCGCGTTCAAAGACCTCGGAATCGATTCGCTGACCGCCCTTGAACTGCGCAACACCCTC
ACCCACAACACCGGCCTGGATCTGCCCCCCACCCTCATCTTCGATCACCCCACACCCCAT
GCGCTAACCCAACACCTGCACACCCGACTCACCCAAAGCCATACCCCGGTCGGACCAATT
GCGTCCCTGCTAAGCCACGCGATCGATGAGGGCAAATTCCGTGCCGGCGCTGACCTATTG
ATGGCCGCATCCAATTTGAACCAAAGTTTCAGCAATATGGCTGAACTCAACCAGCTCCCG
GCCGTGACGGACATAGCTGACGCGTCTCCTGATGGGCTACTCACCCTGATCTGCATCTCT
ACCTCAGAGAATGAGTACGCTCGCCTCGCTGCTGCGAACATTCATTCACTGACCTTCGCT
GAAATTGCGGCGCCCGGCTTTTACGACGCGCAGCTGCCAAATTCGATAGAGACGTCGGCA
GAGGCGCTGGCAACTGCCATCACAGGCGCCTACGCAAATACGTCCATTGTTCTGGTAGCG
CACTCCATTGTCTGCGAGCTAGCTCAGGCAACGATGACACGTCTACAAGACGCTGACATC
GATCTTGTGGGTCTGGTTCTGTTGGATCCACTCGAAGGGACTAACAGCACTGAAGATTAT
GTGGAGACAGTCTTGACTCGAATCGAGCATATCAATGCACCGAGGGTCGGAGTAGACGGT
TACCTTGCCGCCCTGGGCCGCTATCTCCAATTCCACGAAGACCGCCGAATACCAATACCG
GAAACGCGGCACATGACACTGCACTCGGACACGAAAATTGACCGTGCCCAAACACCAATG
AACTTATTACAAGATGAGGCAGCGTTGACCGCCCTCAAAATAGGAAACTGGATGAACGAC
GTGGGTGTTGCCCTCTCTGTCAACCTTGAGTGA
Domain positions (amino-acid coordinates):
[0] KSQ 19..390
[0] AT 567..889
[0] malonyl-CoA 761..765
[0] dh 932..1105
[0] KR 1444..1624
[0] ACP 1724..1794
[1] KS 1817..2190
[1] AT 2359..2672
[1] malonyl-CoA 2546..2550
[1] KR 2985..3161
[1] ACP 3263..3333
[2] KS 3356..3729
[2] AT 3898..4211
[2] malonyl-CoA 4085..4089
[2] KR 4524..4700
[2] ACP 4802..4872
[3] KS 4895..5268
[3] AT 5444..5766
[3] methylmalonyl-CoA not conserved YASHS(A->P) 5635..5639
[3] DH 5809..5982
[3] KR 6321..6501
[3] ACP 6601..6671
[4] KS 6694..7067
[4] AT 7243..7565
[4] malonyl-CoA 7437..7441
[4] DH 7608..7781
[4] KR 8120..8300
[4] ACP 8400..8470
[5] KS 8493..8866
[5] AT 9042..9364
[5] methylmalonyl-CoA not conserved YASHS(A->P) 9233..9237
[5] DH 9407..9580
[5] KR 9919..10099
[5] ACP 10199..10269
[6] KS 10292..10665
[6] AT 10841..11163
[6] methylmalonyl-CoA not conserved YASHS(A->P) 11032..11036
[6] DH 11206..11379
[6] KR 11718..11898
[6] ACP 11998..12068
[7] KS 12091..12464
[7] AT 12640..12962
[7] malonyl-CoA 12834..12838
[7] DH 13005..13178
[7] KR 13517..13697
[7] ACP 13797..13867
Domain positions (nucleotide coordinates within the CDS):
[0] KSQ 55..1170
[0] AT 1699..2667
[0] malonyl-CoA 2281..2295
[0] dh 2794..3315
[0] KR 4330..4872
[0] ACP 5170..5382
[1] KS 5449..6570
[1] AT 7075..8016
[1] malonyl-CoA 7636..7650
[1] KR 8953..9483
[1] ACP 9787..9999
[2] KS 10066..11187
[2] AT 11692..12633
[2] malonyl-CoA 12253..12267
[2] KR 13570..14100
[2] ACP 14404..14616
[3] KS 14683..15804
[3] AT 16330..17298
[3] methylmalonyl-CoA not conserved YASHS(A->P) 16903..16917
[3] DH 17425..17946
[3] KR 18961..19503
[3] ACP 19801..20013
[4] KS 20080..21201
[4] AT 21727..22695
[4] malonyl-CoA 22309..22323
[4] DH 22822..23343
[4] KR 24358..24900
[4] ACP 25198..25410
[5] KS 25477..26598
[5] AT 27124..28092
[5] methylmalonyl-CoA not conserved YASHS(A->P) 27697..27711
[5] DH 28219..28740
[5] KR 29755..30297
[5] ACP 30595..30807
[6] KS 30874..31995
[6] AT 32521..33489
[6] methylmalonyl-CoA not conserved YASHS(A->P) 33094..33108
[6] DH 33616..34137
[6] KR 35152..35694
[6] ACP 35992..36204
[7] KS 36271..37392
[7] AT 37918..38886
[7] malonyl-CoA 38500..38514
[7] DH 39013..39534
[7] KR 40549..41091
[7] ACP 41389..41601
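
The two domain lists above give the same domains twice, and the second set of ranges appears to be the first set converted from amino-acid to nucleotide positions within the CDS. A minimal sketch of that conversion, assuming 1-based inclusive ranges and a reading frame that starts at CDS position 1 (the function name `aa_to_nt` is hypothetical, not part of the database):

```python
def aa_to_nt(aa_start: int, aa_end: int) -> tuple[int, int]:
    """Map a 1-based inclusive amino-acid range to the 1-based inclusive
    nucleotide range of the codons that encode it (assumes frame starts at nt 1)."""
    nt_start = 3 * (aa_start - 1) + 1  # first base of the first codon
    nt_end = 3 * aa_end                # last base of the last codon
    return nt_start, nt_end


# A few (amino-acid range, listed nucleotide range) pairs copied from the lists above.
listed = {
    "[0] KSQ": ((19, 390), (55, 1170)),
    "[0] malonyl-CoA": ((761, 765), (2281, 2295)),
    "[3] AT": ((5444, 5766), (16330, 17298)),
    "[7] ACP": ((13797, 13867), (41389, 41601)),
}

for name, (aa_range, nt_range) in listed.items():
    computed = aa_to_nt(*aa_range)
    status = "matches" if computed == nt_range else "differs"
    print(f"{name}: aa {aa_range} -> nt {computed} ({status} listed {nt_range})")

# The record's stated lengths are consistent with the same rule:
# 14,130 codons plus one stop codon give the 42,393 bp CDS length.
cds_length_bp = 3 * 14130 + 3
print(cds_length_bp)
```

The same arithmetic can be used to slice a single domain out of the nucleotide sequence shown above, provided the sequence is read in CDS orientation.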

Feature

BLASTP
Database:UniProtKB:2011_09
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [561-693]  G3DSA:3.40.366.10 [761-877]  G3DSA:3.40.366.10 [2351-2481]  G3DSA:3.40.366.10 [2546-2661]  G3DSA:3.40.366.10 [3890-4020]  G3DSA:3.40.366.10 [4085-4200]  G3DSA:3.40.366.10 [5438-5567]  G3DSA:3.40.366.10 [5635-5754]  G3DSA:3.40.366.10 [7237-7369]  G3DSA:3.40.366.10 [7437-7553]  G3DSA:3.40.366.10 [9036-9165]  G3DSA:3.40.366.10 [9233-9352]  G3DSA:3.40.366.10 [10835-10964]  G3DSA:3.40.366.10 [11032-11151]  G3DSA:3.40.366.10 [12634-12766]  G3DSA:3.40.366.10 [12834-12950]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR006162 Phosphopantetheine attachment site (PTM)
 [1752-1767]  PS00012 [3291-3306]  PS00012 [4830-4845]  PS00012 [6629-6644]  PS00012 [8428-8443]  PS00012 [10227-10242]  PS00012 [12026-12041]  PS00012 [13825-13840]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [1717-1833]  2.10000267834031e-26 SSF47336 [3256-3372]  3.69999438558106e-26 SSF47336 [4795-4911]  3.69999438558106e-26 SSF47336 [6594-6710]  2.30000440726534e-27 SSF47336 [8393-8509]  2.30000440726534e-27 SSF47336 [10192-10308]  2.30000440726534e-27 SSF47336 [11991-12107]  2.30000440726534e-27 SSF47336 [13790-13887]  9.19998414420358e-21 SSF47336
SSF47336   ACP_like
 [1724-1794]  PS50075 [3263-3333]  PS50075 [4802-4872]  PS50075 [6601-6671]  PS50075 [8400-8470]  PS50075 [10199-10269]  PS50075 [11998-12068]  PS50075 [13797-13867]  PS50075
PS50075   ACP_DOMAIN
 [1721-1797]  G3DSA:1.10.1200.10 [3264-3363]  G3DSA:1.10.1200.10 [4803-4902]  G3DSA:1.10.1200.10 [6598-6674]  G3DSA:1.10.1200.10 [8397-8473]  G3DSA:1.10.1200.10 [10196-10272]  G3DSA:1.10.1200.10 [11995-12071]  G3DSA:1.10.1200.10 [13794-13871]  G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [1728-1793]  4.70000000000001e-11 PF00550 [3267-3332]  1.1e-11 PF00550 [4806-4871]  1.1e-11 PF00550 [6605-6670]  4.70000000000001e-12 PF00550 [8404-8469]  4.70000000000001e-12 PF00550 [10203-10268]  4.70000000000001e-12 PF00550 [12002-12067]  4.70000000000001e-12 PF00550 [13801-13866]  5.8e-12 PF00550
PF00550   PP-binding
IPR013968 Polyketide synthase, KR (Domain)
 [1444-1623]  5.40000000000002e-61 PF08659 [2985-3160]  2.19999999999997e-53 PF08659 [4524-4699]  2.19999999999997e-53 PF08659 [6321-6500]  5.40000000000002e-61 PF08659 [8120-8299]  5.40000000000002e-61 PF08659 [9919-10098]  5.40000000000002e-61 PF08659 [11718-11897]  5.40000000000002e-61 PF08659 [13517-13696]  5.40000000000002e-61 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [19-265]  5.50000000000008e-78 PF00109 [1817-2064]  7.79999999999999e-97 PF00109 [3356-3603]  7.79999999999999e-97 PF00109 [4895-5142]  5.59999999999999e-96 PF00109 [6694-6941]  4.89999999999996e-96 PF00109 [8493-8740]  4.80000000000003e-96 PF00109 [10292-10539]  4.80000000000003e-96 PF00109 [12091-12338]  4.80000000000003e-96 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [273-390]  2.1e-40 PF02801 [2072-2190]  1.40000000000001e-43 PF02801 [3611-3729]  1.40000000000001e-43 PF02801 [5150-5268]  1.40000000000001e-43 PF02801 [6949-7067]  1.40000000000001e-43 PF02801 [8748-8866]  1.40000000000001e-43 PF02801 [10547-10665]  1.40000000000001e-43 PF02801 [12346-12464]  1.40000000000001e-43 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [567-889]  2.30000000000004e-72 PF00698 [2359-2672]  3.39999999999996e-68 PF00698 [3898-4211]  3.39999999999996e-68 PF00698 [5444-5766]  4.09999999999995e-102 PF00698 [7243-7565]  2.30000000000004e-72 PF00698 [9042-9364]  4.09999999999995e-102 PF00698 [10841-11163]  4.09999999999995e-102 PF00698 [12640-12962]  2.30000000000004e-72 PF00698
PF00698   Acyl_transf_1
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [565-857]  1.90000694315261e-71 SSF52151 [2357-2644]  1.59998835313644e-72 SSF52151 [3896-4183]  1.59998835313644e-72 SSF52151 [5442-5734]  5.50001754266469e-68 SSF52151 [7241-7533]  1.90000694315261e-71 SSF52151 [9040-9332]  5.50001754266469e-68 SSF52151 [10839-11131]  5.50001754266469e-68 SSF52151 [12638-12930]  1.90000694315261e-71 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [695-760]  2.49999811956465e-16 SSF55048 [2483-2545]  2.19999900980708e-16 SSF55048 [4022-4084]  2.19999900980708e-16 SSF55048 [5569-5634]  1.69999922284049e-15 SSF55048 [7371-7436]  2.49999811956465e-16 SSF55048 [9167-9232]  1.69999922284049e-15 SSF55048 [10966-11031]  1.69999922284049e-15 SSF55048 [12768-12833]  2.49999811956465e-16 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [19-276]  G3DSA:3.40.47.10 [277-443]  G3DSA:3.40.47.10 [1819-2076]  G3DSA:3.40.47.10 [2077-2242]  G3DSA:3.40.47.10 [3364-3615]  G3DSA:3.40.47.10 [3616-3781]  G3DSA:3.40.47.10 [4903-5154]  G3DSA:3.40.47.10 [5155-5320]  G3DSA:3.40.47.10 [6696-6953]  G3DSA:3.40.47.10 [6954-7119]  G3DSA:3.40.47.10 [8495-8752]  G3DSA:3.40.47.10 [8753-8918]  G3DSA:3.40.47.10 [10294-10551]  G3DSA:3.40.47.10 [10552-10717]  G3DSA:3.40.47.10 [12093-12350]  G3DSA:3.40.47.10 [12351-12516]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [17-392]  7.40000780398351e-85 SSF53901 [1809-2188]  3.99998544139406e-99 SSF53901 [3348-3727]  3.99998544139406e-99 SSF53901 [4887-5266]  1.50000965748778e-98 SSF53901 [6686-7065]  9.90001804798383e-99 SSF53901 [8485-8864]  1.19999063421924e-98 SSF53901 [10284-10663]  1.19999063421924e-98 SSF53901 [12083-12462]  1.19999063421924e-98 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1444-1625]  G3DSA:3.40.50.720 [2985-3163]  G3DSA:3.40.50.720 [4524-4702]  G3DSA:3.40.50.720 [6321-6502]  G3DSA:3.40.50.720 [8120-8301]  G3DSA:3.40.50.720 [9919-10100]  G3DSA:3.40.50.720 [11718-11899]  G3DSA:3.40.50.720 [13518-13703]  G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [1977-1993]  PS00606 [3516-3532]  PS00606 [5055-5071]  PS00606 [6854-6870]  PS00606 [8653-8669]  PS00606 [10452-10468]  PS00606 [12251-12267]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [569-870]  6.19996040581186e-121 SM00827 [2361-2653]  1e-116 SM00827 [3900-4192]  1e-116 SM00827 [5446-5747]  2.2999842048857e-117 SM00827 [7245-7546]  6.19996040581186e-121 SM00827 [9044-9345]  2.2999842048857e-117 SM00827 [10843-11144]  2.2999842048857e-117 SM00827 [12642-12943]  6.19996040581186e-121 SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [1725-1797]  3.00000067992871e-30 SM00823 [3264-3336]  6.90001478563507e-29 SM00823 [4803-4875]  6.90001478563507e-29 SM00823 [6602-6674]  4.00000300869883e-32 SM00823 [8401-8473]  4.00000300869883e-32 SM00823 [10200-10272]  4.00000300869883e-32 SM00823 [11999-12071]  4.00000300869883e-32 SM00823 [13798-13870]  2.39999798157265e-32 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [932-1105]  8.00002659346736e-70 SM00826 [5809-5982]  2.89997807423186e-72 SM00826 [7608-7781]  2.89997807423186e-72 SM00826 [9407-9580]  3.39999972437719e-71 SM00826 [11206-11379]  2.89997807423186e-72 SM00826 [13005-13178]  2.89997807423186e-72 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [20-443]  SM00825 [1819-2242]  SM00825 [3358-3781]  SM00825 [4897-5320]  SM00825 [6696-7119]  SM00825 [8495-8918]  SM00825 [10294-10717]  SM00825 [12093-12516]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [1444-1624]  1.70000295590053e-57 SM00822 [2985-3161]  1.20000117458134e-41 SM00822 [4524-4700]  1.20000117458134e-41 SM00822 [6321-6501]  1.70000295590053e-57 SM00822 [8120-8300]  1.70000295590053e-57 SM00822 [9919-10099]  1.70000295590053e-57 SM00822 [11718-11898]  1.70000295590053e-57 SM00822 [13517-13697]  1.70000295590053e-57 SM00822
SM00822   PKS_KR
SignalP No significant hit
TMHMM No significant hit