close all open/close all

CDS information : Aver_00050


close this sectionLocation

Organism
StrainATCC 31267 (=NBRC 14893)
Entry nameAvermectin
Contig
Start / Stop / Direction18,069 / 36,788 / + [in whole cluster]
12,071 / 30,790 / + [in contig]
Location18069..36788 [in whole cluster]
12071..30790 [in contig]
TypeCDS
Length18,720 bp (6,239 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.1 PKS
Productpolyketide synthase
Product (GenBank)type I polyketide synthase AVES 2
Gene
Gene (GenBank)aveA2
EC number
Keyword
Note
Note (GenBank)
  • contains Ave module 3, module 4, module 5 and module 6; multifunctional polyketide synthase
Reference
ACC
PmId
[10449723] Organization of the biosynthetic gene cluster for the polyketide anthelmintic macrolide avermectin in Streptomyces avermitilis. (Proc Natl Acad Sci U S A. , 1999)
[11780788] Organization of biosynthetic gene cluster for avermectin in Streptomyces avermitilis: analysis of enzymatic domains in four polyketide synthases. (J Ind Microbiol Biotechnol. , 2001)

close this sectionPKS/NRPS Module

3 malonyl-CoA
4 malonyl-CoA
not conserved HAFHS(H->N)
5 malonyl-CoA
6 methylmalonyl-CoA
not conserved YASHS(S->C)
KS36..413
AT597..915
ACP986..1060
KS1082..1459
AT1654..1972
KR2306..2486
ACP2583..2657
KS2679..3055
AT3231..3546
KR3878..4059
ACP4171..4241
KS4270..4649
AT4815..5133
DH5189..5362
KR5753..5933
ACP6044..6114

close this sectionSequence

selected fasta
>polyketide synthase [type I polyketide synthase AVES 2]
MQLANEAKLLEYLKRVTADLDRTRRRLYEVVEREQEPIAIVGMACRYPGGATSPTRLWHL
VKSQTDAIGEFPTDRGWNLEQLYDPDPDRSGTSYTRSGGFLYDAGDFDAAFFELSPREAL
AMDPQQRLLLETTWETFEQGGIDPRSMRGSRTGVFVGINPEDYTTGYTHQPSNAVEGYLL
TGSAASIASGRISYNFGLEGPAITIDTACSSSLVALHLACQALRSGECTMALAGGASVMA
TPFVFTEFSRQRGLAADGRCKAFSAAADGTGWSEGVGMLLVERLSDARRNGHRVLAVVRG
SAVNQDGASNGLTAPNGRSQVKVIRQALANAHLSPADVDAVEAHGTGTTLGDPIEAQALV
EAYGQDRPNGRPLWLGTLKSNIGHSMAAAGVGGVIKMVMALRNGLLPRTLHVDEPSPHVD
WSAGAVQLLTETVPWPGGEGRLRRAGVSSFGVSGTNAHVILEEAPAHNIPSDTPADDVPG
ESAADEDAGSGDEAAAGSPGVWPWLVSAKSQPALRAQAQALHAHLTDHPGLDLADVGYTL
AHARAVFDHRATLIAADRDTFLQALQALAAGEPHPAVIHSSAPGGTGTGEAAGKTAFICS
GQGTQRPGMAHGLYHTHPVFAAALNDICTHLDPHLDHPLLPLLTQDPNTQDTTTLEEAAA
LLQQTRYAQPALFAFQVALHRLLTDGYHITPHYYAGHSLGEITAAHLAGILTLTDATTLI
TQRATLMQTMPPGTMTTLHTTPHHITHHLTAHENDLAIAAINTPTSLVISGTPHTVQHIT
TLCQQQGIKTKTLPTNHAFHSPHTNPILNQLHQHTQTLTYHPPHTPLITANTPPDQLLTP
HYWTQQARNTVDYATTTQTLHQHGVTTYIELGPDNTLTTLTHDNLPNTPTTTLTLTHPHH
HPQTHLLTNLAKTTTTWHPHHYTHHHNQPHTHTHLDLPTYPFQHHHYWLQPPGKPSDPSP
SEGREQATTPSTPLRDVLVGKSPQERDEELLRLVRTHAAAVLGHATPEVIVPNKAFKELG
FDSLAAIQLRNRLLADVDLPLPATLIFDYPTPMALCQFLRAAIVGADTGTTTRLPLTAVP
ADEPIAIVGMACRYPGDVRTVDDLWQVVSGGHDAIGGFPTNRGWDLDTLYNPDPDHHGTS
YTRSGGFLYDAGNFDPDFFGISPREALAMDPQQRLLLETAWESIEHACINPDSLRGTPTG
VFAGLTYHDYAARFPTAPAGFEGYLGHGSAGSIASGRVAYALGLEGPALTVDTACSSSLV
ALHLACQALRSGECSMALAGGVTVMSTPAGFVEFSRQRGLAVDGRCKAFSAAADGTGWGE
GVGMLLVERLSDARRLGHRILAVVRGSAVNQDGASNGLTAPNGPSQERVIRLALANADLT
PADVDAVEAHGTGTTLGDPIEAQALLATYGQDRPGNEPLWLGSMKSNIGHAQAAAGVGGV
IKMVMALRNGLLPRTLHVDEPSPHVDWSAGAVQLLTETVPWPGGEGRLRRAGVSSFGVSG
TNAHVILEEAPAHNIPSDTPADDAPGEAAADDVPGEAAGDDAGTGGEATGPAAGSPGVWP
WLVSAKSQPALRAQAQALHAHLTDHPGLDLADVGYTLAHARAVFDHRATLIAADRDTFLQ
ALQALAAGEPHPAVIHSSAPGGTGTGEAAGKTAFICSGQGTQRPGMAHGLYHTHPVFAAA
LNDICTHLDPHLDHPLLPLLTQDPNTQDTTTLEEAAALLQQTPYAQPALFAFQVALHRLL
TDGYHITPHYYAGHSLGEITAAHLAGILTLTDATTLITQRATLMQTMPPGTMTTLHTTPH
HITHHLTAHENDLAIAAINTPTSLVISGTPHTVQHITTLCQQQGIKTKTLPTKNAFHSPH
TNPILNQLHQHTQTLTYHPPHTPLITANTPPDQLLTPHYWTQQARNTVDYATTTQTLHQH
GVTTYIELGPDNTLTTLTHHNLPNTPTTTLTLTHPHHHPQTHLLTNLAKTTTTWHPHHYT
HHHNQPHTHTHLDLPTYPFQHQHYWLESTQPGAGSGSGSGSGRAGTAGGTAEVESRFWDA
VARQDLETVATTLAVPPSAGLDTVVPALSAWHRHQHDQARINTWTYQETWKPLTLPTTHQ
PHQTWLIAIPETQTHHPHITNILTNLHHHGITPIPLTLNHTHTNPQHLHHTRQQAQNHTT
GPITGLLSLLALDETPHPHHPHTPTGTLLNLTLTQTHTQTHPPTPLWYATTNATTTHPND
PLTHPTQAQTWGLARTTLLEHPTHTAGIIDLPTTPTPHTLHHLTQTLTQPHHQTQLAIRT
TGTHTRRLTPTTLTPTHQPPTPTPHGTTLITGGTGALATHLTHHLTTHQPTQHLLLTSRT
GPHTPHAQHLTTQLQQKGIHLTITTCDTSNPDQLQQLLNTIPPQHPLTTVIHTAGILDDA
TLTNLTPTQLNNVLRAKAHSAHLLHQLTQHTPLNAFVLYSSAAATFGAPGQANYAAANAY
LDALAHHRHTHHLPATSIAWGTWQGNGLATGQVSEHLRRRGMFAMPPELAVTAVDGAIAS
GRPSLLVADIDWKKLGPVLSSKSSVLLEDLPQAQGTEEARSTVEQTESTNLRQLLMGRSR
SEQEEELLSLVRIHSAAVLGRDDSEAIPPGRLFRDLGFDSLAAVELRNHLAAQTELALPT
TLVFDYPSPTKLAQFLLSEIAEFQPDNSTPLPRPRAELDEPIAIVGMACRFPGGVTSADD
FWDLISSEQDAIGGFPTDRGWDLDTLYDPDPDHPGTCYTRNGGFLYDAGHFDAEFFGISP
REALAMDPQQRLLLETAWETIEHAGINPHTLHGTPTGVFTGTNGQDHAAHIRQAPSGTEG
FVLTGAATSIASGRISYILGLEGPAVTLDTACSSSLVALHLACQSLRSGECTMALAGGAT
VMTTPITFTEFARQRGLAPDGRCKAFSAAADGTGWGEGVGMLLVERLSDARRNGHRVLAV
VRGSAVNQDGASNGLTAPNGPSQQRVIRQALANADLTPADVDAVEAHGTGTTLGDPIEAQ
AILATYGQDRPGNGPLWLGSVKSNVGHTQAAAGVAGVIKMVMALRHRTLPPTLHADEPSP
HVDWSAGAVQLLTETVPWPGGEGRPRRAGVSSFGVSGTNAHVILEEAPADDVPGGPPADE
DAGSGEEAAAGSPGVWPWLVSAKSQPALRAQAQALHAHLTDHPGLDLADVGYTLAHARAV
FDHRATLIAADRDTFLQALQALAAGEPHPAVIHSSAPGGTGTGEAAGKTAFICSGQGTQR
PGMAHGLYHTHPVFAAALNDICTHLDPHLDHPLLPLLTQNDNDNDNEDAAALLQQTPYAQ
PALFAFQVALHRLLTDGYHITPHYYAGHSLGEITAAHLAGILTLTDATTLITQRATLMQT
MPPGTMTTLHTTPHHITHHLTAHENDLAIAAINTPTSLVISGTPHTVQHITTLCQQQGIK
TKTLPTNHAFHSPHTNPILNQLHQHTQTLTYHPPHTPLITANTPPDQLLTPHYWTQQARN
TVDYATTTQTLHQHGVTTYIELGPDNTLTTLTHHNLPNTPTTTLTLTHPHHHPQTHLLTN
LAKTTTTWHPHHYTHHHNQPHTHTHLDLPTYPFQHHHYWLELPSAQTSPGQRRSRRSAPD
TAESEFWDAVNEEDLQSLAETLDIDASALDTVVPALSAWHRHQHDQARINTWTYQETWKP
LTLPTTHQPHQTWLIAIPETQTHHPHITNILTNLHHHGITPIPLTVNHTHTNPQHLHHTL
HHTRQQAQNHTTGPITGLLSLLALDETPHPHHPHTPTGTLLNLTLPQTHTQTHPPTPLWY
ATTNATTTHPNDPLTHPTQAQTWGLARTTLLEHPTHTAGIIDLPTTPTPHTLHHLTQTLT
QPHHQTQLAIRTTGTHTRRLTPTTLTPTHQPPTPTPHGTTLITGGTGALATHLTHHLTTH
QPTQHLLLTSRTGPHTPHAQHLTTQLQQKGIHLTITTCDTSNPDQLQQLLNTIPPQHPLT
TVIHTAGVNLFAPVSETDAESFSSVTAAKATGAAILHELLLDHETLEHFILFSSGAGAWG
SGNQCAYSAANAYLDALATHRQTHGLPGASIAWGPWAGKGMSAGDAAHGYLEKRGILPME
PRMALAAFHRARAQRPNSNLIIADIDWERFVPAFTARRHSPLIEDIPEVRQAAQELEAAA
STAKTTTAQPIATSLRERLARLTSSKQNQVLLGLIRTGICTVLGLRNPEGIEDQRAFRDL
GFDSLTSAQFSKELAKETGLPLPPSLVFDYPTPQECAAHLRTQLVDLDDEEDAALSNALP
QVAHRRTVEDEPIAIIGMACRFPGGVRSADDLWELLASGKDAIGVFPTDRGWDLDTLYDP
DPDHPGTCYTRNGGFLYGAGHFDAEFFGISPREALAMDPQQRLLLETAWETIEHAGINPH
TLHGTPTGVFAGINAQDHAAHIRQSRDVETIEGYALTGSSGSVASGRVAYTLGLEGPAVS
VDTACSSSLVALHWAAQALRAGECSMALAGGVTVMSSPGTFVEFSRQRGLAADGRCKAYS
AAADGTGWAEGVGMLLVERLSDARRNGHRVLAVVRGSAVNQDGASNGLTAPNGPSQQRVI
RQALANAGLTPADVDAVEGHGTGTTLGDPIEAQALLAAYGQHRPHHRPLWLGSLKSNIGH
AQAAAGVGGVIKMVMALRNGLLPQTLHVDEPTPQVDWSTGAVQLLTQPVPWPADPAGRPR
HAGVSSFGVSGTNAHIILEEAPTPQDSDTDDEPPANAPALPHPLPLPVPVSARSEAGLRA
QAQALRQYVAARPDMSPADIGAGLARGRAVLEHRAVILAADREELAQALTALAAGEPHPH
ITTGHTRGGDRGGVVFVFPGQGGQWAGMGLTLLTSSPVFAEHIDACEKALTPWVPWSLTD
ILHRDPDDPAWQQADVVQPVLFSIMVSLAALWRSYGIEPDAVLGHSQGEIAAAHICGALS
LKDAAKTVALRSRALAAVRGRGAMASLPLPAQDVQQLISERWEGQLWVAALNGPHSTTVS
GDTKAVDEVLAHCTDTGLRAKRIPVDYASHCPHVQPLHDELLHLLGDITPQPSTVPFFST
VEGTWLDTTTLDAAYWYRNLHQPVRFSHAIQTLTDDGHRAFIEISPHPTLVPAIEDTTEN
TTENITATGSLRRGDNDTHRFLTALAHTHTTGIGTPTTWHHHYTQTHPHPNPHTHLDLPT
YPFQHQHYWLQPPTTTTDLTTTGLTPTHHPLLTATLTLADNNTQLLTGRLSLRTHPWLTD
HTVAGMVLLPGTALLELALQAGERVDCPRVEELTLHAPLVIPHTEDVTLQVTVRAADESG
HRALAIHSYSGTASSADREWTRHATGLLTHHADTDHRADTHTDACLGGSWPPPGAQPIEL
GDVYGRMAADSDIAYGPVFQGLHAAWRFGDDVLAEVRLPEEALRDAPAAAFGVHPALLDA
ALHATALTPQNGDGSTENVAQESMPDRAAHQARLPFSWSGVSLHTAGSSVLRVRLSRSPQ
HGNAVALTAADEDGRPVVTIESLALRPVSTEELRAAADRTPEHESLFRLDWVSVPVPANA
PSPTADRPWAVIGAGLPHLPGLTEHEHVTAYDEPADLLLALDRGAPPPGVLVVGGVAHTE
AREYSAEAPGERGTEACEARPDVVHVGVVHTAAVHAAAAQMLARLQAWLGDERLADSRLL
VLTCGAVARASGDDATDLPGAAVWGLVRSAQSEHPDRITLLDFERGTEAEPGQLATALNC
GERQLAVRPGGLFTPRLVRAPRVADAVPAVPAVAVPSAGHAAVPAAGPFLPGGTVLITGG
TGVLGRLVARHLVEAHGVRHLLLAGRRGPDAEGAPELRAELGGLGATVEVVACDAADRQQ
LADLLTRIPDDRPLTGVVHSAGILDDGVITSLSPERLGAVLRAKADAALLLDELTRGAEL
SAFVMFSSASAVVGSPGQGNYAAANAVLDFLAHRRRAEGLPAVSLAWGLWEEGTGMTGHL
DVDDHARISRAGMRPLPTAEALALFDAALADGEPFLMPARLDLTAVRSGAASAPVPPLLQ
GLLQLPRSRSAAAAPGHGAPAADEAAAWRERLARQSAGERRQALLRLVRSHVAAVLGHSG
ADGIDASRAFRELGFDSLTAVELRNRLTAATGLRLRATLAFDFPTPAALAEHLGERLLPD
QEATGEQAGDQLSGGSEEDVRSLLTSIPIGRLRDAGLLGPLLTLADTGRGASGAAAGPED
APPSGQDTPAPVSIDEMDIDDLMDLAHGHGTAPAREPADAEDSSSSRNRTHHTHEGETA
selected fasta
>polyketide synthase [type I polyketide synthase AVES 2]
ATGCAATTGGCGAATGAAGCGAAGCTCCTGGAATACCTCAAGCGCGTCACTGCGGACCTG
GACCGCACTCGCCGTCGCCTGTACGAGGTGGTCGAGCGTGAGCAGGAGCCGATCGCGATT
GTGGGGATGGCGTGTCGTTACCCAGGCGGGGCGACGTCACCCACGCGACTGTGGCATCTC
GTCAAGTCCCAGACGGACGCTATCGGGGAGTTCCCGACCGACCGTGGATGGAACCTGGAG
CAGCTCTACGACCCGGACCCCGACCGCTCAGGAACCAGTTACACGCGCAGCGGAGGGTTT
CTCTATGACGCGGGCGACTTCGACGCCGCGTTCTTCGAGTTGTCACCGCGTGAGGCGCTG
GCAATGGACCCGCAGCAGCGCCTGCTGCTCGAAACCACTTGGGAAACGTTCGAACAGGGC
GGAATCGACCCGAGGTCCATGCGCGGAAGCCGGACCGGGGTTTTCGTGGGGATCAATCCG
GAGGACTACACCACCGGATACACACATCAGCCCTCAAACGCAGTCGAGGGCTACCTGCTC
ACTGGCAGCGCGGCAAGCATTGCGTCAGGCCGTATCTCCTACAACTTCGGGCTCGAAGGC
CCTGCGATCACTATCGACACCGCGTGTTCCTCCTCGCTCGTCGCCCTGCATCTGGCCTGC
CAAGCGCTCCGGTCCGGTGAATGCACCATGGCGCTCGCAGGCGGCGCCTCCGTCATGGCC
ACTCCCTTCGTCTTCACCGAGTTCTCTCGCCAGCGGGGCCTGGCCGCAGACGGCCGGTGC
AAGGCGTTTTCGGCGGCGGCGGACGGGACCGGCTGGTCCGAGGGTGTGGGGATGCTGCTG
GTGGAGCGGCTCTCCGACGCCCGCCGCAACGGTCACCGTGTCCTGGCCGTCGTCCGCGGC
AGCGCCGTCAACCAGGACGGCGCAAGCAACGGCCTGACCGCACCCAACGGTCGTTCACAA
GTCAAGGTCATCCGCCAGGCTTTGGCCAACGCACACCTCTCCCCTGCCGATGTCGATGCG
GTGGAGGCCCACGGCACGGGGACCACCCTGGGCGACCCGATCGAGGCTCAAGCCCTCGTC
GAAGCCTACGGTCAGGACCGCCCCAACGGCCGCCCCCTCTGGCTCGGAACCCTCAAGTCC
AACATCGGGCACTCCATGGCCGCTGCGGGTGTGGGCGGGGTCATCAAGATGGTGATGGCG
CTGCGGAATGGTCTGCTGCCGCGGACGTTGCATGTGGATGAGCCGTCGCCGCATGTGGAC
TGGTCCGCGGGTGCGGTGCAGCTGCTGACGGAGACGGTGCCCTGGCCCGGCGGGGAGGGG
CGGCTACGGCGGGCAGGAGTGTCATCATTCGGCGTCAGCGGCACCAACGCCCACGTCATC
CTCGAGGAAGCACCCGCCCACAACATCCCGTCAGACACACCCGCCGACGACGTCCCGGGA
GAATCAGCCGCCGACGAGGATGCCGGTAGTGGCGATGAGGCTGCTGCCGGCAGTCCAGGG
GTGTGGCCGTGGCTGGTGTCGGCCAAGTCGCAGCCGGCCCTGCGCGCCCAGGCCCAGGCC
CTGCACGCCCACCTCACCGACCACCCCGGCCTCGACCTCGCCGACGTCGGGTACACCCTC
GCCCACGCCCGCGCCGTGTTCGACCACCGCGCCACCCTCATCGCCGCCGACCGCGACACC
TTCCTGCAAGCACTCCAGGCACTCGCCGCAGGCGAACCCCACCCCGCCGTCATCCACAGC
AGCGCCCCAGGCGGGACCGGGACCGGGGAGGCCGCAGGAAAGACCGCATTCATCTGCTCC
GGACAGGGCACCCAACGCCCCGGCATGGCCCACGGCCTCTACCACACCCACCCCGTCTTC
GCCGCCGCACTCAACGACATCTGCACCCACCTCGACCCCCACCTCGACCACCCCCTCCTC
CCCCTCCTCACCCAGGACCCCAACACCCAGGACACCACCACCCTCGAAGAAGCGGCCGCA
CTGCTCCAGCAGACCCGCTACGCCCAGCCCGCCCTCTTCGCCTTCCAGGTCGCCCTCCAC
CGCCTCCTCACCGACGGCTACCACATCACCCCCCACTACTACGCCGGACACTCCCTCGGC
GAAATCACCGCCGCCCACCTCGCCGGCATCCTCACCCTCACCGACGCCACCACCCTCATC
ACCCAACGCGCCACCCTCATGCAAACCATGCCCCCCGGCACCATGACCACCCTCCACACC
ACCCCCCACCACATCACCCACCACCTCACCGCCCACGAAAACGACCTCGCCATCGCCGCC
ATCAACACCCCCACCTCCCTCGTCATCAGCGGCACCCCCCACACCGTCCAACACATCACC
ACCCTCTGCCAACAACAAGGCATCAAAACCAAAACCCTCCCCACCAACCACGCCTTCCAC
TCCCCCCACACCAACCCCATCCTCAACCAACTCCACCAGCACACCCAAACCCTCACCTAC
CACCCACCCCACACCCCCCTCATCACCGCCAACACCCCACCCGACCAACTCCTCACCCCC
CACTACTGGACCCAACAAGCCCGCAACACCGTCGACTACGCCACCACCACCCAAACCCTC
CACCAACACGGCGTCACCACCTACATCGAACTCGGACCCGACAACACCCTCACCACCCTC
ACCCACGACAACCTCCCCAACACCCCCACCACCACCCTCACCCTCACCCACCCCCACCAC
CACCCCCAAACCCACCTCCTCACCAACCTCGCCAAAACCACCACCACCTGGCACCCCCAC
CACTACACCCACCACCACAACCAACCCCACACCCACACCCACCTCGACCTCCCCACCTAC
CCCTTCCAACACCACCACTACTGGCTCCAACCACCCGGCAAGCCGAGCGACCCGTCACCG
AGCGAAGGCCGTGAGCAAGCCACGACCCCATCAACCCCGCTGCGTGATGTCCTCGTGGGC
AAGTCTCCGCAGGAGCGAGACGAAGAGCTGTTGCGCCTGGTGCGCACCCATGCGGCCGCT
GTGCTGGGCCATGCCACTCCCGAAGTGATCGTTCCGAACAAGGCCTTCAAAGAGCTGGGT
TTTGATTCTCTCGCCGCAATTCAGCTTCGTAATCGACTGCTTGCTGACGTTGACCTGCCG
CTTCCGGCCACGCTGATCTTCGATTACCCCACTCCGATGGCGCTTTGCCAGTTCCTCCGG
GCGGCGATCGTCGGAGCGGACACAGGCACGACCACTCGTCTGCCGCTAACTGCGGTCCCC
GCCGACGAGCCGATCGCCATCGTCGGCATGGCCTGTCGGTACCCCGGTGATGTACGGACG
GTCGATGATCTCTGGCAGGTGGTCAGTGGTGGCCATGACGCGATCGGCGGATTCCCGACG
AACCGTGGGTGGGACCTCGACACGCTGTACAACCCGGACCCGGACCACCACGGAACCAGC
TACACCCGGAGCGGCGGATTCCTTTACGACGCAGGCAATTTCGATCCCGACTTCTTCGGT
ATCAGTCCGCGTGAGGCACTGGCGATGGACCCGCAGCAGCGGCTGCTGCTGGAAACAGCG
TGGGAGAGCATCGAACACGCCTGCATCAACCCCGACAGCCTCCGTGGCACACCAACCGGC
GTCTTCGCCGGGCTGACCTACCACGACTACGCCGCGCGCTTTCCCACAGCTCCGGCAGGG
TTCGAGGGGTATCTCGGGCACGGAAGCGCAGGCAGTATCGCCTCGGGTCGTGTCGCCTAC
GCTCTCGGCCTGGAAGGTCCGGCCCTCACAGTCGACACTGCCTGCTCTTCGTCCCTGGTC
GCTCTGCACCTGGCCTGTCAGGCGCTGCGGTCCGGCGAGTGTTCCATGGCCCTCGCGGGT
GGCGTCACGGTGATGTCAACCCCGGCCGGGTTCGTGGAGTTTTCGCGGCAGCGGGGCCTG
GCCGTGGACGGGCGGTGCAAGGCGTTCTCGGCAGCGGCTGACGGCACCGGCTGGGGTGAG
GGTGTCGGAATGCTGCTGGTGGAGCGGCTGTCGGACGCGCGGCGGCTCGGTCACCGAATC
CTCGCGGTGGTGCGTGGCAGTGCGGTCAATCAGGACGGTGCGAGCAACGGGCTGACGGCG
CCCAACGGGCCGTCCCAGGAGCGTGTCATCCGCCTGGCCCTGGCCAACGCGGACCTGACC
CCCGCCGACGTCGATGCGGTGGAGGCCCACGGCACCGGCACCACTTTGGGCGACCCGATC
GAGGCCCAGGCCCTCCTCGCCACCTACGGACAGGACCGCCCCGGCAACGAACCGCTGTGG
CTGGGCTCGATGAAGTCGAACATCGGCCACGCGCAGGCTGCCGCAGGTGTGGGCGGGGTC
ATCAAGATGGTGATGGCGCTGCGGAATGGTCTGCTGCCGCGGACGTTGCATGTGGATGAG
CCGTCGCCGCATGTGGACTGGTCCGCGGGGGCGGTGCAGCTGCTGACGGAGACGGTGCCC
TGGCCCGGCGGGGAGGGGCGGCTGCGGCGGGCAGGAGTGTCATCGTTCGGCGTCAGCGGC
ACCAACGCCCACGTCATCCTCGAAGAAGCACCCGCCCACAACATCCCGTCAGACACACCC
GCCGACGACGCCCCGGGAGAAGCAGCCGCCGACGATGTTCCGGGGGAAGCGGCCGGCGAC
GACGCCGGTACCGGCGGGGAAGCGACTGGTCCTGCTGCCGGCAGTCCAGGGGTGTGGCCG
TGGCTGGTGTCGGCCAAGTCGCAGCCGGCCCTGCGCGCCCAGGCCCAGGCCCTGCACGCC
CACCTCACCGACCACCCCGGCCTCGACCTCGCCGACGTCGGGTACACCCTCGCCCACGCC
CGCGCCGTGTTCGACCACCGCGCCACCCTCATCGCCGCCGACCGCGACACCTTCCTGCAA
GCACTCCAGGCACTCGCCGCAGGCGAACCCCACCCCGCCGTCATCCACAGCAGCGCCCCA
GGCGGGACCGGGACCGGGGAGGCCGCAGGAAAGACCGCATTCATCTGCTCCGGACAGGGC
ACCCAACGCCCCGGCATGGCCCACGGCCTCTACCACACCCACCCCGTCTTCGCCGCCGCA
CTCAACGACATCTGCACCCACCTCGACCCCCACCTCGACCACCCCCTCCTCCCCCTCCTC
ACCCAGGACCCCAACACCCAGGACACCACCACCCTCGAAGAAGCGGCCGCACTGCTCCAG
CAGACCCCGTACGCCCAGCCCGCCCTCTTCGCCTTCCAGGTCGCCCTCCACCGCCTCCTC
ACCGACGGCTACCACATCACCCCCCACTACTACGCCGGACACTCCCTCGGCGAAATCACC
GCCGCCCACCTCGCCGGCATCCTCACCCTCACCGACGCCACCACCCTCATCACCCAACGC
GCCACCCTCATGCAAACCATGCCCCCCGGCACCATGACCACCCTCCACACCACCCCCCAC
CACATCACCCACCACCTCACCGCCCACGAAAACGACCTCGCCATCGCCGCCATCAACACC
CCCACCTCCCTCGTCATCAGCGGCACCCCCCACACCGTCCAACACATCACCACCCTCTGC
CAACAACAAGGCATCAAAACCAAAACCCTCCCCACCAAAAACGCCTTCCACTCCCCCCAC
ACCAACCCCATCCTCAACCAACTCCACCAGCACACCCAAACCCTCACCTACCACCCACCC
CACACCCCCCTCATCACCGCCAACACCCCACCCGACCAACTCCTCACCCCCCACTACTGG
ACCCAACAAGCCCGCAACACCGTCGACTACGCCACCACCACCCAAACCCTCCACCAACAC
GGCGTCACCACCTACATCGAACTCGGACCCGACAACACCCTCACCACCCTCACCCACCAC
AACCTCCCCAACACCCCCACCACCACCCTCACCCTCACCCACCCCCACCACCACCCCCAA
ACCCACCTCCTCACCAACCTCGCCAAAACCACCACCACCTGGCACCCCCACCACTACACC
CACCACCACAACCAACCCCACACCCACACCCACCTCGACCTCCCCACCTACCCCTTCCAA
CACCAGCACTACTGGCTCGAAAGCACACAGCCGGGTGCCGGATCCGGTTCGGGTTCCGGT
TCCGGGCGGGCAGGGACTGCGGGCGGGACGGCAGAGGTGGAGTCGCGGTTCTGGGACGCG
GTGGCCCGCCAGGACCTGGAAACGGTCGCGACCACGCTCGCCGTGCCCCCCTCCGCCGGC
CTGGACACGGTGGTGCCCGCACTCTCCGCCTGGCACCGCCACCAACACGACCAAGCCCGC
ATCAACACCTGGACCTACCAGGAAACCTGGAAACCCCTCACCCTCCCCACCACCCACCAA
CCCCACCAAACCTGGCTCATCGCCATCCCCGAAACCCAGACCCACCACCCCCACATCACC
AACATCCTCACCAACCTCCACCACCACGGCATCACCCCCATCCCCCTCACCCTCAACCAC
ACCCACACCAACCCCCAACACCTCCACCACACCCGACAACAAGCCCAAAACCACACCACC
GGACCCATCACCGGCCTGCTCTCCCTCCTCGCCCTCGACGAAACACCCCACCCCCACCAC
CCCCACACACCCACCGGCACCCTCCTCAACCTCACCCTCACCCAAACCCACACCCAAACC
CACCCACCAACCCCCCTCTGGTACGCCACCACCAACGCCACCACCACCCACCCCAACGAC
CCCCTCACACACCCCACCCAAGCCCAAACCTGGGGACTCGCCCGCACCACCCTCCTCGAA
CACCCCACCCACACCGCCGGAATCATCGACCTCCCCACCACCCCCACCCCCCACACCCTC
CACCACCTCACCCAAACCCTCACCCAACCCCACCACCAAACCCAACTCGCCATCCGCACC
ACCGGCACCCACACCCGCCGCCTCACCCCCACCACCCTCACCCCCACACACCAACCACCC
ACCCCCACCCCCCACGGAACCACCCTCATCACCGGCGGAACCGGCGCCCTCGCCACCCAC
CTCACCCACCACCTCACCACCCACCAACCCACCCAACACCTCCTCCTCACCAGCCGAACC
GGCCCCCACACCCCCCACGCACAACACCTCACCACCCAACTCCAACAAAAAGGCATCCAC
CTCACCATCACCACCTGCGACACCAGCAACCCAGACCAACTCCAACAACTCCTCAACACC
ATCCCCCCACAACACCCCCTCACCACCGTCATCCACACCGCAGGCATCCTCGACGACGCC
ACCCTCACCAACCTCACCCCCACCCAACTCAACAACGTCCTCCGCGCCAAAGCCCACAGC
GCCCACCTCCTCCACCAACTCACCCAACACACCCCCCTCAACGCCTTCGTCCTCTACTCC
TCCGCCGCCGCCACCTTCGGCGCACCCGGCCAAGCCAACTACGCCGCAGCCAACGCCTAC
CTCGACGCCCTCGCCCACCACCGCCACACCCACCACCTCCCCGCCACCAGCATCGCCTGG
GGCACCTGGCAAGGAAACGGACTGGCGACTGGTCAAGTCAGCGAACATCTCCGCCGCCGC
GGGATGTTCGCCATGCCGCCCGAGTTGGCGGTCACAGCTGTTGACGGCGCGATCGCGAGC
GGGCGCCCGAGTCTCCTCGTCGCCGATATCGACTGGAAGAAATTGGGACCGGTTCTCTCC
AGCAAGTCGTCGGTCTTGCTCGAGGACCTTCCCCAGGCACAGGGAACTGAGGAGGCGCGC
AGTACCGTTGAGCAGACGGAGAGCACAAACCTCCGGCAACTCCTCATGGGTCGGTCACGT
TCCGAGCAGGAAGAAGAGCTGCTCAGCCTCGTCCGCATCCACTCCGCGGCAGTGCTCGGG
CGCGACGACTCCGAGGCCATCCCGCCCGGTCGGCTGTTCAGGGATCTAGGGTTCGACTCG
CTTGCGGCGGTGGAGCTTCGCAACCACCTCGCAGCACAGACGGAGCTGGCTCTGCCGACG
ACTCTCGTCTTCGATTACCCCAGCCCCACCAAGCTCGCCCAATTTCTGCTCTCCGAGATC
GCGGAGTTCCAGCCCGACAACTCAACTCCGCTTCCGCGACCCCGGGCAGAGCTCGATGAG
CCGATCGCCATCGTTGGCATGGCCTGTCGCTTCCCCGGCGGAGTGACCTCGGCGGACGAC
TTCTGGGATCTGATCTCCTCCGAGCAGGACGCGATCGGCGGATTCCCCACCGACCGCGGC
TGGGACCTGGACACGCTCTACGACCCCGACCCCGACCACCCCGGCACCTGCTACACCCGA
AACGGCGGATTCCTCTACGACGCAGGCCACTTCGACGCCGAATTCTTCGGCATCAGCCCC
CGCGAAGCCCTCGCCATGGACCCCCAGCAACGACTCCTCCTCGAAACCGCCTGGGAAACC
ATCGAACACGCCGGCATCAACCCCCACACCCTCCACGGCACCCCCACCGGAGTCTTCACC
GGCACCAACGGACAGGACCACGCGGCACACATCCGTCAGGCCCCGAGCGGTACCGAGGGA
TTCGTCCTGACCGGGGCAGCCACCAGCATCGCCTCCGGCCGAATCTCCTACATCCTCGGG
TTGGAAGGGCCTGCGGTCACCCTCGACACAGCGTGTTCCTCCTCGCTCGTCGCCCTGCAC
CTCGCCTGCCAGTCCCTCAGGTCCGGTGAATGCACCATGGCCTTGGCCGGCGGGGCCACG
GTCATGACCACCCCGATCACCTTCACCGAATTCGCCCGCCAACGCGGACTCGCCCCCGAC
GGGCGTTGCAAGGCGTTCTCGGCGGCGGCTGACGGTACCGGCTGGGGTGAGGGTGTGGGG
ATGCTGCTGGTGGAGCGGCTCTCCGACGCCCGCCGCAACGGTCACCGTGTCCTGGCCGTG
GTGCGTGGCAGTGCGGTCAACCAGGACGGTGCGAGCAACGGTCTGACCGCGCCCAACGGG
CCCTCCCAGCAGCGCGTCATCCGCCAGGCCCTCGCCAACGCGGACCTGACCCCCGCCGAC
GTCGATGCGGTGGAGGCCCACGGCACCGGCACCACTTTGGGCGACCCGATCGAGGCCCAG
GCCATCCTCGCGACCTACGGACAGGACCGTCCCGGCAACGGGCCGTTGTGGCTGGGCTCC
GTCAAGTCCAACGTCGGACACACACAGGCCGCGGCGGGCGTGGCCGGAGTGATCAAGATG
GTGATGGCCCTCCGCCACCGGACACTCCCACCGACTCTCCACGCGGATGAGCCGTCGCCG
CATGTGGACTGGTCCGCGGGTGCGGTGCAGCTGCTGACGGAGACGGTGCCCTGGCCCGGC
GGGGAGGGGCGGCCGCGGCGGGCAGGAGTGTCATCATTCGGCGTCAGCGGCACCAACGCC
CACGTCATCCTCGAAGAAGCACCCGCCGACGACGTTCCGGGGGGACCACCCGCCGACGAG
GATGCCGGTAGTGGCGAGGAGGCTGCTGCCGGCAGTCCTGGGGTGTGGCCGTGGCTGGTG
TCGGCCAAGTCGCAGCCGGCCCTGCGCGCCCAGGCCCAGGCCCTGCACGCCCACCTCACC
GACCACCCCGGCCTCGACCTCGCCGACGTCGGATACACCCTCGCCCACGCCCGCGCCGTG
TTCGACCACCGCGCCACCCTCATCGCCGCCGACCGCGACACCTTCCTGCAAGCACTCCAG
GCACTCGCCGCAGGCGAACCCCACCCCGCCGTCATCCACAGCAGCGCCCCAGGCGGGACC
GGGACCGGGGAGGCCGCAGGAAAGACCGCATTCATCTGCTCCGGACAGGGCACCCAACGC
CCCGGCATGGCCCACGGCCTCTACCACACCCACCCCGTCTTCGCCGCCGCACTCAACGAC
ATCTGCACCCACCTCGACCCCCACCTCGACCACCCCCTCCTCCCCCTCCTCACCCAAAAC
GACAACGACAACGACAACGAGGACGCGGCCGCACTGCTCCAGCAGACCCCGTACGCCCAG
CCCGCCCTCTTCGCCTTCCAGGTCGCCCTCCACCGCCTCCTCACCGACGGCTACCACATC
ACCCCCCACTACTACGCCGGACACTCCCTCGGCGAAATCACCGCCGCCCACCTCGCCGGC
ATCCTCACCCTCACCGACGCCACCACCCTCATCACCCAACGCGCCACCCTCATGCAAACC
ATGCCCCCCGGCACCATGACCACCCTCCACACCACCCCACACCACATCACCCACCACCTC
ACCGCCCACGAAAACGACCTCGCCATCGCCGCCATCAACACCCCCACCTCCCTCGTCATC
AGCGGCACCCCCCACACCGTCCAACACATCACCACCCTCTGCCAACAACAAGGCATCAAA
ACCAAAACCCTCCCCACCAACCACGCCTTCCACTCCCCCCACACCAACCCCATCCTCAAC
CAACTCCACCAGCACACCCAAACCCTCACCTACCACCCACCCCACACCCCCCTCATCACC
GCCAACACCCCACCCGACCAACTCCTCACCCCCCACTACTGGACCCAACAAGCCCGCAAC
ACCGTCGACTACGCCACCACCACCCAAACCCTCCACCAACACGGCGTCACCACCTACATC
GAACTCGGACCCGACAACACCCTCACCACCCTCACCCACCACAACCTCCCCAACACCCCC
ACCACCACCCTCACCCTCACCCACCCCCACCACCACCCCCAAACCCACCTCCTCACCAAC
CTCGCCAAAACCACCACCACCTGGCACCCCCACCACTACACCCACCACCACAACCAACCC
CACACCCACACCCACCTCGACCTCCCCACCTACCCCTTCCAACACCACCACTACTGGCTC
GAACTACCCAGCGCCCAAACCAGCCCCGGTCAAAGGCGTTCTCGCCGCTCGGCTCCAGAC
ACCGCCGAGTCGGAGTTCTGGGACGCGGTGAACGAGGAAGACCTCCAGAGCCTCGCCGAA
ACCCTCGACATCGACGCCTCTGCTCTGGACACGGTGGTGCCCGCACTCTCCGCCTGGCAC
CGCCACCAACACGACCAAGCCCGCATCAACACCTGGACCTACCAGGAAACCTGGAAACCC
CTCACCCTCCCCACCACCCACCAACCCCACCAAACCTGGCTCATCGCCATCCCCGAAACC
CAGACCCACCACCCCCACATCACCAACATCCTCACCAACCTCCACCACCACGGCATCACC
CCCATCCCCCTCACTGTCAACCACACCCACACCAACCCCCAACACCTCCACCACACCCTC
CACCACACCCGACAACAAGCCCAAAACCACACCACCGGACCCATCACCGGCCTGCTCTCC
CTCCTCGCCCTCGACGAAACACCCCACCCCCACCACCCCCACACACCCACCGGCACCCTC
CTCAACCTCACCCTCCCCCAAACCCACACCCAAACCCACCCACCAACCCCCCTCTGGTAC
GCCACCACCAACGCCACCACCACCCACCCCAACGACCCCCTCACACACCCCACCCAAGCC
CAAACCTGGGGACTCGCCCGCACCACCCTCCTCGAACACCCCACCCACACCGCCGGAATC
ATCGACCTCCCCACCACCCCCACCCCCCACACCCTCCACCACCTCACCCAAACCCTCACC
CAACCCCACCACCAAACCCAACTCGCCATCCGCACCACCGGCACCCACACCCGCCGCCTC
ACCCCCACCACCCTCACCCCCACACACCAACCACCCACCCCCACCCCCCACGGAACCACC
CTCATCACCGGCGGAACCGGCGCCCTCGCCACCCACCTCACCCACCACCTCACCACCCAC
CAACCCACCCAACACCTCCTCCTCACCAGCCGAACCGGCCCCCACACCCCCCACGCACAA
CACCTCACCACCCAACTCCAACAAAAAGGCATCCACCTCACCATCACCACCTGCGACACC
AGCAACCCAGACCAACTCCAACAACTCCTCAACACCATCCCCCCACAACACCCCCTCACC
ACCGTCATCCACACCGCAGGCGTCAATCTCTTCGCCCCCGTGTCGGAAACCGATGCCGAA
TCCTTCTCTTCCGTTACGGCAGCGAAGGCAACGGGCGCGGCGATTCTGCATGAGTTGCTG
CTGGACCATGAAACGCTTGAACACTTCATTCTCTTCTCGTCGGGCGCCGGCGCTTGGGGC
AGCGGGAATCAGTGCGCATACTCGGCGGCCAACGCATACCTGGACGCGCTCGCGACGCAT
CGTCAGACACATGGACTTCCCGGGGCATCGATCGCCTGGGGCCCCTGGGCCGGAAAGGGC
ATGTCGGCCGGTGATGCGGCTCATGGTTACCTGGAAAAGCGCGGCATTCTGCCGATGGAG
CCACGCATGGCGCTCGCGGCATTCCATCGTGCGCGGGCGCAGCGGCCGAATTCCAACCTG
ATCATCGCGGACATCGACTGGGAGCGCTTCGTCCCCGCCTTCACCGCTCGACGCCACAGC
CCGCTCATCGAGGACATTCCGGAGGTTCGGCAAGCGGCTCAGGAGCTGGAAGCAGCTGCG
TCGACGGCAAAGACGACCACAGCTCAGCCGATTGCGACGTCTCTCCGTGAGCGATTGGCC
CGACTGACGTCCTCAAAGCAGAACCAGGTGCTGCTCGGCCTGATTCGGACAGGCATCTGC
ACCGTTCTCGGCCTTCGTAATCCGGAAGGCATCGAGGACCAACGAGCCTTCCGCGACCTC
GGCTTCGACTCGCTGACGTCGGCTCAGTTCAGCAAGGAACTCGCCAAGGAAACCGGACTG
CCACTCCCCCCGTCCCTGGTCTTCGACTATCCCACCCCGCAGGAATGTGCTGCCCATCTG
CGCACACAACTCGTCGACCTAGACGACGAAGAGGACGCGGCACTGTCGAATGCTCTCCCG
CAAGTGGCCCATCGGCGTACCGTCGAGGACGAACCGATCGCCATCATCGGTATGGCATGT
CGCTTCCCCGGCGGCGTACGTTCTGCCGACGACCTGTGGGAATTGCTCGCTTCGGGTAAG
GACGCTATCGGCGTCTTCCCGACCGACCGCGGCTGGGACCTGGACACGCTCTACGACCCC
GACCCCGACCACCCCGGCACCTGCTACACCCGAAACGGCGGATTCCTCTACGGCGCAGGC
CACTTCGACGCCGAATTCTTCGGCATCAGCCCCCGCGAAGCCCTCGCCATGGACCCCCAG
CAACGACTCCTCCTCGAAACCGCCTGGGAAACCATCGAACACGCCGGCATCAACCCCCAC
ACCCTCCACGGCACCCCCACCGGAGTCTTCGCCGGAATCAACGCTCAAGACCACGCCGCG
CATATCCGCCAAAGCCGTGATGTGGAGACCATCGAGGGCTACGCCCTGACCGGCAGTTCG
GGAAGTGTGGCGTCCGGCCGGGTGGCCTACACGCTCGGGCTCGAAGGCCCCGCGGTGTCG
GTGGATACGGCGTGTTCGTCGTCGTTGGTGGCGTTGCATTGGGCGGCGCAGGCGTTGCGT
GCGGGTGAGTGTTCGATGGCGCTTGCCGGGGGTGTGACGGTGATGTCGTCTCCGGGTACG
TTTGTGGAGTTCTCACGTCAGCGGGGTCTGGCCGCGGACGGGCGGTGCAAGGCCTATTCG
GCGGCTGCTGACGGTACCGGCTGGGCCGAGGGTGTGGGGATGCTGCTGGTGGAGCGGCTC
TCCGACGCCCGTCGCAACGGTCACCGTGTCCTGGCCGTGGTGCGTGGCAGTGCGGTCAAC
CAGGACGGTGCGAGCAACGGTCTGACCGCGCCCAACGGGCCCTCCCAGCAGCGTGTCATC
CGTCAGGCCCTGGCCAATGCGGGACTGACCCCGGCCGATGTCGACGCAGTGGAGGGCCAC
GGCACCGGGACCACTCTGGGGGACCCGATCGAGGCCCAGGCACTCCTGGCCGCCTACGGA
CAACACCGCCCCCACCACCGCCCCTTGTGGCTGGGATCCCTCAAATCCAACATCGGGCAC
GCACAGGCCGCCGCGGGCGTGGGCGGAGTCATCAAGATGGTGATGGCCCTGCGCAACGGG
CTGCTGCCACAGACCCTCCACGTGGACGAGCCCACCCCCCAGGTCGACTGGTCCACAGGC
GCAGTACAACTCCTGACACAACCGGTGCCCTGGCCCGCCGACCCGGCCGGCCGGCCACGC
CACGCCGGCGTGTCATCATTCGGCGTCAGCGGCACCAACGCCCACATCATCCTCGAAGAA
GCACCCACTCCCCAGGACAGCGATACCGACGACGAACCGCCTGCCAACGCACCAGCCCTG
CCCCATCCCCTCCCTCTTCCCGTGCCGGTGTCGGCGAGGTCTGAGGCCGGGTTGCGGGCG
CAGGCACAGGCGTTGCGCCAGTACGTGGCAGCCCGCCCGGACATGTCACCTGCCGACATT
GGTGCGGGTCTGGCCCGCGGCCGGGCCGTACTGGAACACCGCGCCGTCATCCTGGCCGCG
GACCGCGAGGAACTGGCGCAGGCACTGACAGCCCTGGCAGCCGGCGAACCCCACCCCCAC
ATCACCACAGGCCACACCCGGGGCGGTGACCGCGGCGGCGTCGTCTTCGTCTTCCCCGGA
CAGGGCGGCCAGTGGGCCGGGATGGGCCTGACCCTGCTCACCTCCTCACCCGTGTTCGCC
GAACACATCGACGCATGCGAGAAAGCCCTCACCCCCTGGGTGCCCTGGTCCCTGACCGAC
ATCCTGCACCGCGACCCCGACGACCCCGCATGGCAACAAGCCGACGTGGTCCAGCCCGTG
CTCTTCAGCATCATGGTCTCCCTCGCCGCCCTGTGGCGCTCCTACGGCATCGAACCCGAC
GCGGTCCTCGGCCACTCCCAGGGAGAAATCGCCGCCGCCCACATCTGCGGCGCACTCAGC
CTGAAAGACGCCGCCAAAACCGTTGCACTGCGCAGCCGCGCACTGGCCGCCGTACGAGGC
CGGGGCGCCATGGCCTCACTGCCCCTGCCCGCCCAGGACGTGCAGCAGCTCATTTCCGAA
CGGTGGGAAGGGCAGTTGTGGGTGGCAGCCCTCAACGGCCCCCACTCCACCACCGTCTCC
GGCGACACCAAGGCGGTGGATGAGGTGCTGGCGCACTGCACCGACACCGGCCTACGGGCC
AAACGCATCCCCGTCGACTACGCCTCCCACTGCCCCCACGTCCAACCCCTCCACGACGAA
CTCCTGCACCTGCTGGGAGACATCACCCCCCAGCCGTCCACCGTGCCGTTCTTCTCCACC
GTGGAAGGCACCTGGCTGGACACCACAACCCTGGACGCCGCCTACTGGTACCGCAACCTC
CACCAGCCCGTCCGCTTCAGCCACGCCATCCAGACCCTGACCGACGACGGACACCGCGCC
TTCATCGAAATCAGCCCCCACCCCACCCTCGTCCCCGCCATCGAAGACACCACCGAAAAC
ACCACCGAAAACATCACCGCGACCGGCAGCCTCCGCCGCGGCGACAACGACACCCACCGC
TTCCTCACCGCCCTCGCCCACACCCACACCACCGGCATCGGCACACCCACCACCTGGCAC
CACCACTACACCCAAACCCACCCCCACCCCAACCCCCACACCCACCTCGACCTGCCCACC
TACCCCTTCCAACACCAGCACTACTGGCTCCAACCACCCACCACAACAACCGACCTCACC
ACCACCGGCCTCACCCCCACCCACCACCCCCTCCTCACCGCCACACTCACCCTCGCCGAC
AACAACACACAACTACTCACCGGCCGCCTCTCCCTACGCACCCACCCCTGGCTCACCGAC
CACACCGTCGCCGGCATGGTCCTCCTGCCGGGCACCGCGCTCCTCGAACTCGCCCTCCAA
GCCGGCGAACGGGTGGACTGCCCTCGGGTGGAGGAACTGACCCTGCACGCACCGTTGGTG
ATCCCGCACACCGAGGACGTGACGTTGCAGGTCACCGTTCGGGCAGCCGATGAGAGTGGC
CATCGCGCCCTCGCGATCCACTCGTACTCCGGCACCGCGTCGTCGGCGGACCGGGAGTGG
ACCCGTCACGCCACGGGCCTCCTCACACACCACGCCGACACCGATCACCGTGCCGACACG
CACACGGACGCGTGCCTTGGCGGGAGCTGGCCCCCGCCCGGCGCGCAGCCCATCGAACTG
GGCGACGTCTACGGTCGTATGGCGGCGGACTCGGACATCGCCTACGGGCCGGTCTTCCAG
GGGCTGCACGCCGCCTGGAGGTTCGGCGACGATGTCCTGGCCGAGGTGCGTCTGCCGGAA
GAGGCTCTGCGCGATGCTCCGGCGGCGGCCTTCGGTGTTCACCCGGCCTTGCTCGACGCG
GCCCTGCACGCCACGGCGCTCACCCCCCAGAACGGGGACGGCTCGACGGAGAACGTCGCC
CAGGAGAGCATGCCTGACCGCGCAGCCCACCAGGCGCGACTGCCGTTCAGCTGGAGCGGC
GTGTCCCTGCACACGGCGGGCAGTTCCGTGTTGCGCGTACGGCTGTCGCGCAGTCCGCAG
CACGGTAATGCCGTGGCCCTCACCGCGGCCGACGAGGACGGTCGGCCGGTGGTGACGATC
GAGTCGCTCGCGCTGCGGCCGGTGTCCACCGAGGAGCTGCGCGCGGCCGCGGATCGTACG
CCCGAGCACGAGTCGCTCTTCCGACTGGACTGGGTTTCCGTACCAGTGCCCGCCAACGCC
CCTTCGCCCACCGCGGACCGGCCCTGGGCGGTCATCGGCGCGGGCCTTCCCCACCTGCCC
GGCCTGACGGAGCACGAGCACGTGACCGCGTATGACGAGCCGGCGGACCTGCTTCTGGCT
CTGGACCGCGGTGCTCCGCCGCCCGGTGTGCTGGTCGTAGGTGGTGTCGCCCACACCGAA
GCCCGGGAGTATTCCGCCGAAGCCCCCGGGGAGCGCGGGACCGAGGCCTGCGAGGCCCGG
CCGGACGTCGTGCACGTGGGCGTCGTGCACACGGCTGCCGTGCACGCGGCTGCCGCGCAG
ATGTTGGCCAGGCTCCAGGCCTGGCTGGGCGACGAGCGCCTCGCAGACAGCCGGCTGCTC
GTCCTGACGTGCGGCGCGGTCGCCCGCGCCTCCGGCGACGATGCGACGGACCTGCCCGGG
GCCGCCGTGTGGGGGCTGGTGCGTTCGGCGCAGTCCGAGCACCCGGACCGCATCACGCTG
CTGGACTTCGAGCGGGGCACAGAGGCGGAGCCCGGTCAGCTGGCGACGGCGCTGAACTGC
GGGGAGCGGCAGCTTGCCGTCCGCCCCGGAGGGCTGTTCACGCCACGGCTGGTGCGCGCG
CCACGTGTCGCCGACGCCGTACCCGCCGTACCCGCCGTGGCCGTACCGTCAGCGGGTCAC
GCAGCCGTACCGGCAGCGGGTCCCTTCCTTCCGGGCGGAACGGTGCTGATCACCGGCGGA
ACCGGTGTCCTGGGCCGGCTCGTGGCCCGGCATCTGGTGGAGGCGCACGGCGTACGGCAT
CTGTTGCTGGCGGGTCGGCGCGGACCGGACGCCGAGGGTGCGCCGGAGTTGCGGGCGGAG
CTCGGTGGGCTCGGCGCGACGGTGGAGGTCGTCGCCTGCGACGCGGCGGACCGGCAGCAG
CTGGCCGACCTGCTGACACGGATCCCCGACGATCGGCCGCTGACCGGTGTCGTGCACAGT
GCGGGCATCCTGGACGACGGCGTGATCACGTCGCTGTCGCCGGAGCGGCTCGGGGCCGTC
CTCCGGGCCAAGGCGGACGCTGCGCTGCTTCTCGACGAGCTGACGCGCGGGGCAGAGCTG
TCGGCTTTCGTCATGTTCTCCTCCGCGTCGGCGGTGGTCGGCTCGCCCGGGCAGGGCAAC
TACGCCGCCGCCAACGCCGTCCTCGACTTCCTTGCTCATCGCCGCCGCGCCGAGGGGCTG
CCCGCCGTCTCTCTCGCCTGGGGCCTGTGGGAAGAGGGCACAGGGATGACGGGCCACCTC
GACGTCGACGACCATGCGCGGATCAGCCGCGCGGGAATGCGGCCGCTGCCGACTGCCGAG
GCTCTGGCGCTGTTCGACGCGGCCTTGGCCGACGGCGAGCCGTTCCTGATGCCGGCTCGG
CTCGACCTCACGGCCGTACGGTCTGGTGCCGCGTCCGCACCGGTGCCGCCGCTGCTGCAA
GGTCTGCTTCAGCTGCCTCGGTCCCGCTCGGCCGCCGCGGCCCCCGGCCATGGGGCCCCG
GCGGCGGACGAGGCGGCGGCCTGGCGTGAGCGTCTGGCCCGGCAGAGTGCCGGTGAGCGC
AGGCAGGCGCTGCTGCGCCTGGTGCGGTCGCATGTCGCGGCGGTGCTCGGCCATAGCGGT
GCCGACGGAATCGACGCATCGCGGGCGTTCCGCGAGCTGGGGTTCGACTCGCTCACGGCG
GTCGAGCTGCGCAACCGTCTCACGGCCGCGACGGGCCTGCGGCTGCGGGCCACGCTGGCC
TTCGATTTCCCGACCCCGGCAGCGCTGGCCGAGCACTTGGGCGAGCGTCTGCTTCCCGAC
CAGGAGGCCACGGGCGAGCAAGCCGGCGATCAGCTCTCCGGCGGCAGCGAGGAGGACGTA
CGCAGCCTCCTGACGTCCATTCCGATCGGCAGGCTGCGGGACGCGGGGCTCCTCGGGCCC
CTGCTCACGCTCGCGGACACGGGCCGCGGCGCCTCGGGCGCCGCCGCAGGTCCGGAGGAC
GCGCCGCCCTCCGGCCAGGACACACCGGCTCCCGTCTCGATCGACGAGATGGACATCGAC
GACCTGATGGATCTGGCGCACGGGCATGGCACCGCACCCGCCCGTGAGCCCGCCGACGCA
GAGGACTCGTCGTCATCACGAAACCGGACACACCACACACACGAAGGTGAGACAGCGTGA
[3] KS36..413
[3] AT597..915
[3] malonyl-CoA797..801
[3] ACP986..1060
[4] KS1082..1459
[4] AT1654..1972
[4] malonyl-CoA not conserved HAFHS(H->N)1854..1858
[4] KR2306..2486
[4] ACP2583..2657
[5] KS2679..3055
[5] AT3231..3546
[5] malonyl-CoA3428..3432
[5] KR3878..4059
[5] ACP4171..4241
[6] KS4270..4649
[6] AT4815..5133
[6] methylmalonyl-CoA not conserved YASHS(S->C)5007..5011
[6] DH5189..5362
[6] KR5753..5933
[6] ACP6044..6114
[3] KS106..1239
[3] AT1789..2745
[3] malonyl-CoA2389..2403
[3] ACP2956..3180
[4] KS3244..4377
[4] AT4960..5916
[4] malonyl-CoA not conserved HAFHS(H->N)5560..5574
[4] KR6916..7458
[4] ACP7747..7971
[5] KS8035..9165
[5] AT9691..10638
[5] malonyl-CoA10282..10296
[5] KR11632..12177
[5] ACP12511..12723
[6] KS12808..13947
[6] AT14443..15399
[6] methylmalonyl-CoA not conserved YASHS(S->C)15019..15033
[6] DH15565..16086
[6] KR17257..17799
[6] ACP18130..18342

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [591-730]  G3DSA:3.40.366.10 [797-904]  G3DSA:3.40.366.10 [1648-1787]  G3DSA:3.40.366.10 [1854-1961]  G3DSA:3.40.366.10 [3225-3361]  G3DSA:3.40.366.10 [3428-3535]  G3DSA:3.40.366.10 [4809-4938]  G3DSA:3.40.366.10 [5007-5124]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR002198 Short-chain dehydrogenase/reductase SDR (Family)
 [5753-5920]  6.09999999999996e-60 PF00106
PF00106   adh_short
IPR006162 Phosphopantetheine attachment site (PTM)
 [1018-1033]  PS00012 [2615-2630]  PS00012 [6072-6087]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [986-1060]  PS50075 [2583-2657]  PS50075 [4171-4241]  PS50075 [6044-6114]  PS50075
PS50075   ACP_DOMAIN
 [988-1090]  3.60000000000002e-67 G3DSA:1.10.1200.10 [2585-2661]  8.29999999999993e-81 G3DSA:1.10.1200.10 [4171-4244]  8.29999999999993e-81 G3DSA:1.10.1200.10 [6041-6117]  8.29999999999993e-81 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [993-1059]  2.2e-08 PF00550 [2596-2656]  6.1e-08 PF00550 [4175-4240]  1.6e-11 PF00550 [6047-6113]  1.9e-11 PF00550
PF00550   PP-binding
 [983-1099]  7.00000734129907e-27 SSF47336 [2580-2696]  1.29999924468179e-23 SSF47336 [4164-4287]  3.00000067992871e-23 SSF47336 [6036-6131]  1.89999859865865e-24 SSF47336
SSF47336   ACP_like
IPR013968 Polyketide synthase, KR (Domain)
 [2306-2485]  7.10000000000008e-59 PF08659 [3878-4058]  9.50000000000005e-55 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [36-287]  2.50000000000001e-95 PF00109 [1082-1333]  3.30000000000002e-95 PF00109 [2679-2930]  4.09999999999995e-100 PF00109 [4270-4523]  8.19999999999987e-100 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [295-413]  1.70000000000001e-44 PF02801 [1341-1459]  1.09999999999999e-46 PF02801 [2938-3055]  2.4e-49 PF02801 [4531-4649]  4.2e-47 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [597-915]  3.29999999999997e-50 PF00698 [1654-1972]  7.00000000000005e-49 PF00698 [3231-3546]  5.40000000000002e-50 PF00698 [4815-5133]  6.29999999999998e-105 PF00698
PF00698   Acyl_transf_1
IPR015083 Polyketide synthase, docking (Domain)
 [6-36]  7.49999605851445e-06 SSF101173
SSF101173   Polyketide_synth_docking
 [4-29]  6.70000000000001e-10 PF08990
PF08990   Docking
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [594-903]  4.30000170645869e-59 SSF52151 [1651-1960]  3.49999466863949e-58 SSF52151 [3228-3534]  6.60002751141562e-59 SSF52151 [4813-5116]  6.80004680300024e-68 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [731-796]  8.19999506482837e-13 SSF55048 [1788-1853]  2.00000075217456e-12 SSF55048 [3362-3427]  8.19999506482837e-13 SSF55048 [4940-5006]  5.00000909915354e-17 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [35-298]  G3DSA:3.40.47.10 [299-466]  G3DSA:3.40.47.10 [1091-1345]  G3DSA:3.40.47.10 [1346-1512]  G3DSA:3.40.47.10 [2679-2941]  G3DSA:3.40.47.10 [2942-3108]  G3DSA:3.40.47.10 [4270-4534]  G3DSA:3.40.47.10 [4535-4703]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [28-411]  6.19996040581186e-101 SSF53901 [1075-1457]  1.59998835313644e-99 SSF53901 [2672-3054]  1.59998835313644e-105 SSF53901 [4263-4647]  2.39999798157263e-102 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [2307-2492]  3.49999999999995e-94 G3DSA:3.40.50.720 [3879-4065]  3.49999999999995e-94 G3DSA:3.40.50.720 [5750-5963]  1.60000000000002e-92 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [200-216]  PS00606 [1246-1262]  PS00606 [2843-2859]  PS00606 [4436-4452]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [598-947]  1.79999754022378e-102 SM00827 [1655-2075]  2.2999842048857e-96 SM00827 [3232-3595]  1.79999754022378e-102 SM00827 [4817-5115]  2.89997807423186e-124 SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [991-1063]  1.39999892049878e-29 SM00823 [2588-2660]  1.20000117458134e-30 SM00823 [4172-4244]  7.30000590915208e-24 SM00823 [6045-6117]  1.3000049540733e-34 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [5189-5362]  4.60000044330866e-72 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [38-466]  SM00825 [1085-1512]  SM00825 [2682-3109]  SM00825 [4273-4703]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [2306-2486]  8.30001403791594e-52 SM00822 [3878-4059]  1.3000049540733e-43 SM00822 [5753-5933]  3.89999861795218e-60 SM00822
SM00822   PKS_KR
SignalP
 [1-18]  0.095 Signal
Eukaryota   
TMHMM No significant hit