CDS Information : AO090023000729
Genomic map
Functional category legend:
Information storage and processing
[J] Translation, ribosomal structure and biogenesis
[A] RNA processing and modification
[K] Transcription
[L] Replication, recombination and repair
[B] Chromatin structure and dynamics
Cellular processes and signaling
[D] Cell cycle control, cell division, chromosome partitioning
[O] Posttranslational modification, protein turnover, chaperones
[Y] Nuclear structure
[V] Defense mechanisms
[T] Signal transduction mechanisms
[M] Cell wall/membrane/envelope biogenesis
[N] Cell motility
[Z] Cytoskeleton
[W] Extracellular structures
[U] Intracellular trafficking, secretion, and vesicular transport
Metabolism
[C] Energy production and conversion
[G] Carbohydrate transport and metabolism
[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism
[P] Inorganic ion transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
Poorly characterized
[R] General function prediction only
[S] Function unknown
Bi-functional protein
Unannotated
Location
Organism
Aspergillus oryzae RIB40 (= NBRC 100959)
Replicon
chromosome3
Contig
SC023
Start / Stop / Direction
1,917,581 / 1,921,419 / +
Location
join(1917581..1917904, 1917972..1918013, 1918070..1918241, 1918299..1920676, 1920730..1921419)
Type
CDS
Length
3,606 bp (1,201 aa)
Intron
1917905..1917971, 1918014..1918069, 1918242..1918298, 1920677..1920729
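The exon and intron coordinates above are mutually consistent with the reported length, and a short script can confirm this. The following is a minimal sketch, assuming the location string uses 1-based inclusive coordinates as in the record; the names `parse_join` and `introns_between` are illustrative and not part of any database tool.

```python
import re

# Location string copied from the record (1-based, inclusive coordinates).
LOCATION = (
    "join(1917581..1917904, 1917972..1918013, 1918070..1918241, "
    "1918299..1920676, 1920730..1921419)"
)


def parse_join(location):
    """Return the exon intervals of a join(...) string as (start, stop) tuples."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def introns_between(exons):
    """Return the gaps between consecutive exons as (start, stop) tuples."""
    return [
        (prev_stop + 1, next_start - 1)
        for (_, prev_stop), (next_start, _) in zip(exons, exons[1:])
    ]


exons = parse_join(LOCATION)
cds_length = sum(stop - start + 1 for start, stop in exons)
introns = introns_between(exons)
# One codon is the stop codon, so it does not contribute an amino acid.
protein_length = cds_length // 3 - 1

print(cds_length, protein_length)
print(introns)
```

The five exons sum to 3,606 bp, which corresponds to 1,201 codons plus a stop codon, and the computed gaps match the four intron intervals listed in the record.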
Annotation
Product
mismatch repair ATPase MSH6 (MutS family)
Gene name
Functional category
Information storage and processing - [L] Replication, recombination and repair
EC number
Note
KEGG pathway
Sequence feature
References
Related links to external database
Computational search results
BLASTP
Database: UniProtKB 2010_04
HAMAP
No significant hit
InterPro
No significant hit
SignalP
No significant hit
TMHMM
No significant hit
SOSUI
No significant hit
Calculated information (Amino acid sequence)
size of protein
1201 amino acids
molecular mass
134,517.08 Da
pI
5.54
aa composition
residue  %    count
Ala      8    (96aa)
Val      5.7  (69aa)
Leu      8.5  (102aa)
Ile      5.2  (63aa)
Phe      4.4  (53aa)
Trp      1.4  (17aa)
Pro      5    (60aa)
Met      2.1  (25aa)
Gly      5.7  (69aa)
Ser      8.7  (105aa)
Thr      4.7  (57aa)
Cys      1.1  (13aa)
Asn      3    (36aa)
Gln      3.7  (45aa)
Tyr      2.2  (26aa)
Lys      7.5  (90aa)
His      1.9  (23aa)
Arg      5.7  (68aa)
Asp      7.6  (91aa)
Glu      7.7  (93aa)
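The composition figures can be reproduced from the residue counts alone. The sketch below assumes that each percentage is the residue count divided by the total protein length, rounded to one decimal place; the variable names are illustrative.

```python
# Residue counts copied from the composition table in the record.
COUNTS = {
    "Ala": 96, "Val": 69, "Leu": 102, "Ile": 63, "Phe": 53,
    "Trp": 17, "Pro": 60, "Met": 25, "Gly": 69, "Ser": 105,
    "Thr": 57, "Cys": 13, "Asn": 36, "Gln": 45, "Tyr": 26,
    "Lys": 90, "His": 23, "Arg": 68, "Asp": 91, "Glu": 93,
}

total = sum(COUNTS.values())

# Percentage of each residue, rounded to one decimal place (assumed convention).
percent = {aa: round(100 * n / total, 1) for aa, n in COUNTS.items()}

print(total)
for aa, value in percent.items():
    print(f"{aa}  {value:4.1f}  ({COUNTS[aa]} aa)")
```

The counts sum to 1,201, the stated protein size, which is a quick check that no residue was dropped from the table.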
Sequence
>AO090023000729 mismatch repair ATPase MSH6 (MutS family)
ATGGCAAAAGGACCGCCCGCATCCTCTTCCCCGGCCGCAACGCCTCCCTCCGGAGTGCTG
AAACGAACTACTTCTAGCACTCAGAACATGAAGAACCAGAAGTCAATTCTAGGTTTCTTT
CAGAAGTCATCACCATCCACTCCATCTACTGCTCGCAATGCTGAACCAGCTTCGTCGCCT
GCTCAGAGAGTATCGGAGCAGCGCGGGGCTGCCAGAGGGTCCGTGAAGAGCGACAAGAAG
AAGAGTCTCCCACAGCTCTCGGACCTTAGCCCGGTACCAAGTAGTGATCTAGTTGAGCCT
GAGGAGGATGAGGGCCATATTCAGGCGACTTCTAATGACGCAAAGACCGATTCCCCATCT
CGACGGCCTAAAAAGCAGGTCAACTACTTCGAGTCCGATTCAGAAGGCGAGGATGACGAC
GAAAAGATATTCCGCCCAGGCCGGAAAAGTAGCAAAATATCTAAGCGAAGAAAGTTATCT
CCCGAGAGCGATGATGAATTTGAACAAGGTGGAGATGATGCCGGCTACTCGGACGAAGAT
ATGGATGATTTTATCGTTGCGGATGACTCAGATGAGGACGTAAAGACTTCGAAGAAGCGC
AAGAGGCCAACTCAACCGAAGCCCAAGTCGTCGTCTGTACCGCCAGTCCCATCCTTTGAG
GAGGATATGGATTTGAACATTCCTGATGCATCTTCTGGCTCCGCGATGAAATGGACGTAT
GATCCGGATAGTGCCGAGCCTCGTCAGAATCGAACAGCCCCTGCGAAGTCGAAGAGCCCA
TCTGGTAAGAAACTGAAAGCTCACGTTACAGAACCAGAGCAACGCTATGCCTGGCTGGCA
AACATTCGGGATATAGATGGCCATTCCCCGGGCCACCCTGACTACGACCCTCGTACCCTT
TATATTCCTCCACTGGCTTGGGCTAAATTTTCACCCTTCGAAAAACAGTACTGGGAGATC
AAGCAGAAGTTTTGGGACACGGTCGTATTTTTTAAGAAGGGAAAGTTTTATGAGCTTTAT
GAAAATGACGCCACCATTGGACACCAGTTATTCGACCTCAAACTTACCGATCGCGTGAAC
ATGCGTATGGTGGGTGTTCCTGAAATGAGCTTGGATCACTGGGCAAATCAGTTTGTCGCC
AAGGGATTCAAGATTGCTCGCGTCGACCAAATAGAGTCCGCCCTCGGCAAGGAAATGCGC
GAAAGAGATGGAAAGAAAGGGGGCAAAGAAGACAAGGTCATCAGGAGAGAGCTGTCTTCT
GTCCTTACTGCTGGAACGCTGGTTGAGGGTTCGATGCTGCAGGATGACATGTCTACTTAC
TGCGTAGCTATCAAAGAAGCCATCATAGAGGATTTCCCTGCGTTTGGGCTAGCTTTCGTT
GACACAGCGACGGGACAATTCTTCTTGTCGGAGTTTGTCGATGATGCCGACATGACAAAG
TTTGAAACATTCGTGGCTCAGACTCGCCCGCAAGAACTACTTCTCGAAAAGTCCACAGTT
TCACAGAAGGCACTCCGAATTTTGAAGAATAATACAGGCCCCACTACCATTTGGAACCAC
TTGAAGCCAGGGAAAGAGTTCTGGGAAGCGGACATTACGGTGAAAGAAATGGATGTCAGC
GAGTATTTTGTTTCTGAGGACGATGACAACCTGAAGGCGTGGCCGGAAGCTCTTCGAGCG
GCTCGCGATAAAGAACTAGTGATGTCAGCCTTTGGGGCGTTGGTACAATATCTCCGGCTC
TTGAAACTTGACCGTGACTTGATCACAATCGGTAACTTCTCCTCGTACGATCCAATCAAG
AAGGCTTCTAGCCTAGTGTTGGATGGCCAGACCCTTATCAACATGGAAATCTTTGCAAAT
TCATTTGATGGAGGCTCTGATGGGACACTATTCCAGCTTTTGAATCGCTGCATCACACCA
TTCGGCAAGCGCATGTTCAAGCAGTGGGTTTGCCACCCGCTGATAGATGCGAAGAAAATC
AATGCCAGATTGGATGCTGTTGATGCCCTTAACGCGGATCCTAACATTCGGGATCAGTTT
TCTTCACAATTGACCAAGATGCCTGATTTGGAAAGACTGATTTCCCGTATTCATGCTGCT
AATTGCAAAGCCCAGGACTTCCTGCGTGTGCTGGAGGGCTTCGAGCAAATTGAATACACC
GTGAGTCTTCTTAAGGACAGTGGTTCTGGAGAAGGCGTTATTGGTCAGTTGATAAGTGCG
ATGCCGGACCTGAATGAGTTGCTTGAATACTGGAAAACAGCATTTGACCGCACCAAAGCA
AGGGAGAATGGCATTCTAGTACCTAAATCAGGCGTTGAAGAAGACTTCGACAACTCCCAA
GAGTACATCGAAGAACTCCACAACGAGCTTGATAGTCTTCTGAAGCGGGTTCGTCGTGAG
TTAGGCTCTACTGCCATTTGCTACCGTGACAACGGCAAGGAGATCTACCAGTTGGAGGTG
CCGATCAAGGTAAAGAACATTCCCAAGAACTGGGACCAGATGTCAGCAACTAAACAAGTG
AAACGGTATTATTTCCCTGAACTTCGGACGATCATCCGTAAATTGCAGGAAGCCCAAGAG
ACACATAGTCAGATTGTGAAAGAAGTCGCTGGCCGATTCTATGCCCGGTTTGACGAACAT
TATATAACATGGCTGGCAGCAGTCAAGATTATCTCGCAGCTGGACTGCTTGATAAGTCTT
GCAAAGGCTTCCTCATCTTTGGGACAACCAAGTTGCCGCCCTGTCTTTGTTGAGGATGAG
CGAAGTGTACTCGAATTCGAGGAGCTACGCCATCCGTGCCTCCTCTCCTCTGTGGAAGAC
TTCATACCGAATGACATCAAATTGGGGGGTGACCGTGCTAACATTGATCTTCTCACTGGT
GCTAATGCTGCCGGAAAATCCACTGTCCTTCGAATGACGTGTGTCGCCGTGATCATGGCT
CAGATCGGTTGCTACTTGCCTTGCCAATCTGCGCGATTGACCCCCGTGGACCGCATCATG
TCCCGTTTAGGCGCAAATGACAATATCTTCGCCGCTCAGTCAACGTTCTTCGTAGAGCTT
TCCGAAACCAAGAAGATCCTCTCCGAAGCTACTCCCCGATCTTTGGTTATCCTCGATGAA
TTGGGTCGTGGAACCAGCTCATACGACGGAGTGGCCGTGGCCCAAGCCGTACTGCACCAT
GTTGCTACGCACATTGGAGCCTTGGGATTCTTCGCGACCCACTACCACTCCCTGGCGGCT
GAGTTTGAAGGACACCCTGAGATCACACCGAAACGCATGAAGATCCATGTAGACGATGAG
GAGCGACGTGTTACGTTCCTGTATAAACTGGAAGACGGTGTTGCCGAAGGTAGTTTCGGC
ATGCACTGTGCCGCCATGTGTGGTATTTCCAGCAAGGTCATCGAGAGAGCGGAGGTCGCC
GCAAAGCAATGGGAACACACTAGTCGTCTCAAAGAGAGTCTTGAACGCCGAAAAGGTGGT
GGTTTCATTGGGTTGGGTTGGTGGAGCGATGTCGCCTGGGCCCTTCGAGAGTCCTCTGAC
GTCAATGAGCACGAAGTCACCGATAGAGGTCTGGACGTACTTCTTAAGGCTATTGAAGCT
CTGTGA
>AO090023000729 mismatch repair ATPase MSH6 (MutS family)
MAKGPPASSSPAATPPSGVLKRTTSSTQNMKNQKSILGFFQKSSPSTPSTARNAEPASSP
AQRVSEQRGAARGSVKSDKKKSLPQLSDLSPVPSSDLVEPEEDEGHIQATSNDAKTDSPS
RRPKKQVNYFESDSEGEDDDEKIFRPGRKSSKISKRRKLSPESDDEFEQGGDDAGYSDED
MDDFIVADDSDEDVKTSKKRKRPTQPKPKSSSVPPVPSFEEDMDLNIPDASSGSAMKWTY
DPDSAEPRQNRTAPAKSKSPSGKKLKAHVTEPEQRYAWLANIRDIDGHSPGHPDYDPRTL
YIPPLAWAKFSPFEKQYWEIKQKFWDTVVFFKKGKFYELYENDATIGHQLFDLKLTDRVN
MRMVGVPEMSLDHWANQFVAKGFKIARVDQIESALGKEMRERDGKKGGKEDKVIRRELSS
VLTAGTLVEGSMLQDDMSTYCVAIKEAIIEDFPAFGLAFVDTATGQFFLSEFVDDADMTK
FETFVAQTRPQELLLEKSTVSQKALRILKNNTGPTTIWNHLKPGKEFWEADITVKEMDVS
EYFVSEDDDNLKAWPEALRAARDKELVMSAFGALVQYLRLLKLDRDLITIGNFSSYDPIK
KASSLVLDGQTLINMEIFANSFDGGSDGTLFQLLNRCITPFGKRMFKQWVCHPLIDAKKI
NARLDAVDALNADPNIRDQFSSQLTKMPDLERLISRIHAANCKAQDFLRVLEGFEQIEYT
VSLLKDSGSGEGVIGQLISAMPDLNELLEYWKTAFDRTKARENGILVPKSGVEEDFDNSQ
EYIEELHNELDSLLKRVRRELGSTAICYRDNGKEIYQLEVPIKVKNIPKNWDQMSATKQV
KRYYFPELRTIIRKLQEAQETHSQIVKEVAGRFYARFDEHYITWLAAVKIISQLDCLISL
AKASSSLGQPSCRPVFVEDERSVLEFEELRHPCLLSSVEDFIPNDIKLGGDRANIDLLTG
ANAAGKSTVLRMTCVAVIMAQIGCYLPCQSARLTPVDRIMSRLGANDNIFAAQSTFFVEL
SETKKILSEATPRSLVILDELGRGTSSYDGVAVAQAVLHHVATHIGALGFFATHYHSLAA
EFEGHPEITPKRMKIHVDDEERRVTFLYKLEDGVAEGSFGMHCAAMCGISSKVIERAEVA
AKQWEHTSRLKESLERRKGGGFIGLGWWSDVAWALRESSDVNEHEVTDRGLDVLLKAIEA
L
>AO090023000729 with intron
ATGGCAAAAGGACCGCCCGCATCCTCTTCCCCGGCCGCAACGCCTCCCTCCGGAGTGCTG
AAACGAACTACTTCTAGCACTCAGAACATGAAGAACCAGAAGTCAATTCTAGGTTTCTTT
CAGAAGTCATCACCATCCACTCCATCTACTGCTCGCAATGCTGAACCAGCTTCGTCGCCT
GCTCAGAGAGTATCGGAGCAGCGCGGGGCTGCCAGAGGGTCCGTGAAGAGCGACAAGAAG
AAGAGTCTCCCACAGCTCTCGGACCTTAGCCCGGTACCAAGTAGTGATCTAGTTGAGCCT
GAGGAGGATGAGGGCCATATTCAGGTATGATTAGTACTTTAGCACGATCTCACGTTTGAC
GCCGATAGAGCTTCGCTCACTGGATTGGCAGGCGACTTCTAATGACGCAAAGACCGATTC
CCCATCTCGACGGGTAAGCTACCATGTCCCTAAGGCTTCTGTATCATAGTGTATACTGAT
TGTCTGTAGCCTAAAAAGCAGGTCAACTACTTCGAGTCCGATTCAGAAGGCGAGGATGAC
GACGAAAAGATATTCCGCCCAGGCCGGAAAAGTAGCAAAATATCTAAGCGAAGAAAGTTA
TCTCCCGAGAGCGATGATGAATTTGAACAAGGTGGAGATGATGCCGGCTACTCGGACGAA
GGTGTGTTATTGATCTCATAAGCCCTCGTTGCAGTTTGACAGGTATTGACTGTTTCAGAT
ATGGATGATTTTATCGTTGCGGATGACTCAGATGAGGACGTAAAGACTTCGAAGAAGCGC
AAGAGGCCAACTCAACCGAAGCCCAAGTCGTCGTCTGTACCGCCAGTCCCATCCTTTGAG
GAGGATATGGATTTGAACATTCCTGATGCATCTTCTGGCTCCGCGATGAAATGGACGTAT
GATCCGGATAGTGCCGAGCCTCGTCAGAATCGAACAGCCCCTGCGAAGTCGAAGAGCCCA
TCTGGTAAGAAACTGAAAGCTCACGTTACAGAACCAGAGCAACGCTATGCCTGGCTGGCA
AACATTCGGGATATAGATGGCCATTCCCCGGGCCACCCTGACTACGACCCTCGTACCCTT
TATATTCCTCCACTGGCTTGGGCTAAATTTTCACCCTTCGAAAAACAGTACTGGGAGATC
AAGCAGAAGTTTTGGGACACGGTCGTATTTTTTAAGAAGGGAAAGTTTTATGAGCTTTAT
GAAAATGACGCCACCATTGGACACCAGTTATTCGACCTCAAACTTACCGATCGCGTGAAC
ATGCGTATGGTGGGTGTTCCTGAAATGAGCTTGGATCACTGGGCAAATCAGTTTGTCGCC
AAGGGATTCAAGATTGCTCGCGTCGACCAAATAGAGTCCGCCCTCGGCAAGGAAATGCGC
GAAAGAGATGGAAAGAAAGGGGGCAAAGAAGACAAGGTCATCAGGAGAGAGCTGTCTTCT
GTCCTTACTGCTGGAACGCTGGTTGAGGGTTCGATGCTGCAGGATGACATGTCTACTTAC
TGCGTAGCTATCAAAGAAGCCATCATAGAGGATTTCCCTGCGTTTGGGCTAGCTTTCGTT
GACACAGCGACGGGACAATTCTTCTTGTCGGAGTTTGTCGATGATGCCGACATGACAAAG
TTTGAAACATTCGTGGCTCAGACTCGCCCGCAAGAACTACTTCTCGAAAAGTCCACAGTT
TCACAGAAGGCACTCCGAATTTTGAAGAATAATACAGGCCCCACTACCATTTGGAACCAC
TTGAAGCCAGGGAAAGAGTTCTGGGAAGCGGACATTACGGTGAAAGAAATGGATGTCAGC
GAGTATTTTGTTTCTGAGGACGATGACAACCTGAAGGCGTGGCCGGAAGCTCTTCGAGCG
GCTCGCGATAAAGAACTAGTGATGTCAGCCTTTGGGGCGTTGGTACAATATCTCCGGCTC
TTGAAACTTGACCGTGACTTGATCACAATCGGTAACTTCTCCTCGTACGATCCAATCAAG
AAGGCTTCTAGCCTAGTGTTGGATGGCCAGACCCTTATCAACATGGAAATCTTTGCAAAT
TCATTTGATGGAGGCTCTGATGGGACACTATTCCAGCTTTTGAATCGCTGCATCACACCA
TTCGGCAAGCGCATGTTCAAGCAGTGGGTTTGCCACCCGCTGATAGATGCGAAGAAAATC
AATGCCAGATTGGATGCTGTTGATGCCCTTAACGCGGATCCTAACATTCGGGATCAGTTT
TCTTCACAATTGACCAAGATGCCTGATTTGGAAAGACTGATTTCCCGTATTCATGCTGCT
AATTGCAAAGCCCAGGACTTCCTGCGTGTGCTGGAGGGCTTCGAGCAAATTGAATACACC
GTGAGTCTTCTTAAGGACAGTGGTTCTGGAGAAGGCGTTATTGGTCAGTTGATAAGTGCG
ATGCCGGACCTGAATGAGTTGCTTGAATACTGGAAAACAGCATTTGACCGCACCAAAGCA
AGGGAGAATGGCATTCTAGTACCTAAATCAGGCGTTGAAGAAGACTTCGACAACTCCCAA
GAGTACATCGAAGAACTCCACAACGAGCTTGATAGTCTTCTGAAGCGGGTTCGTCGTGAG
TTAGGCTCTACTGCCATTTGCTACCGTGACAACGGCAAGGAGATCTACCAGTTGGAGGTG
CCGATCAAGGTAAAGAACATTCCCAAGAACTGGGACCAGATGTCAGCAACTAAACAAGTG
AAACGGTATTATTTCCCTGAACTTCGGACGATCATCCGTAAATTGCAGGAAGCCCAAGAG
ACACATAGTCAGATTGTGAAAGAAGTCGCTGGCCGATTCTATGCCCGGTTTGACGAACAT
TATATAACATGGCTGGCAGCAGTCAAGATTATCTCGCAGCTGGACTGCTTGATAAGTCTT
GCAAAGGCTTCCTCATCTTTGGGACAACCAAGTTGCCGCCCTGTCTTTGTTGAGGATGAG
CGAAGTGTACTCGAATTCGAGGAGCTACGCCATCCGTGCCTCCTCTCCTCTGTGGAAGAC
TTCATACCGAATGACATCAAATTGGGGGGTGACCGTGCTAACATTGATCTTCTCACTGGT
GCTAATGCTGCCGGAAAATCCACTGTCCTTCGAATGGTACGTCACCTTGCCTGATTTGCG
TTGTAATTCGCTAACTAAATTGCCTGTAGACGTGTGTCGCCGTGATCATGGCTCAGATCG
GTTGCTACTTGCCTTGCCAATCTGCGCGATTGACCCCCGTGGACCGCATCATGTCCCGTT
TAGGCGCAAATGACAATATCTTCGCCGCTCAGTCAACGTTCTTCGTAGAGCTTTCCGAAA
CCAAGAAGATCCTCTCCGAAGCTACTCCCCGATCTTTGGTTATCCTCGATGAATTGGGTC
GTGGAACCAGCTCATACGACGGAGTGGCCGTGGCCCAAGCCGTACTGCACCATGTTGCTA
CGCACATTGGAGCCTTGGGATTCTTCGCGACCCACTACCACTCCCTGGCGGCTGAGTTTG
AAGGACACCCTGAGATCACACCGAAACGCATGAAGATCCATGTAGACGATGAGGAGCGAC
GTGTTACGTTCCTGTATAAACTGGAAGACGGTGTTGCCGAAGGTAGTTTCGGCATGCACT
GTGCCGCCATGTGTGGTATTTCCAGCAAGGTCATCGAGAGAGCGGAGGTCGCCGCAAAGC
AATGGGAACACACTAGTCGTCTCAAAGAGAGTCTTGAACGCCGAAAAGGTGGTGGTTTCA
TTGGGTTGGGTTGGTGGAGCGATGTCGCCTGGGCCCTTCGAGAGTCCTCTGACGTCAATG
AGCACGAAGTCACCGATAGAGGTCTGGACGTACTTCTTAAGGCTATTGAAGCTCTGTGA
Covered clones
NBRC No.     clone name  start      stop       length (bp)
G07-023-188  B042D08     1,761,451  1,929,768  168,318
G07-023-189  B076H12     1,780,197  1,970,691  190,495
G07-023-190  B001F04     1,786,286  1,983,124  196,839
G07-023-191  B025H08     1,808,287  1,977,652  169,366
G07-023-192  B030F10     1,808,288  1,970,649  162,362
G07-023-193  B077C03     1,834,807  1,970,654  135,848
G07-023-194  B078D07     1,839,351  1,983,125  143,775
G07-023-195  B041C08     1,843,516  1,943,679  100,164
G07-023-196  B074D04     1,847,090  1,995,846  148,757
G07-023-197  B096F05     1,847,123  1,971,846  124,724
G07-023-198  B093H08     1,847,125  1,935,826  88,702
G07-023-199  B052D11     1,854,106  2,024,685  170,580
G07-023-200  B061G04     1,881,098  1,943,700  62,603
G07-023-201  B089A02     1,881,100  1,962,336  81,237
G07-023-202  B065C04     1,882,153  1,983,128  100,976
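The clone table can be checked against the CDS coordinates given under Location. The following is a minimal sketch, assuming clone coordinates are 1-based and inclusive on the same replicon as the CDS; `span_length` and the other names are illustrative.

```python
# CDS interval from the Location section of the record.
CDS_START, CDS_STOP = 1_917_581, 1_921_419

# (clone name, start, stop, reported length in bp), copied from the table.
CLONES = [
    ("B042D08", 1_761_451, 1_929_768, 168_318),
    ("B076H12", 1_780_197, 1_970_691, 190_495),
    ("B001F04", 1_786_286, 1_983_124, 196_839),
    ("B025H08", 1_808_287, 1_977_652, 169_366),
    ("B030F10", 1_808_288, 1_970_649, 162_362),
    ("B077C03", 1_834_807, 1_970_654, 135_848),
    ("B078D07", 1_839_351, 1_983_125, 143_775),
    ("B041C08", 1_843_516, 1_943_679, 100_164),
    ("B074D04", 1_847_090, 1_995_846, 148_757),
    ("B096F05", 1_847_123, 1_971_846, 124_724),
    ("B093H08", 1_847_125, 1_935_826, 88_702),
    ("B052D11", 1_854_106, 2_024_685, 170_580),
    ("B061G04", 1_881_098, 1_943_700, 62_603),
    ("B089A02", 1_881_100, 1_962_336, 81_237),
    ("B065C04", 1_882_153, 1_983_128, 100_976),
]


def span_length(start, stop):
    """Length of an inclusive 1-based interval."""
    return stop - start + 1


# Clones whose reported length disagrees with stop - start + 1.
length_mismatches = [
    name for name, start, stop, length in CLONES
    if span_length(start, stop) != length
]

# Clones whose insert spans the whole CDS.
covering = [
    name for name, start, stop, _ in CLONES
    if start <= CDS_START and stop >= CDS_STOP
]

# Smallest clone in the table, by reported length.
shortest = min(CLONES, key=lambda row: row[3])[0]

print(length_mismatches, len(covering), shortest)
```

Under these assumptions every reported length equals stop minus start plus one, and all 15 clones span the full CDS interval.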
Distribution of Our Microbial Genomic DNA clones
We have been distributing copies of the microbial genomic DNA clones constructed during the course of each of the genomic DNA sequencing projects.
More detailed information is available at http://www.nbrc.nite.go.jp/e/mdna-e.html