HOG COG KOG arCOG COG description NRC-1 annotation
cHOG0268COG0249KOG0221 KOG0217 KOG0218 KOG0219 KOG0220arCOG02896Mismatch repair ATPase (MutS family)mismatch repair protein

HOG Organism Gene ID Name Annotation
cHOG0268 nrc1 VNG0163 mutS1a mismatch repair protein
cHOG0268 hmu HMUK0366 Hmuk_0366 DNA mismatch repair protein MutS
cHOG0268 hma RRNAC2532 mutS4 DNA mismatch repair protein
cHOG0268 hwa HQ1460A mutS1 DNA mismatch repair protein
cHOG0268 hla HLAC0119 Hlac_119 DNA mismatch repair protein MutS
cHOG0268 hbo HBOR26580 Hbor_26580 DNA mismatch repair protein MutS
cHOG0268 hut HUTA0493 Huta_0493 DNA mismatch repair protein MutS
cHOG0268 hvo HVO_0552 HVO_0552 mutS-2 DNA mismatch repair protein MutS
cHOG0268 nph NP0538A mutS_1 DNA mismatch repair protein
cHOG0268 hhi 2963 mutS4 DNA mismatch repair protein MutS
MUSCLE (3.6) multiple sequence alignment


1460|gene|hwa       ------------------------------------------------------------
163|gene|nrc1       ------------------------------------------------------------
538|gene|nph        ------------------------------------------------------------
119|gene|hla        ------------------------------------------------------------
26580|gene|hbo      ------------------------------------------------------------
493|gene|hut        ------------------------------------------------------------
12532|gene|hma      MALDSPVGRRLGLVLQSRSNGVGPDQLFGDSELDHRAVCVIGEKGAERLAGPERGVVGRD
2963|gene|hhi       ------------------------------------------------------------
                                                                                

1460|gene|hwa       ------------------------------------------MTSQGIVDEFFSLRAETD
163|gene|nrc1       ---------------------------------------------MGIVDEFQALKAETD
538|gene|nph        -------------------------------------------MASGIVGEYFELRASAS
119|gene|hla        -------------------------------------------MPTGIVGEFLDLKAETD
26580|gene|hbo      ------------------------------------------MTSQGIVGEFLSLKSETD
493|gene|hut        -----------------------------------------MTEATGIVGEFFSLKDGAD
12532|gene|hma      SLCCVHTRPYGLWPLHWPRPAAATTLSRPAILTPRTATRTCMTEATGIVGEFLTLKEGTD
2963|gene|hhi       -----------------------------------------MTEATGIVGEFLTLKEGTD
                                                                  ***.*:  *.  :.

1460|gene|hwa       ADLLTMQCGDFYEFFAEDAELVSAELDLTVSEKSSHGSSYPMAGVPIDDLTPYLKSLVER
163|gene|nrc1       ADLLAMQVGDFYEFFAADARTVASVLDLQVSEKSNHGSSYPMAGVPVDDLTPYLAALVER
538|gene|nph        ADLLAMQMGDFYEFFGEDAELVGDELDLKVSERSSGGESYAMAGVPVDDLTPYLKALAER
119|gene|hla        ADILAMQCGDFYEFFADDAELVADELDLTVSQKSSHGSSYPMAGVPLSELTPYVKALVER
26580|gene|hbo      ADVLTMQCGDFYEFFGDDAEFVAEELDLKVSQKSSHGSSYPMAGVPVDDLTPYLKVLVER
493|gene|hut        ADLLAMQVGDFYEFFGADAETVADELDLQVSQKSSHGSSYPMAGVPVNELTPYLTALVER
12532|gene|hma      ADLLAMQCGDFYEFFAEDAEIVADELDLKVSQKSSHGSSYPMAGVPVDDLTPYVSALVER
2963|gene|hhi       ADLLAMQCGDFYEFFDEDAEIVADELDLKVSQKSSHGSSYPMAGVPVDDLTPYVSALVER
                    **:*:** *******  **  *.  *** **:.*. *.**.*****:.:****:  *.**

1460|gene|hwa       GYRVAVADQYEADDGDHYRRISRVVTPGTLLSTDDADARYIAAVVGDLTATETTETTIGI
163|gene|nrc1       GYRVAVAEQSETDAGDIEREIERVVTPGTLLASTDADPRYLAAVVRE------AGGDWGL
538|gene|nph        GYRVAVADQRETDDG-IVREVTRTVTPGTLVETDDADAKYLASVVYE-------NNEYGL
119|gene|hla        GYRVAVADQYETEDG-HAREITRVVTPGTLLETADDDARYLAAIVRE---GDDADGPYGL
26580|gene|hbo      GYRVAVADQFDGEDG-HYRKITRVATPGTLLETTDADARYIASMVAP-----DDDEGVGL
493|gene|hut        GYHVAVADQHETDDG-HARKVEKIVTPGTLLSTTDAGARYLAAIV--------EGEAWGL
12532|gene|hma      GYRVAIADQHETENG-HAREITRVVTPGTHLETGDESAQYLAAVVRE--ASRDSGDTYGI
2963|gene|hhi       GYRVAIADQHETENG-HAREITRVVTPGTHLETGDESAQYLAAVVRE--ASRDGGDTYGI
                    **.**:*:* : : *   * : . .**** : : * ...*:*::*             *:

1460|gene|hwa       AFAEVTTGSFFIGDVADAET---AYAELTRFDPVEILPGPTVRNADEFIRRLRDGTDATV
163|gene|nrc1       AFVDVTTGQFRVTRGADRAD---AVTELYRFAPAEVLPGPALRGDDDFLGVLRERTDATL
538|gene|nph        ALCDITTGQFRATAVSGPDAADRALTELYRFGPVELLPGPTVRTDDSFLGQLRERLDGRL
119|gene|hla        ALADVTTGRFLVTEVDDEGD---LRAELYRFDPAEVLPGPRVRNDDRLLGAVREDLSGSV
26580|gene|hbo      AFADVTTGQFFVTETDDADA---AYSELYRFSPVEILPGPTVRGDDDFLSRLRQDTDASL
493|gene|hut        AFADVTTGEFFVTQVADRDA---VFSELYRFDPAEVLPGPTVRADDEMIERLRERTDASV
12532|gene|hma      AATDVTTGQFQVTQLDDADAGE-ALTELYTFGPAEILPGPELRNDDEFLDRLRERTDAAL
2963|gene|hhi       AATDVTTGQFQVTQLDDADAGE-ALTELYTFGPAEILPGPELRNDDAFLDRLRERTDAAL
                    *  ::*** *      .        :**  * *.*:**** :*  * ::  :*:  .. :

1460|gene|hwa       SSFETAAFALGRARHILSEQFGTTAIDSLNI-ETDVAIQAAGAILTYIDETDTGVRAAIT
163|gene|nrc1       TLHDAGAFDAGRATHRVREQFGDGVIESLGVAADGPVVRAAGAAVGYIAAADEGVLASVS
538|gene|nph        TLHDTDAFAPGRARARLGEQFG-DAVDAVGL-ESDAAVAAAGAVIDYVDETGVGALASIT
119|gene|hla        SVFDAEAFAPGRAKHAVREQFGRETADSVGI-DSELALRAAGAVLGYVEETGAGVLASIT
26580|gene|hbo      SLFEADAFAPGRAKHVVRDQFGRETLESVGL-NSDLAVRAAGAVLRYVEETGTGVLQSMT
493|gene|hut        SLHATEAFAPGRARHRLREQFGTETIESVGIGDAESAIAAGGAVLSYVEETGQGVLASMT
12532|gene|hma      TLHDSASFEPGRASHTVREQFGSETVDSVGIGDQNVAVRAAGAVLSYVEDTGVGTLAAVT
2963|gene|hhi       TLHDSASFEPGRARHTVREQFGSETVDSVGIGEQSVAVQAAGAVLSYVEDTGVGTLAAVT
                    : . : :*  ***   : :***  . :::.:     .: *.** : *:  :. *.  :::

1460|gene|hwa       RLQPLDSDTHLALDTTTRRNLELTETMHGDRGGDGTLLGTIDHTATSAGHRQLREWIMRP
163|gene|nrc1       RIQPFGGGDHVELDATTQRNLELTETMTGGS--DGSLLATIDHTASAAGGRRLAAWVTRP
538|gene|nph        RLQAYAPDDHVSLDGTTQRNLELTEPMT-EG--GQTLFATVDHTETSAGRRLLEAWLKRP
119|gene|hla        RLTAYGDGDHVAVDATTQRNLELTETMRGDA--DGSLFETVDHTVTAAGGRLLREWITRP
26580|gene|hbo      RLQTYETRDHLELDATTQRNLELTETMHGET--DGTLVDAIDHTVTSPGGRLLREWLTRP
493|gene|hut        RLQRYGASDHVELDATTQRNLELTETMRGER--TGSLLDTIDHTVTSAGTRTLRAWLQRP
12532|gene|hma      RLQAYGERDHVDLDATTQRNLELTETMQGDS--SGSLFDTIDHTVTAAGGRLLQQWLQRP
2963|gene|hhi       RLQAYGERDHVDLDATTQRNLELTETMQGDS--SGSLFDTIDHTVTAAGGRLLQQWLQRP
                    *:       *: :* **.*******.*        :*. ::*** ::.* * *  *: **

1460|gene|hwa       QREQTEITRRLDCIEALVDAPLARERIAETLNGSYDLERLAARCISERADATDLLRIRET
163|gene|nrc1       TRDRAELDRRQAAVGALADAALARDALGDVLGEIYDLERLASRAASGRADATDLLRVRDT
538|gene|nph        LKRRGELDRRQQAVAALVGDALAREALRETLSSAYDLERVASRAVAGNADADALLRARET
119|gene|hla        RRDREELNRRLDAVEALASAALARDRLRETLGDAYDLERLAARATSGSAGARELLSVRDS
26580|gene|hbo      RRDRAELERRLDAVTALAREALSRERVRDTLDGVYDLERLAARAASGSAGASDLLSVRDT
493|gene|hut        RRSRETLDRRGDSVEALATEAMARERLRDVLGDAYDLERLASKAASGSADARDLRAAVDT
12532|gene|hma      RRNRAELQRRQSCVAALSEAAMARERIRETLSDAYDLERLAARATSGSADARDLRAVQET
2963|gene|hhi       RRNRTELQRRQSCVAALSEAAMARERIRETLSDAYDLERLAARATSGSADARDLRAVQET
                     . .  : **  .: **   .::*: : :.*.  *****:*:.. :  *.*  *    ::

1460|gene|hwa       LEILPTLDEVITESI-LSESPLMQVVTRPTEDTVKVVRDELTAALVDEPPKTIRDGGLFQ
163|gene|nrc1       LAALPDVADALTTTPELAESPARDVLARVDRAAAADVRAELADALADDPPKTLSEGGLLQ
538|gene|nph        LGLVEDLRTRIDDAARLSSSPVADLFADLDVEAVETLRSEL-EALADDPPKTLHEGGLIR
119|gene|hla        LALVPALADAVSGTA-LADSPVAAVLERIDRERAATLHDELADALAEDPPKTKTQGGLLR
26580|gene|hbo      LAVLPAVAEAIEESP-LSESPLSDVVARPDREAAAALHDELADALADDPPKTVRQGGLFK
493|gene|hut        LELFETVRAIVRETPTLAESPLSTWLDEPDPAAVATLAADLDAAIVDDPPGTITEGGIIR
12532|gene|hma      LALLGQVADAVTETERLAESPLADALDGADREAADSLAAELDSALVADPPGTVRQGGLFK
2963|gene|hhi       LALLGQVADAVTETERLAESPLADALDGADREAADALAADLDSALVADPPGTIRQGGLFK
                    *  .  :   :  :  *:.**    .       .  :  :*  *:. :** *  :**::.

1460|gene|hwa       YGYDAELDSLRERHETAQEWMDKLAQRETETHNLNHLSVDRNKTDGHYIQVGKSVADQVP
163|gene|nrc1       AGYDEALDELLAAHDEHRAWLDGLADREKDRLGITHLQVDRNKTDGYYIQVGNSETDAVP
538|gene|nph        RGYDEELDDLVERHETAVEWFETLAEREKRAHGLTHVTVDRNKTDGYYIQVGKSETDAVP
119|gene|hla        VGYDGELDELIARHEKANEWLDRLAEREKRQYGLSHVTVDRNKTDGYYIQVGKSAADGVP
26580|gene|hbo      HGYDDELDELIDSHEEAQRWIDTLADREKRQHSLSHVTVDRNKTDGYYIQVGKSVADQVP
493|gene|hut        EGYDAELDEVIDEHETALEWIETLPEREQREHGITHLSVDRNKTDGYYIQVGKSETGKVP
12532|gene|hma      RGHDDDLDEIIDEHEAALEWLETLPDREKERTGITHLSVDRNKTDGYYIQVGKSETDAVP
2963|gene|hhi       RGYDDDLDEIIDEHEAALEWLETLPDREKERTGITHLSVDRNKTDGYYIQVGKSETDAVP
                     *:*  **.:   *:    *:: *.:**    .:.*: ********:*****:* :. **

1460|gene|hwa       E----HYREIKTLKNSKRFKTDELAERERDILRLEESRGEMEYELFCELREQIADYAAML
163|gene|nrc1       DGEDGAYRRIKQLKNATRYTMAELDSHEREVLRIEAERAELERELFAALRERVGERAAVL
538|gene|nph        D----HYEGIKTLKNAERYVFEELREKEREILRLEEVRGDREYEAFCRLRESVADSAELL
119|gene|hla        E----HYREIKTLKNSKRFVTDELEEREREVLRLEEARGELEYELFEELRERVAADAELL
26580|gene|hbo      E----HYREIKTLKNSKRFVTEELEEREREILRLEESRGELEYELFCELRERVAAHAELL
493|gene|hut        D----HYENVKTLKNSERYTIAELTERERKIFRLEERRHDLERQCFEELREAVADHADLL
12532|gene|hma      E----KYQHIKTLKNSKRYTTPELDEKERDVLRLEERRHDMEYEHFQRLRARVAEHATLL
2963|gene|hhi       E----EYQHIKTLKNSKRYTTPELDEKERDVLRLEERRHDMEYEHFQRLRARVAEHATLL
                    :     *  :* ***: *:   ** ..**.::*:*  * : * : *  **  :.  * :*

1460|gene|hwa       QTVGQVIAEIDALHSLATHAVENDWTRPTLRADHTVDIEAGRHPVVEQTTEFVPNDLQLE
163|gene|nrc1       QDVGRALAEVDALVSLAEHAAANQWVRPELVAGDGLDIDAGRHPVVEQTTSFVPNDARFD
538|gene|nph        QRVGRTLAELDVYCSLAAHAAKHGWTRPELTDTRAIDIEAGRHPVVETEVQFVPNDLYLD
119|gene|hla        QDVGRAVAEIDALASLATHAAGNDWTRPELADERRLDVEAGRHPVVERTTDFVPNDLRLD
26580|gene|hbo      QDVGRVLAEIDVLSSLATHAAGNDWTRPTLTDPGPVRIEAGRHPVVERTTEFVPNDLYMD
493|gene|hut        QGVGQALAAVDVMAALATHAVRNDWTRPTLRDSRALDVEAGRHPVVEQTTEFVPNDLRMD
12532|gene|hma      QDVGRTLAELDAFASLAVHAVENDWARPAVVDGNELSIEAGRHPVVEQTTEFVPNDLYMD
2963|gene|hhi       QDVGRTLAELDAFASLAVHAVENDWTRPAVVDGNELSIEAGRHPVVEQTTEFVPNDLYMD
                    * **..:* :*.  :** **. : *.** :     : ::********  ..*****  ::

1460|gene|hwa       DERNFLLVTGPNMSGKSTYLRQVALITLLAHAGSFVPAAAATIGAVDGIYTRVGALDELA
163|gene|nrc1       ASRRFQVVTGPNMSGKSTYMRQVAVIVLLAQVGSFVPADAARIGLVDGIYTRVGALDELA
538|gene|nph        RERRFLLVTGPNMSGKSTYMRQAALITLLAQVGSFVPADSARVGLVDGVFTRVGALDELA
119|gene|hla        GERGFLIVTGPNMSGKSTYMRQAALIQLLAQAGSFVPARTATVGLVDGIYTRVGALDELA
26580|gene|hbo      DERGFLIVTGPNMSGKSTYMRQAALIVLLAQVGSFVPARSAEIGVVDGIYTRVGALDELA
493|gene|hut        DDRRFLIVTGPNMSGKSTYMRQAALIVLLAQIGSFVPARSAEVGLVDGIYTRVGALDELA
12532|gene|hma      DDRQFLIVTGPNMSGKSTYMRQAALITLLAQVGSFVPARSATVGLVDGIFTRVGALDELA
2963|gene|hhi       DDRQFLIVTGPNMSGKSTYMRQAALITLLAQVGSFVPARSATVGLVDGIFTRVGALDELA
                     .* * :************:**.*:* ***: ****** :* :* ***::**********

1460|gene|hwa       QGRSTFMVEMQELSKILHSASSESLVILDEVGRGTATYDGISIAWAATEYLSAAQSTAPS
163|gene|nrc1       GGRSTFMVEMEELSRILHAATSDSLVVLDEVGRGTATYDGISIAWAATEYLHN----EVR
538|gene|nph        QGRSTFMVEMEELSNILHSATEDSLVILDEVGRGTATYDGISIAWAATEYLHN----EVG
119|gene|hla        QGRSTFMVEMQELSNILHSATADSIVILDEVGRGTATYDGISIAWAATEYLHN----EVR
26580|gene|hbo      QGRSTFMVEMQELSNILHSATEDSLVILDEVGRGTATYDGISIAWAATEYLHN----EVR
493|gene|hut        QGRSTFMVEMQELANILHSATEDSLVILDEVGRGTATYDGVSIAWAATEYLSSAQSASPS
12532|gene|hma      QGRSTFMVEMQELSNILHSATEESLVILDEVGRGTATFDGISIAWAATEYIVN----SIQ
2963|gene|hhi       QGRSTFMVEMQELSNILHSATDESLVILDEVGRGTATFDGISIAWAATEYIVN----SIQ
                     *********:**:.***:*: :*:*:**********:**:*********:         

1460|gene|hwa       PRVLFATHYHELTTLADRIAGVSNVHVAAE-----------------ERNGDVTFLRTVE
163|gene|nrc1       ATTLFATHYHELTALADHLDAVVNVHVAAE-----------------ERDGAVTFLRTVR
538|gene|nph        AFTLFATHYHELTTLADELPRVENVHVAVA-----------GDPE--DGDGDVTFLRTVE
119|gene|hla        ARTLFATHYHELTTLADHLPRVENVHVAVD-----------------KRDGEVTFLRTVR
26580|gene|hbo      AKTLFATHYHELTSLADHLDRVANVHVAAE-----------------ERDGDVTFLRTVQ
493|gene|hut        PKTLFATHYHELTTLADHISGVENVHVAVDEPAGGTDGDDAGPPTTDGADDDVTFLRTVR
12532|gene|hma      SKTLFATHYHELTALGEELPTVENVHVAVD-----------GEPRSAGSDGDVTFLRTVR
2963|gene|hhi       SKTLFATHYHELTALGEELPAVENVHVAVD-----------GEPRSAEGDGDVTFLRTVR
                    . .**********:*.: :  * *****.                    :. ******* 

1460|gene|hwa       SGPADRSYGIHVAELAGVPDPVLTRARDVLSTLRADEAIEVETQNHADTSKRDQEENMTG
163|gene|nrc1       DGATDRSYGVHVAALAGVPEPVVDRARGVLDRLREENAVEAK-----------------G
538|gene|nph        PGATDRSYGVHVADLAGVPDPVVDRAGAVLSRLREDRAVEAK-----------------G
119|gene|hla        DGPTNRSYGVHVADLAGVPAPVVSRAGTVLDRLREEKAIEAK-----------------G
26580|gene|hbo      DGPTDRSYGIHVADLAGVPKPVVSRADNVLEKLREEKAIEAK-----------------G
493|gene|hut        DGPADRSYGVHVATLAGVPDPVVSRAREVLRKLRADEAIDVQ-----------------N
12532|gene|hma      DGPTDRSYGVHVADLAGVPEPVVDRSQAVLDRLRDDKAIEIR-----------------G
2963|gene|hhi       DGPTDRSYGVHVADLAGVPEPVVGRSQEVLDRLRDDKAIEIR-----------------G
                     *.::****:*** ***** **: *:  **  ** : *::                   .

1460|gene|hwa       GDK------------SAETTQQVVFDVNSGDIRVAE------------------------
163|gene|nrc1       ---------------SAGESVQAVFDVDSGGFVDDA------------------------
538|gene|nph        G--------------DSGEPVQTVFDLSDGQFRE--------------------------
119|gene|hla        GARGGGGERGGFTGTADGDTKQVVFDLSSGSFSESDDAESTAAGAP-------GSGGGRN
26580|gene|hbo      G--------------ESDKPVQAVFDLSSGEFSSGESSGQAGSGQTTDGQIVDGQTADGQ
493|gene|hut        G-R------------ASEETRQVVFDLDSGELRESD-----------------TDDSGT-
12532|gene|hma      SEQ------------NDGGTTQAVFDLDSGQFRDGA-----------------AQSGGAA
2963|gene|hhi       SER------------DDGGTTQAVFDLDSGQFRDGA-----------------AGSAGTS
                                       . *.***:..* :                            

1460|gene|hwa       --AGENTENQTTPTSDNRKLKNNRATSESPEHARVDSDTKEVINELQSLDIESTAPVTLL
163|gene|nrc1       ---------------------GDDGEAD-------DPEAAAVLDELRTVELAETSPVELL
538|gene|nph        ---------------------NGDNSADSDEGGELDPETAAVVDELADLEVDETPPVELM
119|gene|hla        GATPGSASDGASGSAGTAETAGAAETAESAETDRFDPETRAVIEELADVDVAETAPVELL
26580|gene|hbo      AASADQRESGVAAAQQSQHAANGESTAGRDASRATDPEADEILDELRETDVNSNAPLDIM
493|gene|hut        ---GDPPADQMGDTDETVG--GGTGPIVDQ----FGEDAPAVLEALESLDIEETPPVELL
12532|gene|hma      AGSTAEPVATDGDPEHAPGEAAAEGPKGDERAASLDSETEAVLSELTELDVNETPPVELM
2963|gene|hhi       ADSGTEPVAADGDPEHAPGESAAEGPAGDAAAASLDPETEAVLSELTELDVNETPPVELM
                                                       . ::  ::. *   :: ...*: ::

1460|gene|hwa       QTVEQWQKQLTDADNEE
163|gene|nrc1       GTVQAWQDRLED-----
538|gene|nph        ARVQEWQQRLDGE----
119|gene|hla        SRVQEWQERLDENR---
26580|gene|hbo      AKVQEWQRRLSDND---
493|gene|hut        GEVQEWQRRLE------
12532|gene|hma      AKVQEWQAELDDE----
2963|gene|hhi       AKVQEWQAELDDE----
                      *: **  *