HOG COG KOG arCOG COG description NRC-1 annotation
cHOG0087COG1194KOG2457arCOG00462A/G-specific DNA glycosylaseA/G specific adenine glycosylase.repair protein

HOG Organism Gene ID Name Annotation
cHOG0087 nrc1 VNG1520 mutY A/G specific adenine glycosylase.repair protein
cHOG0087 hmu HMUK2201 Hmuk_2201 HhH-GPD family protein( EC:3.2.2.- )
cHOG0087 hmu HMUK0448 Hmuk_0448 HhH-GPD family protein( EC:3.2.2.- )
cHOG0087 hma RRNAC0987 mutY A/G-specific adenine glycosylase
cHOG0087 hwa HQ2481A mutY A/G-specific adenine glycosylase
cHOG0087 hla HLAC2002 Hlac_2002 HhH-GPD family protein
cHOG0087 hbo HBOR14100 Hbor_14100 hypothetical protein
cHOG0087 hbo HBOR14130 Hbor_14130 A/G-specific DNA glycosylase( EC:3.2.2.- )
cHOG0087 hut HUTA2939 Huta_2939 HhH-GPD family protein
cHOG0087 hvo HVO_2896 HVO_2896 mutY A/G-specific adenine glycosylase
cHOG0087 nph NP4222A mutY A/G-specific adenine glycosylase
cHOG0087 hhi 1614 mutY A/G-specific adenine glycosylase
MUSCLE (3.6) multiple sequence alignment


2481|gene|hwa       -------------MSGGQNSDAEASIKCIDTDIDIDVFRNRLISWYEAEHREFPWRETDD
1520|gene|nrc1      ------------MTTGDGSDRAGATG-----PADTTALQTALVDWYTDSHRSFPWRETTD
14100|hp|hbo        ------------------------------------------------------------
14130|gene|hbo      --------------MTDGHASSLETGDATALPEDVDDVRDALVSWYEADHRDFPWRRTDD
2002|gene|hla       ----------MTDSDATAVAGAGVDGDAPELPADLDAVRDALVDWYEADHREFPWRRTED
4222|gene|nph       -----------------------MEDGPDTLPADVDAVRAALIEWYETDHRSYPWRETED
2939|gene|hut       -----------------------MSEALENLPADQGAIQRALIEWYQDDHREYPWRETDD
10987|gene|hma      -----------------MTDQSTATDRPEGVPADPSAVQNALVEWYEADHRSYPWRETTD
1614|gene|hhi       MGGNDRHAQIGTGLNGGMTDQSTATDRHGAVPADPSAVQDALVEWYEADHRSYPWRETTD
                                                                                

2481|gene|hwa       PYAILVSEVMSHQTQLDRVVEAWKDFIQRWPTVKALAGDSQSAVVTFWSEHALGYNNRAS
1520|gene|nrc1      PYEILVSEVMSQQTQLSRVIDAWRAFLDRWPTTAALAAADRSDVVGFWSAHSLGYNNRAT
14100|hp|hbo        -----------------------MDFSH--------------------------------
14130|gene|hbo      PYEILVSEVMSQQTQLGRVVEAWEDFLDEWPTAADLAAADRSDVVSFWSGHSLGYNNRAK
2002|gene|hla       PYEILVSEVMSQQTQLDRVVPAWEDFVEEWPTTEELAEADRGGVVAFWSDHSLGYNNRAK
4222|gene|nph       PYEILVSEVMSQQTQLDRVVEAWHAFLDEWPTAEALAEADRAAVVGFWTDHSLGYNNRAK
2939|gene|hut       PYAILVSEVMSQQTQLDRVVDAWDDFLDRWPTVADLADADRADVVGFWSDHSLGYNNRAK
10987|gene|hma      PYEILVSEVMSQQTQLDRVVDAWEDFLDRWPTAAALAEADRSDVVGFWTSHSLGYNNRAK
1614|gene|hhi       PYEILVSEVMSQQTQLDRVVDAWEDFLDRWPTAAALAEADRSDVVGFWTSHSLGYNNRAK
                                             *                                  

2481|gene|hwa       YLHEAANQVVDEYDGTVPADPDELLSLMGVGPYTANAVASFAFNNGDAVVDTNVERVLYR
1520|gene|nrc1      HLHEAAQQVETDYDGAIPRTPAELSELMGVGPYTANAVASFAFNAGNAVVDTNVKRVLYR
14100|hp|hbo        ----------------------------------------------------------YR
14130|gene|hbo      YLHEATRQVIEEYDGEFPRSPDELSELMGVGPYTANAVASFAFNNGDAVVDTNVKRVLHR
2002|gene|hla       YLHEAAGQVEGEYGGTFPETPEELQELMGVGPYTANAVASFAFDNGDAVVDTNVKRVLHR
4222|gene|nph       YLHEAARQVRDEHDGEFPRTPDGLQELMGVGPYTANAVASFAFNNGDAVVDTNVKRVLYR
2939|gene|hut       YLHEAATQIVEEYDGAFPESPDELSELMGVGPYTANAVASFAFNNGDAVVDTNVKRVLYR
10987|gene|hma      YLHEAAGQVVDDYDGEWPRDPDGLSDLMGVGPYTANAVASFAFNNGNAVVDTNVKRVLYR
1614|gene|hhi       YLHEAAGQVVDDYDGEWPRDPDGLSDLMGVGPYTANAVASFAFNNGNAVVDTNVKRVLYR
                                                                              :*

2481|gene|hwa       VFKQIRQADDPPYEQIASALLPVERSRTWNNAIMELGGVACKKTPRCDEANCPWRQWCHA
1520|gene|nrc1      AFEGIRDDDDPDYRPLANELLPDGTSRVWNNAVMELGAVACQQTPRCDEAECPLREWCHA
14100|hp|hbo        AFD--VPDDDDAFERVAQFAMPEGESKVWNNAIMELGGVACEKTPTCDESGCPWRRWCHA
14130|gene|hbo      AFAEIHNADDPDYETVANTLMPPGESRIWNNAIMELGGVACGKKPRCDEASCPWREWCHA
2002|gene|hla       AFA--VPDDDAAFAQVASDVMPDGESRIWNNAIMELGGVACGTTPRCDEAGCPWRRWCHA
4222|gene|nph       AFSELHDKEEPPYQHIADELLPKGRSRVWNNAIMELGAVACGKTPRCDEAGCPWREWCDA
2939|gene|hut       AFS--IPDEDAAFEDAASTLMSEGESRVWNNAIMELGGVACEKTPRCDAAGCPWREWCDA
10987|gene|hma      AFD--VPDDDSAFETAAGTLMPAGQSRVWNNAIMELGGVACEKTPDCDGAQCPWREWCSA
1614|gene|hhi       AFD--VPDDDSEFEAAASKLMPGGQSRVWNNAIMELGGVACEKTPDCDGAQCPWREWCSA
                    .*      ::  :   *   :.   *. ****:****.***  .* ** : ** * ** *

2481|gene|hwa       YQTGDFTAPDVPTQPSFEGSRRQFRGRIVRTLGEHGELALDTLGHRIRVDYSP--EGTHG
1520|gene|nrc1      YQTGDFTAPDVPTQPSFEGSRRQFRGRIVRLLGEHDEMELDALGHRVRVDYTP--DGEYG
14100|hp|hbo        YETGDFTAPDVPTQPSFEGSRRQFRGRVVRTLGEYDELELDELGPRIRVDYG----GDYG
14130|gene|hbo      YQTGDFTAPDVPTQPSFEGSRRQFRGRVVRLLGEHDEMDLDTLGHRIRVDYTP--DGEHG
2002|gene|hla       YETGDFTAPDVPEQPSFEGSRRQFRGRIVRLLGEYDELALDDLGPRVRVDYSP--DGEHG
4222|gene|nph       YDTGDFTAPDVPTQPSFEGSRRQFRGRVVRALNEYGELPIDELGPRIRVDYSP--DGEYG
2939|gene|hut       YANGDFSAPDVPEQSTFEGSRRQMRGRVIAALKEHDHLAIDNLGPKVRVDYAPEADAEAD
10987|gene|hma      YETGDFTAPDVPTQPEFEGSRRQMRGRVISALKEYDDLRLDKLGPRVRVDYAP--EGEYG
1614|gene|hhi       YETGDFTAPDVPTQPEFEGSRRQMRGRVISALKEYDELQLDQLGPRVRVDYAP--EGEYG
                    * .***:***** *. *******:***::  * *:. : :* ** .:****     .  .

2481|gene|hwa       RDWLYEIVTDLADDGLVNIQTSTAGTTETESASMSQSDGSE-------TLSSIFVSLQS
1520|gene|nrc1      REWLRGLLSDLADDGLVRTE---------------YRGERT------------VVRLE-
14100|hp|hbo        REWLRELVSDLSADGLVNVE---------------KRDDDT------------VVRLRE
14130|gene|hbo      REWLRGLLSDLADDGLVQIE---------------EGNDQT------------IARLQ-
2002|gene|hla       REWLRGLVDDLADDGLVAIE---------------ERAGADEGRSADDGASEVVVSLRR
4222|gene|nph       RAWLRSLLEDLSDEGMVQLT---------------ERDGAV------------IARLQR
2939|gene|hut       REWLRDLLEDLADDGLVEIE---------------EGSDQP------------IARLR-
10987|gene|hma      REWLRGLLDDLADDGLVDVE---------------VGDGET------------VARLQR
1614|gene|hhi       REWLRGLLEDLADDGLVDVE---------------NSDGEA------------VARLQR
                    * **  :: **: :*:*                                    .. *