HOG COG KOG arCOG COG description NRC-1 annotation
cHOG0070 COG1253 KOG2118 arCOG00626 Hemolysins and related proteins containing CBS domains hemolysin protein

HOG Organism Gene ID Name Annotation
cHOG0070 nrc1 VNG2308 hlp hemolysin protein
cHOG0070 hmu HMUK0885 Hmuk_0885 protein of unknown function DUF21
cHOG0070 hma RRNAC2600 hypothetical protein
cHOG0070 hwa HQ1961A CBS domain-containing protein
cHOG0070 hwa HQ3705A CBS domain-containing protein
cHOG0070 hla HLAC0407 Hlac_407 protein of unknown function DUF21
cHOG0070 hla HLAC2723 Hlac_2723 protein of unknown function DUF21
cHOG0070 hbo HBOR29400 Hbor_29400 CBS domain-containing protein
cHOG0070 hut HUTA1548 Huta_1548 protein of unknown function DUF21
cHOG0070 hvo HVO_0250 HVO_0250 hlp hemolysin protein
cHOG0070 nph NP1414A hypothetical protein
cHOG0070 hhi 3020 HAH_3020 CBS domain-containing protein

MUSCLE (3.6) multiple sequence alignment


2723|gene|hla       -----------------------------------------MDLTIALVGVAAILVLTGI
1961|gene|hwa       --------------MIIFVLTAIFETGITLDAVPIVG-FELGQAEFTALGVSMILLLIMG
1548|hp|hut         --------------MGISVLTAVAEASAYLYTVPIVG-IELSKTGVTAIGILMILFLIVG
407|gene|hla        --------------MGLSSSVPV----ADLLQIPMPG-----DGVVLALGVAAILFLIGL
3705|gene|hwa       -----MHQIRYRSRMGLSPSLSL----FIGQIAP----IPVTQTTITVAGAFVIFLLIGF
29400|gene|hbo      --------------MG----NAITFVNGVLQA------LPVNKATVTIGGVFAIIILIGL
2308|gene|nrc1      --------------MGISPP---------LTAVSAFG-TDVSGNAFIGAGVATLAVLMVL
1414|hp|nph         --------------MGTLASNAAL---VVLQAAPDAGELPFSDTTVALLGSAVILLLLAL
12600|hp|hma        MTSLNPRPLQLRPLMALDPPAPFELSTATLQAIEYAG-VTIPETVVLAGGTAVIVFLIVL
3020|gene|hhi       --------------MALDPPAPFELSTAMLQGVEIVG-VPIPKDAVLVAGIVTLVALVVL
                                                                 .   *   :  *   

2723|gene|hla       SAFFSSSELAVFSVPSHRVDSLLAADVPGARALSALRADSHRFLVTALVSNNVANIAAAS
1961|gene|hwa       SGFFSSSEIAMFSLSPYQIDAMIEEGKRGAQAIKSLKSDPHRLLVTILVGNNLVNITMSS
1548|hp|hut         SGFFSSSEIAMFSLGTHRIDPMVEQGLRGAKAIKSLKEDPHRLLVTILVGNNMVNITMSS
407|gene|hla        SAFFSSSEIAMFSLPQHRVDSLVDEGVKGAETIRGMKQNPHRLLVTILVGNNIVNVAMTS
3705|gene|hwa       SAFFSSSEIAMFSLAKHRVDALVDDNVPGADVVDELKSNPHRLLVTILVGNNIVNIAMSS
29400|gene|hbo      SAFFSSSEIAMFSLAKHRVDSLVEEGVPGAETVQSLKNDPHRLLITILVGNNIVNIAMSS
2308|gene|nrc1      SAFFSSSEIAMFSIPRHRIDALVEEGRSGADAVAALKSDPHRLLITILVGNNLVNIAMSS
1414|hp|nph         SGFFSSSEIAMFSLPAHRVDALVADNIPHAETLKELKDDPHRLLVTILVGNNIVNIAMSS
12600|hp|hma        SAFFSSSEIAMFSLPAHRTEALVEDGVPGAKTLKQLKADPHRLLVTILVGNNLVNIAMSS
3020|gene|hhi       SAFFSSSEIAMFSLPAHRTEALVEDAVPGAKTLKQLKSDPHRLLVTILVGNNLVNIAMSS
                    *.******:*:**:  :. :.::      * .:  :. :.**:*:* **.**:.*:: :*

2723|gene|hla       VATAVFVRFGFS-GGEAATGSTLVTSVFVIVFGEIAPKSYAVANAEKHALRVSRPVVAIQ
1961|gene|hwa       ISTT-IVGFYLD-AGIAVIVSSLGITSIVLLFGESAPKSYAVDNTDSWARTVAPLLKIVG
1548|hp|hut         ISTT-IVGFYFD-PGTAVLVSSFGITSLVLIFGETAPKSYAVDNTELHARRVAPVLQFVE
407|gene|hla        IATA-LFGIYLS-RGESVLATTFGITTLVLIFGESAPKSYAVENTESWALRIARPLKLSE
3705|gene|hwa       LATA-LVSLYFD-PGSAVLVSTFGITSLVLLFGESAPKSYAVEHTESWALRIARPLKYSE
29400|gene|hbo      LSTGLLVYIGFG-QGEAVAIATFGITALVLLFGESAPKSYAVENTESWSLRIARPLKISE
2308|gene|nrc1      IATG-LFATSMS-QGKAVLAATFGVTALVLLFGESAPKSYAVENSESWALSIAKPLQWSE
1414|hp|nph         IATG-LLSYYVS-QSMAVLIATFGITALVLLFGESAPKSYAVENTESWALRISRPLKLSE
12600|hp|hma        IATG-LLALYLS-QGQAVAVATFGITAIVLLFGESAPKSYAVENTESWALRISKPLKAAE
3020|gene|hhi       IATG-LLAMYVDSQGLAVAIATFGVTAVVLLFGESAPKSYAVENTESWALRISKPLKAAE
                    ::*  :.   ..  . :.  :::  : .*::*** ******* ::: .:  ::  :    

2723|gene|hla       RLIRPVLYIFEALSGVVNRFTGGESDIE-SYLTREEIETLVLSGEAAGALDPDEGAMIRG
1961|gene|hwa       KILWPLITLFYYLTRLINKITPGGSAIESSYVSRDDIRNMIKMGEREGILEEEERQILDR
1548|hp|hut         KLLWPLITLFHYVTQFVNKLTGGGPAIESSYLSRSEIREMIQTGEREGVLDEEERQMLQR
407|gene|hla        YALYPLVVLFDYIVKGINKIIGGSAAIESTYVTRDEIQDIIETGEREGVIEEEEREMLDR
3705|gene|hwa       YVLLPLVIVFDRLTRVINRITGGRSEIETSYVTRDEIQDLIQTGEREGVIEEDEREMLDR
29400|gene|hbo      YVLLPLVVIFDRLTRIVNRITGGRSAIETSYVTRDEIQDLIQTGEREGVIEEDEREMLDR
2308|gene|nrc1      RFLYPLVVLFDYLTRAVNQFTGGRAAIESSYVTRSEIQDMIKTGEREGVIEEDEREMLQR
1414|hp|nph         YLLLPLIVTFDYLTRQVNRITGGRAEIESTYVTREEIRDIIETGERAGVLEEDEREMLQR
12600|hp|hma        KVLLPLILLFDYLTRVVNKITGGRSAIETSYVTREEIQDIIETGEREGVLDEDEREMLQR
3020|gene|hhi       KVLLPLILLFDYLTRVVNKITGGRSAIETSYVTREEIQDIIETGEREGVLDEDEREMLQR
                      : *::  *  :   :*.:  * . ** :*::*.:*  ::  **  * :: :*  ::  

2723|gene|hla       VLDLESTRVSAVMVSRTDMVALPDTATPAEAVSTAAAEGVTRMPVYSQNRDDVVGVVDLR
1961|gene|hwa       TLRFTDATAKEIMTPRLDMSAISGQLTVEKAIEECIQSGHARIPVYEESLDNIVGIFDIR
1548|hp|hut         TLRFNRTIAKEVMTPRLDMDAISADSSVEEAIAECVHSGHTRLPVYEGGLDNVIGVVNIR
407|gene|hla        IFRFNNTIAKEVMTPRLDVTAVAKESSVEEAIETCIQADHERVPVYEGNLDNIIGVVTVR
3705|gene|hwa       IFQFNQTIVKEVMTPRLDMTAVAKDATIEEAIETCVHSDHERIPVYDGNLDNVIGTVTVR
29400|gene|hbo      IFRFNQTIAKEVMTPRLDMTAVPKDATLDEAIETCIQSDHERVPVYDGNLDNVIGIVNIR
2308|gene|nrc1      IFRFNNTIAKEVMTPRLDVVAVSKTDTIEAAIQTCTQAGHERVPVYDGELDNVIGVVSLE
1414|hp|nph         IFRFTNTIAKEVMTPRLDMEAVPKDATIEEAIQTCVQSGHARVPVYEGSLDNVLGVVHIS
12600|hp|hma        TLRFNDTIVKEVMTPRLDMTAVAKDNTVEEALETCIQSGHARIPVYEGSLDNVIGVIHIR
3020|gene|hhi       TLRFNDTIAKEVMTPRLDMTAVAKEASAEEALETCIQSGHARIPVYEGSLDNVIGVIHIR
                     : :  : .. :*..* *: *:.   :   *:  .   .  *:***.   *:::* . : 

2723|gene|hla       DAIGANERGE------PLASALHEPTFVPETQPVDELFATMRSSALRMAIVVDEFGAVVG
1961|gene|hwa       EL--DDTDSA--FDDLTVAEVLNTTLHVPESKNVDELLSEMREDRMRMVIVIDEFGTTEG
1548|hp|hut         DLVRDAQYGG--TDDVELQDLIEPTLHVPESKNVDDLLTEMRSERLHMVIVIDEFGTTEG
407|gene|hla        DLVRELRYSE---GEPSLERVVKPTLHVPESKNADELLAEMQDNRLQMVTVIDEFGTTEG
3705|gene|hwa       DLVREKHYGE---GDIPLTDIVQPTLHVPESKNVDELLTEIQDNRLRMVIVIDEFGTTEG
29400|gene|hbo      DLVREQFYGE---GGGDLADIVQPTLHVPESKNVDELLTEIQDNRLQMVIVIDEFGTTEG
2308|gene|nrc1      DLVRESLYGE--TEDAELDDLIEPTLHVPESKNVDDLLQEMQDERVQLVVVIDEFGTTEG
1414|hp|nph         DLVRDDTYGE--ASDIELEDVIEETLHVPESKNIDELLAEMRENRLHMVIVIDEFGTTEG
12600|hp|hma        DLVRDLNYGEALARDMDLEDLIEPTLHVPESKNVDDLLTEMRAERLHMVIVIDEFGTTEG
3020|gene|hhi       DLVRDLNYGEAVARNMDLEDLIEPTLHVPESKNVDDLLTEMRAERLHMVIVIDEFGTTEG
                    :       .        :   :  . .***::  *:*:  :. . :.:. *:****:. *

2723|gene|hla       IVTLEDVLEEIVGELVGGWETDHVDVVAPDAAVARGWTTVAHLNETLGLDLPIDGGTETV
1961|gene|hwa       LITMEDVTEEIVGEILMDDEEHPIEFVNDNEVLLRGELNIHEVNEILDIDLPEGEEFETI
1548|hp|hut         LVTMEDLTEEIVGEILEGEEEHPIEFVNDDTVTVKGEVNIEEVNEALSIDLPEGEEFETI
407|gene|hla        IITLEDMVEEIVGEILEGDEEAPVEFLEDNVAVVQGEVNIDEVNEMLGIDLPEGEEFETL
3705|gene|hwa       LVTLEDMVEEIVGDILEDDEEEAFEFIDTNETLVRGEVNIDEVNDELELDLPEGEEFETL
29400|gene|hbo      LITLEDMVEEIVGDILEDDEEEAFEFINERETLVRGEVNIDEVNEVLELDLPEGEEFETL
2308|gene|nrc1      LLTAEDITEEIVGEILDGGEDLPIDSVDEDTVRVRGEVNIEEVNEALNIDLPEGEEFETI
1414|hp|nph         LVTMEDITEEIVGEILQADEEEPIEFVSDNEILVKGEVNIDEVNEKLDVAIPEGEEFETI
12600|hp|hma        LVTMEDLTEEIVGEILEGEEEEPIEYVDDDTVTVKGEVNIEEVNEALDLDLPEGEEFETI
3020|gene|hhi       LVTMEDLTEEIVGEILEGEEEEPIEYVDDDTVTVKGEVNIEEVNEALDLDLPEGEEFETI
                    ::* **: *****:::   *   .: :       .*  .:  :*: * : :* .   **:

2723|gene|hla       AGLVTRQLGRVPAEGDRVEIGDVTLAVTGATATRVTRVRV--------------------
1961|gene|hwa       AGFIFNLAGRLVEQGEEFDYENVTLRTEQLENTRIQKVRI--------------------
1548|hp|hut         AGFIFNRAGRLVEEGESIEYEGIQIRVEQVENTRIMKARI--------------------
407|gene|hla        AGFVFNRAGRLVEEGEEIEFDEIRIRIERVDNTRIMSARV-------------------T
3705|gene|hwa       AGFIFNRVGRLVEEGEQITFDNITICIEQVDNTRIMKARVITPDSSTTKEYSESQSESQP
29400|gene|hbo      AGFIFNRAGRLVEEGEEIEYDSVMIRIEQVDNTRIMKARI--------------------
2308|gene|nrc1      AGFIFNRAGRLVEEGEAFRFEDIELTVEHVENTRIMKARV--------------------
1414|hp|nph         AGFIFNRAGRLVEEGERIEYEGLEIRVERVENTRIMKARI--------------------
12600|hp|hma        AGFIFNRAGRLVEEGETITYDGVEIRVEQVENTRIMKARV--------------------
3020|gene|hhi       AGFIFNRAGRLVEEGETITYDGVEIRVEQVENTRIMKARV--------------------
                    **:: .  **:  :*: .    : :       **:  .*:                    

2723|gene|hla       -----EHPGIGTEGESGSDISGSPDATGDDTE-------
1961|gene|hwa       --------------------TVNPNTTGMTE--------
1548|hp|hut         ---TRPEEGATLESEAEGG-DDDHESDTNDA--------
407|gene|hla        VLDGAEAADVVAEDDALES-SGEPEAPPNDAE-------
3705|gene|hwa       GLETAPPDDNIADSDS-DS-RPESDSDEDLTSTQDIEAE
29400|gene|hbo      ----------TVDNTDDES-EAEAEIDPSNAS-------
2308|gene|nrc1      -----QRLDSAAEADANAE-VLTADSTGNGA--------
1414|hp|nph         -----------TKTDNREG-DGEHEAAGDE---------
12600|hp|hma        ---------VRLQTDETEP-VDAEEASD-----------
3020|gene|hhi       ---------VRLETDETES-VDAEEVTD-----------
                                            :