HOG COG KOG arCOG COG description NRC-1 annotation
cHOG0634COG1491arCOG04130Predicted RNA-binding proteinhypothetical protein

HOG Organism Gene ID Name Annotation
cHOG0634 nrc1 VNG1168 vng1168 no entry
cHOG0634 hmu HMUK2672 Hmuk_2672 Protein of unknown function DUF655
cHOG0634 hma RRNAC0258 hypothetical protein
cHOG0634 hwa HQ2894A hypothetical protein
cHOG0634 hla HLAC2103 Hlac_2103 Protein of unknown function DUF655
cHOG0634 hbo HBOR12150 Hbor_12150 predicted RNA-binding protein
cHOG0634 hut HUTA0622 Huta_0622 Protein of unknown function DUF655
cHOG0634 hvo HVO_2747 HVO_2747 Predicted RNA-binding protein
cHOG0634 nph NP3680A hypothetical protein
cHOG0634 hhi 1002 HAH_1002 hypothetical protein
MUSCLE (3.6) multiple sequence alignment


1168|chp|nrc1      -----------------------------MSDTSTTDANSDSGADG----------DSES
2103|gene|hla      -----------------------------MTDPETDAPAAD---------------DGPV
2894|hp|hwa        MDDVGASNLLPASLFKCTLGVHIVITMTESQDADANSVDKSDQSDGRNTSADAET-DIRH
12150|hp|hbo       -----------------------------MTSAESGDADPDTSVDETENGADATTEDAEF
10258|hp|hma       -----------------------------MTESESGG-------------------DTMM
1002|hp|hhi        -----------------------------MTESESGG-------------------DSMM
3680|hp|nph        -----------------------------MSDDGDDT-------------------AART
622|hp|hut         -----------------------------MSSDESDT-------------------DVAT
                                                  .                            

1168|chp|nrc1      AVVLDFLAHGRSDG--RSYGDGAVGYAVSTIDFTLYELSLS-S-DADIGILDRVQVRP-E
2103|gene|hla      VVLLDVLPNGRPDDDRPQYRKSPVAYGLGTDAFRLYELTLD-E-EADVSVSHRLALD---
2894|hp|hwa        AVILDHLPHGRADNNRPRHQRDPLAYGLGERDFRLYELTLDADADIDISIGDRVVVGPTE
12150|hp|hbo       AVVLDHLPHGQPDDDRPRYKKSPLAYALGERDFRLFELRLG-D-ESDISIGDRVVVFPSE
10258|hp|hma       AAVLDVLPHGRPGDDRPQHQKEPLAYALDIDEFYIYELQLA-DKDADIAFGDRIDL----
1002|hp|hhi        AAVLDVLPHGRPGDDRPQHQKEPLAYALDVEEFYIYELQLD-DKDVDIAFGDRVDL----
3680|hp|nph        AVVLDFLPRGSPSDDRPQYEKSPVAFAVGEEDFQLVEAALT-D-DAGINIGDRIEIDP--
622|hp|hut         AVVLDHLPRGRSDDDRPQYEKEPLAYAVATADFQLLEMTLI-D-DADISFGDIIDLD---
                   ..:** *..* ...    :   .:.:.:    * : *  *  . : .: .   : :    

1168|chp|nrc1      FETGIDRGREVEYDDLSDGARSELDYVVEDVVDANERRFVDFYNDAQPISLRLHQLNLLP
2103|gene|hla      -GPAVGRYREVSFDDLTRNAAAEIEYAVEDIVEGDEKRFVDFYNEAGPITLRLHQLNLLP
2894|hp|hwa        ARELVDEFRQISYADLSNTAQNEVEYAVEDIIKTNERRFVDFYNDAQPITLRLHQLNLLP
12150|hp|hbo       ERDAVEELREVEYDDLSNTALSELEYVVEDIVEENERRFVDFYNDAQPITLRLHQLNLLP
10258|hp|hma       --TEFGRVTEVEFEDIPSGARSELDYAVEEVVEANEQRFVDFYNDAQPITLRLHQLNLLP
1002|hp|hhi        --TEFGRVTEVEFEDIPSGARSELDYAVEEIVEANEQRFVDFYNDAQPITLRLHQLNLLP
3680|hp|nph        VGDNVKDLRTVEYRDLTSTAESELEYAIESIIDADEAQFVDFYNDAQPITTRLHVLNLLP
622|hp|hut         -GEVIETRRTVDYGDLSSGAQSELDYAIEDIVEDNEGRFVDFFNDAQPITTRLHSLNLLP
                       .     :.: *:.  *  *::*.:*.::. :* .****:*:* **: *** *****

1168|chp|nrc1      GIGEKLRDNILEERKRRGPFESFADVSERVDGLHTPQEVIVERILDEIRDPDLKYYNFAA
2103|gene|hla      GIGKQLRNKVLDERK-RGPFESFEEVSERVTGLHHPREVLVERIVEEIHEEDLKYRRFVG
2894|hp|hwa        GIGKKLRNNILDQRK-RGPFESFDEIEERVAGLHHPSSVLKDRILEELHEDDLKYNVFVA
12150|hp|hbo       GIGKKLRNNILDERK-RGPYESFEDLEDRVSGLHHPRDVLVERIMEELRDDDLKYKTFVG
10258|hp|hma       GIGKKLRNTILDERK-RKPFESFEDLEKRVSGLHSPKETLVERILEELQDDDLKYRLFVR
1002|hp|hhi        GIGKKLRNTLLDERK-RKPFESFEDLEERVSGLHSPKETLVERILEELQDDDLKYRLFVR
3680|hp|nph        GIGKKLRNNVLDARK-RQPFEDFDDIEERVSGLHDPKGVLIERITEELQEDDLKYRIFAR
622|hp|hut         GIGKKLRNGILDERK-RKPFESFEELTERVDGLHNPKEVLVDRILEEIREEDLKYRTFAR
                   ***::**: :*: ** * *:*.* :: .** *** *  .: :** :*:.: ****  *. 

1168|chp|nrc1      S-----
2103|gene|hla      YEE---
2894|hp|hwa        PSE---
12150|hp|hbo       RDT---
10258|hp|hma       REEQSS
1002|hp|hhi        REEQSS
3680|hp|nph        RDDE--
622|hp|hut         RDD---