HOG COG KOG arCOG COG description NRC-1 annotation
cHOG0284COG0582arCOG01241Integraseintegrase/recombinase

HOG Organism Gene ID Name Annotation
cHOG0284 nrc1 VNG0838 ssrA integrase/recombinase
cHOG0284 hmu HMUK2505 Hmuk_2505 integrase family protein
cHOG0284 hma RRNAC0375 ssrA integrase/recombinase
cHOG0284 hwa HQ2614A xerC tyrosine recombinase xerC
cHOG0284 hla HLAC1216 Hlac_1216 integrase family protein
cHOG0284 hbo HBOR16030 Hbor_16030 site-specific recombinase XerD
cHOG0284 hut HUTA0154 Huta_0154 integrase domain protein SAM domain protein
cHOG0284 hvo HVO_1620 HVO_1620 ssrA integrase/recombinase
cHOG0284 nph NP3640A tyrosine recombinase
cHOG0284 hhi 1247 xerC integrase/recombinase
MUSCLE (3.6) multiple sequence alignment


838|gene|nrc1       -MSEPRDSLARETDADARDDPIGYFLEDMQYHGKAKRTRDAYARVLRSFEAFLSNAGAQP
3640|gene|nph       MSSDNR------TDESAAVDPVAYFLQDMTFHGRTERTRDAYERVLRDFEQFIETRRGTT
154|gene|hut        -MSTGT------AIPEEVEDPIEYFLTDITYQGKSQRTSDAYERVLRDFEASLRENGEKP
10375|gene|hma      -MSTDT------ATPDAESDPVGYFLDDITYHGKSDRTRASYERVLRDFESYLAD-GHQP
1247|gene|hhi       -MSTDT------TTPDAESDPVGYFLDDITYHGKSDRTRASYERVLRDFESYLAD-GHQP
1216|gene|hla       -MSSTR-------APDEVDDPIGYFIEDLTYHGKTERTRDSYERVLRRFESFL-DHDMTP
2614|gene|hwa       -MSVNT------ADGEVVHDPVGYFIQDMTYHGKSTQTRDAYNRVLRRFETFLTEESRDA
16030|gene|hbo      -MAVET------ADGEAVEDPVAYFLQDMTYHGKTERTRDAYERVLRRFEAFLTDPEANP
1620|gene|hvo       -MAAER--------PADAEDPVGYFLQDMTYHGKTDRTRDAYERVLRAFESFLGDPSRNP
                      :                **: **: *: ::*.: .*  :* **** **  :      .

838|gene|nrc1       VD-----------------AGRRDCLTWVHHLRE-QYAESTVATYASYVNRFYSYMQQVG
3640|gene|nph       LD----------------AADRRACMAWVHSLRD-EHAPSTVASYASYVHRFYAYMTQVG
154|gene|hut        LA----------------VASHRDCMAWIHSQRE-ELSKSTVATYASYLHRFYAYMTQIG
10375|gene|hma      LP--------------VREASHRDCMAYVHSLRG-DVEESTVATYASYLHRFFAYMTQVG
1247|gene|hhi       IP--------------VQEASHRDCMAYVHSLRG-NVEESTVATYASYLHRFFAYMTQVG
1216|gene|hla       AS-----------------ATHRDCMAFVHSLRG-DAADSTVATYAAYLHRFYGYMTEVG
2614|gene|hwa       SDQTSPTEIDSTATSNIARATHRDCMAWIHSLRQTETADSTIATYASYLHRFYAYMSQIG
16030|gene|hbo      MD----------AEKRVSEASHRDCMAWIHSLRRADAADSTIATYASYLHRFYAYMTQVG
1620|gene|hvo       AG----------TARSVDDATHRDCMAWIHTLRNRDVAPSTVATYASYLHRFYAYMAQVG
                                       * .* *::::*  *  :   **:*:**:*::**:.** ::*

838|gene|nrc1       VVDGNPMALVSQEMDEAIDTNPTRRDISVAAMREFIADITHPLERAVIVGLLKTGLRVGE
3640|gene|nph       ELESNPMALVTEEMDESIDTDPARRDVSVPEMRAFVTSLSHPLERALVGTLLKTGMRVGE
154|gene|hut        VFDSNPMAIVVEEMDERIDADPARRDVSIPEMQGFVSGIDHPLDRAVIVTLLKTGMRVGE
10375|gene|hma      AFESNPMTLVMEEMDESINTDPTRREIDLQSMRAFVDSVSHPLERAIIVTLLKTGMRAGE
1247|gene|hhi       VFESNPMTLVMEEMDESINTDPTRREIDLQSMRTFVDSVPHPLERAIVVTLLKTGMRAGE
1216|gene|hla       VFEGNPMTLVMEEMDETVDKDPARRDVSIPAMREFVAGIRHPLHRALVVALLKTGMRVGE
2614|gene|hwa       AFESNPMTLVIAEMDERIDTNPSRREISLTDMRSFLADIHHPLEHAVIMTLIKTGIRAGE
16030|gene|hbo      AFEANPMTLVMEEMDEQIDANPSRREISISGMRAFLREVHHPLDHAVMVTLLKTGMRVGE
1620|gene|hvo       AFDSNPMSLVMEEMDERIDTNPARRELSVPQMRGFVRDIAHPLDHALVVTLLKTGMRVGE
                     .:.***::*  **** :: :*:**::.:  *. *:  : *** .*::  *:***:*.**

838|gene|nrc1       LCNLDLRDVNLDAPAVSSAYEV----GVRGQLGSHPDTLYVDPAMSRGETVNGEQRSASN
3640|gene|nph       CCNLDVRDLHLDDDEVQAAFDV----PHRGRLEGRPDSVYVSSEPARGEVYNGEERNASN
154|gene|hut        LCNLDQRDLHLSDD-PRESVTT------RPALDGRPDSLFVASEPSRGATVNGEQRAASN
10375|gene|hma      LCNLDIRDLNLETPGPRPEIPV------RAGLDGRPDSLFVASDPARGEASNGEERTASN
1247|gene|hhi       LCNLDIHDLNLETPGPRPEIPV------RAGLDGRPDSLFVAADTARGEASNGEERTASN
1216|gene|hla       LCNLDLRDLAIDDEEVRETFAL----GTRPALADRPNSLYVTPDATVGEELNGEARTASN
2614|gene|hwa       LCNLDLRDIVLTDRKALDEVSTSNSQPLRAHLEGRGPSMYVSDAPAVGVVVAGEERTASN
16030|gene|hbo      LCNLDCRDVSLADDTPLGDVSN------RAQLDGRGPSLYVSAEPAAGGTTNGERRTASN
1620|gene|hvo       LCNLDLRDVSLSAETPLDDVST------RAALDGRGPSIYVAADPSVGSVTNGEERTASN
                     **** .*: :                 *  * ..  :::*    : *    ** * ***

838|gene|nrc1       KRKRATLVPLDRELKTELVRWLAIRPDTTSPADPVFVSTQDAWGKRLTPEMVRGMVRRHA
3640|gene|nph       KRKRDTVVPVDSELKRLLKAWLAVRPDTTSAAEPLFCSTSE-WGDRATPSMVRSMVAEQA
154|gene|hut        KRRRDTVVPVDADLAETLRAWLDVRPDPVSSARPLFTSTTDNWGQRITTDGVRHIVTSHA
10375|gene|hma      KRKRETTIPVDAELASVLRRWLAIRPDTQSPAAPLFVSTSDSWGERVTPNMVHHIVESHA
1247|gene|hhi       KRKRETTIPVDAELASVLRHWLAIRPDTQSPAAPLFVSTSDSWGERLTPNMVHHIVESHA
1216|gene|hla       KRKRATVIPIDDELMGTLKRWLAVRPDGPSSADPLFVGTAEGWGERLDPQAVRHVVERYA
2614|gene|hwa       KRQRSTTIPIDEELDWALRRWLAIRPQTTSPAQPLFVSTTGQWGKRLTPHMVHHIVETHA
16030|gene|hbo      KRKRSTVVPVDEELEAVLTRWLAVRPDVTSPAEPLFVSTSGSWGNRLTTDMVHHLVERHA
1620|gene|hvo       KRKRSTVVPVDDELEAVLLRWLAIRPDSPSPAEPLFVGTASGWGRRLEPDAVHHVVTAHA
                    **.* * :*:* :*   *  ** :**:  *.* *:* .*   ** *  .  *. :*   *

838|gene|nrc1       EPRGWYHKGGDAAENVTPHYFRHFFTTHLRNRTGDRGVVKYLRGDVADDIIDTYTHNWGD
3640|gene|nph       RERGWYRQGGDSAENVTPHYFRHFFTTHLRDRTGDRGVVKYLRGDVASDVIDTYTHNWGD
154|gene|hut        SVHGWYDNGGGVEENVTPHYFRHFFTTHLRDRTGDRGIVKYLRGDVAQDIIDTYTHDWGD
10375|gene|hma      REFGWYTDGGGAEENVTPHYFRHFFTTHLRDRTGDRGVVKYLRGDVAQDVIDTYTHEWGT
1247|gene|hhi       REFGWYTDGGGAEENVTPHYFRHFFTTHLRDRTGDRGVVKYLRGDVAQDVIDTYTHEWGT
1216|gene|hla       REAGWYRTGGGAAENVTPHYFRHFFTTHLRDRTGDRGVVKYLRGDVADDVIDTYTHNWGD
2614|gene|hwa       KDVGWHNDGGDATENVTPHYFRHFFTTHLRDRTGDRGIVKYLRGDVGGDIIETYTHNWGD
16030|gene|hbo      AERGWHRAGGDAEENVTPHYFRHFFTTHLRDRTGDRGIVKYLRGDVGGDIIDTYTHNWGD
1620|gene|hvo       ADAGWYRAGGDREENVTPHYFRHFFTTHLRDRTGDRGIVKYLRGDVAGDIIDTYTHNWGD
                       **:  **.  *****************:******:********. *:*:****:** 

838|gene|nrc1       RVREVYLNHIYSLTTTHGGG----
3640|gene|nph       RVRTVYEANIYQLLP---------
154|gene|hut        RVRDVYEDHIYELPVAAYRTMSET
10375|gene|hma      RVRDVYEAEIYSLL----------
1247|gene|hhi       RVRDVYEAEIYSLL----------
1216|gene|hla       QVRETYEANVYQLLR---------
2614|gene|hwa       RVRETYEEHIYTIL----------
16030|gene|hbo      RVRETYEQHIYSLF----------
1620|gene|hvo       RVRETYESNIYSLR----------
                    .** .*   :* :