HOG COG KOG arCOG COG description NRC-1 annotation
cHOG0165COG0086KOG0262 KOG0260 KOG0261arCOG04257/1DNA-directed RNA polymerase beta' subunit/160 kD subunitDNA-directed RNA polymerase subunit A

HOG Organism Gene ID Name Annotation
cHOG0165 nrc1 VNG2664 rpoA DNA-directed RNA polymerase subunit A
cHOG0165 hmu HMUK0513 Hmuk_0513 DNA-directed RNA polymerase subunit A'( EC:2.7.7.6 )
cHOG0165 hma RRNAC2428 rpoA DNA-directed RNA polymerase subunit alpha
cHOG0165 hwa HQ3394A rpoA DNA-directed RNA polymerase subunit alpha
cHOG0165 hla HLAC0105 Hlac_105 DNA-directed RNA polymerase subunit A'
cHOG0165 hbo HBOR28220 Hbor_28220 DNA-directed RNA polymerase, subunit A'( EC:2.7.7.6 )
cHOG0165 hut HUTA0782 Huta_0782 DNA-directed RNA polymerase subunit A'
cHOG0165 hvo HVO_0349 HVO_0349 rpoA1 DNA-directed RNA polymerase subunit A'
cHOG0165 nph NP0114A rpoA1 DNA-directed RNA polymerase subunit alpha
cHOG0165 hhi 2860 rpoA' DNA-directed RNA polymerase subunit A'
MUSCLE (3.6) multiple sequence alignment


114|gene|nph        -MQGQAPKKIGSLSFGLMDPEEYRNMSATKIITADTYDDDGFPIDMGLMDPRLGVIDPGL
782|gene|hut        MSTGQSPKSIGRLSFGLMNPEEYRDMSATKVITADTYDDDGFPIDMGLMDPRLGVIDPGL
2664|gene|nrc1      MSAGQAPKEIGEISFGLMDPEEYRDMSATKVITADTYDDDGFPIDMGLMDPRLGVIDPGL
12428|gene|hma      MSSQQTPQEIGQLSFGLMDPEEYREMSATKVITADTYDDDGFPIDMGLMDPRLGVIDPGL
2860|gene|hhi       MSSQQTPQEIGQLSFGLMDPEEYREMSATKVITADTYDDDGFPIDMGLMDPRLGVIDPGL
105|gene|hla        -MSMQTPKVLGGIDFGLMDPETYRDMSATKVITADTYNDDGYPIDMGLMDPRLGVIDPGL
3394|gene|hwa       -MSTQTPKEIGAIQFGLMDPETYRDMSATKVITADTYDDDGYPIDMGLMDPRLGVIDPGL
28220|gene|hbo      ---MQTPKEIGAIQFGLMDPETYRDMSATKVITADTYDDDGYPIDMGLMDPRLGVIDPGL
                        *:*: :* :.****:** **:*****:******:***:******************

114|gene|nph        ECKTCGKHSGSCNGHFGHIELAAPVIHVGHAKLIRRLLRGTCRKCGRLTLTDPSDREFRR
782|gene|hut        ECKTCGKHSGSCNGHFGHIELAAPVIHVGFSKLIRRLLRGTCRECSRLTAVDGSK-----
2664|gene|nrc1      ECKTCGQRSGGCNGHFGHIELAAPVIHVGFSKLIRRLLRGTCRECASLLLT---------
12428|gene|hma      ECKTCGKHSGSCNGHFGHIELAAPVIHVGFAKLIRRLLRGTCRECSRLCLD---------
2860|gene|hhi       ECKTCGKHSGSCNGHFGHIELAAPVIHVGFAKLIRRLLRGTCRECSRLCLD---------
105|gene|hla        ECRTCGSHSGSCNGHFGHIELAAPVIHVGFTKLIRRLLRSTCRECGKLALD---------
3394|gene|hwa       ECKTCGSHSGSCNGHFGHIELAAPVIHVGFTKLIRRLLRSTCRNCGHLALT---------
28220|gene|hbo      ECRTCGSHSGSCNGHFGHIELAAPVIHVGFTKLIRRLLRSTCRDCGRLSLT---------
                    **.***..**.******************.:********.***.*. *            

114|gene|nph        EYFKERDDNPPRNWFVSPEDARKAQEKAKQAAERKMAQFREELERTREIGEDPSNVLKAA
782|gene|hut        ---------------YTDGDGNVDLEAASAAAEDKKREFKEELRRARELSNDPSDVLKSA
2664|gene|nrc1      --------------------------------EEEKDEYRENLDRTRSLRQDVSDVMTAA
12428|gene|hma      --------------------------------EHERDEFADRLTRTQDLGRDLNDVTKAA
2860|gene|hhi       --------------------------------EHERDEFADRLTRTQDLGRDLNDVTKAA
105|gene|hla        --------------------------------EEQRDEFRDRYERAKELGNDEHDVLKAA
3394|gene|hwa       --------------------------------ESEREEFREGLQRNEELGEDWTDVLKSA
28220|gene|hbo      --------------------------------EEEAEEFRDGLTRTEELGEDWTDVLKSA
                                                    * :  :: :   *  .:  *  :* .:*

114|gene|nph        IRQARDADYCPHCTESDDYRERYPTAQYDISHEKPSTYLEQGPVPDGDVRDLLAEALEGG
782|gene|hut        IRQARSASRCPHCGEK----------QFDVKHEKPTTYYEIIEVPHAELAERLSMAMDP-
2664|gene|nrc1      IREARKKDHCPHCGEV----------QYDVKHEKPTTYYEVQQVLASDYSERIAASMQPD
12428|gene|hma      IRQARKKDRCPFCGEK----------QYDIKHEKPTTYYEVQDVLASDYPERIAAAME--
2860|gene|hhi       IRQARKKDRCPFCGEK----------QYDIKHEKPTTYYEVQDVLASDYPERIAAAME--
105|gene|hla        VRQARKASTCPFCGEP----------QADIKHEKPTTYYEVQDVLSGDYSERIAAAMQPD
3394|gene|hwa       VRQARKTNVCPSCGET----------QHDIKHEKPTTYYEVQDVLAGDYPEKIASAMQPD
28220|gene|hbo      VRQARKASRCPHCGAP----------QHDIKHEKPTTYYEVQDVLAGDYSERIANAMQPD
                    :*:**. . ** *             * *:.****:** *   *  .:  : :: :::  

114|gene|nph        ---EGGTEIELNELVETVNDVIKRWGGDASVTVSDISRLMSGQGPGDSERIGREHVSLLQ
782|gene|hut        ---PEGEPVTPDELEEAT---------DGGFDAGRIRELLSDT-----YRPDRNDIPALQ
2664|gene|nrc1      -EDEDDAGVSPQELAEQT-----------DIDISRINEILSGE-----FRPRREDREAIE
12428|gene|hma      ---ESEDPISPIELSEET-----------GIDSSRVQEILSGE-----FRPRQEDRKALE
2860|gene|hhi       ---ESEDPISPIDLSEET-----------GIDSSRVQEILSGE-----FRPRQEDRRALE
105|gene|hla        -EEEDDPGTSPQELAEKT-----------DIDLERINEIMAGE-----FRPRKEDRRAIE
3394|gene|hwa       PDDDSDSGLSPNSLADAT-----------GINVERINNILSGE-----FRPVGEDRKELE
28220|gene|hbo      EDDEDDTGMAPSELADQT-----------GIALQRVNQILSGE-----FRPVGDDRKKLE
                                .* : .           ..    :  :::.       *   :    ::

114|gene|nph        IIGEVMSSSGAFGGRFENAPAPD----FLLSYSGDKLMPSNVRDRFENIPDEDIETLGID
782|gene|hut        EIGSALHRFNDLEEHLGDELDIESPSSFLLEEDMDKLMPSDIRDWFEEIPDDDLEAIGVN
2664|gene|nrc1      T-----------------AIGAD-----LTTEDMNKLMPSDIRDWFEDIPGEDLEALGVN
12428|gene|hma      K-----------------ALSVD-----LTEEDMNKLMPSDIRDWFEDIPDEDIEVLGMK
2860|gene|hhi       K-----------------ALSVD-----LTEEDMNKLMPSDIRDWFEDIPDEDIEVLGMK
105|gene|hla        K-----------------ALDID-----LTEEDMNKLMPSDIRDWFEDIPDEDLEVLGID
3394|gene|hwa       S-----------------ALDID-----LTEEDMNKLMASDIRDWFESIPDEDIRTLGID
28220|gene|hbo      K-----------------ALDVD-----LTEEDMNKLMPSDIRDWFEDIPDEDIVTLGID
                                          :     *   . :***.*::**.**.**.:*: .:*:.

114|gene|nph        PERSRPEWMILTVLPVPPVTARPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
782|gene|hut        PETSRPEWMILTVLPVPPVTARPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
2664|gene|nrc1      SDRSRPEWMILTVLPVPPVTARPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
12428|gene|hma      PARSRPEWMILTVLPVPPVTARPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
2860|gene|hhi       PARSRPEWMILTVLPVPPVTARPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
105|gene|hla        SEHSRPEWMILTVLPVPPVTTRPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
3394|gene|hwa       PSRARPEWMVMTVLPVPPVTARPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
28220|gene|hbo      PSRARPEWMILTVLPVPPVTARPSITLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAP
                    .  :*****::*********:***************************************

114|gene|nph        QLIIEDLWELLQYHVTTFMDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
782|gene|hut        QLIIEDLWELLQYHVTTFMDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
2664|gene|nrc1      QLIIEDLWELLQYHVTTFMDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
12428|gene|hma      QLIIEDLWELLQYHVTTFMDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
2860|gene|hhi       QLIIEDLWELLQYHVTTFMDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
105|gene|hla        QLIIEDLWELLQYHVTTFVDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
3394|gene|hwa       QLIIEDLWELLQYHVTTFIDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
28220|gene|hbo      QLIIEDLWELLQYHVTTFIDNEISGTPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRV
                    ******************:*****************************************

114|gene|nph        NFSARTVISPDPTLSLNEVGVPDRVASEMTQTMIVNERNLDDARRYVANGPESHPGANYV
782|gene|hut        NFSARTVISPDPTLSLNEVGVPDRVASEMTQTMNVNERNLAEARQYVSNGPEAHPGANYV
2664|gene|nrc1      NFSARTVISPDPTLSLNEVGVPDRVATEMTQTMVVNEQNLERARRYVRNGPEGHPGANYV
12428|gene|hma      NFSARTVISPDPTLSLNEVGVPDRVAKEMTQTMNVTERNLEEARRYVSNGPEAHPGANYV
2860|gene|hhi       NFSARTVISPDPTLSLNEVGVPDRVAKEMTQTMNVTERNLEKARRYVSNGPEAHPGANYV
105|gene|hla        NFSARTVISPDPTLSLNEVGVPDRVAMEMTQTLNVTERNVEEARQYVRNGPEAHPGANYV
3394|gene|hwa       NFSARTVISPDPTLSLNEVGVPERVAREMTQTMNVTARNVDRAQQYVRNGPEGHPGANYV
28220|gene|hbo      NFSARTVISPDPTLSLNEVGVPERVAREMTQTMNVTERNLEQAQQYVRNGPEGHPGANYV
                    **********************:*** *****: *. .*:  *..** ****.*******

114|gene|nph        KRPDGRRLKVTEKNCEELAEKVNPGWEVARHLIDGDIIIFNRQPSLHRMSIMAHEVVVMP
782|gene|hut        RRPDGRRLKVTEKNCEELAEKVEPGWEVARHLLDGDIVIFNRQPSLHRMSIMAHEVVVMP
2664|gene|nrc1      TRPDGRRVRVTEKVCEELAERVEPGWEVQRHLIDGDIIIFNRQPSLHRMSIMAHEVVVMP
12428|gene|hma      KRPDGRRLKVTEKNCEELAEKVEAEWEVSRHLVDGDIIIFNRQPSLHRMSIMAHEVVVMP
2860|gene|hhi       KRPDGRRLKVTEKNCEELAEKVEAEWEVSRHLVDGDIIIFNRQPSLHRMSIMAHEVVVMP
105|gene|hla        RRPDGRRLKVTEKNCEELAEKVEADWEVNRHLVDGDIVIFNRQPSLHRMSIMAHEVVVMP
3394|gene|hwa       KRPDGRRLKVTEKNCEELAEKIDAGWEVNRHLIDGDIVIFNRQPSLHRMSIMAHEVVVMP
28220|gene|hbo      KRPDGRRLKVTEKNCEELAEKVQAGWEVNRHLIDGDIVIFNRQPSLHRMSIMAHEVVVMP
                     ******:.**** ******.::. *** ***:****:**********************

114|gene|nph        YKTFRLNTTVCPPYNADFDGDEMNMHALQNEEARAEARVLMRVQEQILSPRFGENIIGAI
782|gene|hut        YKTFRLNTVVCPPYNADFDGDEMNMHALQNEEARAEARVLMRVQEQMLSPRFGENIIGAI
2664|gene|nrc1      YKTFRLNTVVCPPYNADFDGDEMNMHALQNEEARAEARVLMRVQEQILSPRFGENIIGAI
12428|gene|hma      YKTFRLNTTVCPPYNADFDGDEMNMHALQNEEARAEARVLMRVQEQMLSPRFGENIIGAI
2860|gene|hhi       YKTFRLNTTVCPPYNADFDGDEMNMHALQNEEARAEARVLMRVQEQMLSPRFGENIIGAI
105|gene|hla        YKTFRLNTVVCPPYNADFDGDEMNMHALQNEEARAEARVLMRVQEQILSPRFGGNIIGAI
3394|gene|hwa       YKTFRLNTVVCPPYNADFDGDEMNMHALQTEESRAEARVLMRVQEQLLSPRFGENIIGAI
28220|gene|hbo      YKTFRLNTVVCPPYNADFDGDEMNMHALQTEEARAEARVLMRVQEQILSPRFGQNIIGAI
                    ********.********************.**:*************:****** ******

114|gene|nph        QDHISGLYLLTNQNPKFNETQALDLLRATSIDELPDPDGQDEEGVDYWTGRSIFSELLPD
782|gene|hut        QDHITSVYLLTHQNPHFNETQALDLLRATNVDELPEPDG-EEDGRAYWTGRSIFSELLPD
2664|gene|nrc1      QDHISGTYLLTNDNPRFNETQASDLLRQTRIDELPAAAGTDEDGDQYWTGHQIFSELLPD
12428|gene|hma      QDHISGTYLLTHTNPKFNETQALDLLRATRIDELPEADGVDEGGEEYWTGRSLFSELLPD
2860|gene|hhi       QDHISGTYLLTHTNPKFNETQALDLLRATRIDELPEADGVDEDGEEYWTGRSLFSELLPD
105|gene|hla        QDHISGTYLLTHSNPEFTETQALDLLRATRVDELPEADGVDDEGREYWTGRTLFSELLPD
3394|gene|hwa       QDHISGTYLLTNGNPQFSETQALDLLRATSIDTLPPAAGQTEDGSPYWSGRQLFSELLPD
28220|gene|hbo      QDHISGTYLLTNGNPHFTETQALDLLRATSIDTLPEPAGEEEDGQPYWTGRQLFSELLPD
                    ****:. ****: ** *.**** **** * :* ** . *  : *  **:*. :*******

114|gene|nph        GLSLEFTSSAGDTVEIKDGQLISGTIDEDAVGAFGGEIVDTICKVYDNTRARVFVNEVAT
782|gene|hut        DLDLEFTSEAGDDVVIEDGQLLEGTIDDSAVGAFGGEIVDTIAKVYDKTRARIFINEVST
2664|gene|nrc1      DLSLEFTGTTGDTVVIEDGQLLEGTIADDEVGEYGSEIVDTITKVHGNTRARIFINEVAS
12428|gene|hma      DLNLEFTSSAGDTVVVEDGQMTGGTIDEDAVGAFGGEIVDTIAKDYSRTRARILVNEVSA
2860|gene|hhi       DLNLEFTSSAGDTVVVEDGQMTGGTIDEDAVGAFGGEIVDTIAKDYSRTRARILVNEVSA
105|gene|hla        DLSLHFSSSTGDDVIIEDGQLIEGTIDEDAVGAFGGEVVDTLTKEYGETRSRVFINEIAS
3394|gene|hwa       NLNLEFTSAAGDTVEIVDGELIAGTIDENAVGAFGGEVVDTLAKVHSKTRARVFVNEVAS
28220|gene|hbo      DLNLSFTSSTGDDVIIEDGQLIEGTIDEDAVGAFGGEVVDTLAKVYSKTRSRVFINEVAS
                    .*.* *:. :** * : **::  *** :. ** :*.*:***: * :. **:*:::**:::

114|gene|nph        LAMRSIMHFGFSIGIDDESIEREAEEQIQEAIDNAYERVETLIETYRRGELESLPGRTVD
782|gene|hut        LAVRTIMHFGFSIGIDDESIPPAAQEQVEEAIGSAYDRVQELIETYEQGDLESLPGRTVD
2664|gene|nrc1      LAMRSIMHFGFSIGIDDETVSTEARERIDEAIQSAYDRVQELIETYENGDLESLPGRTVD
12428|gene|hma      LAMRSIMHFGFSISIDDESIPREAEEQINDAIDSAYDRVEELIETYDRGELESLPGRTVD
2860|gene|hhi       LAMRSIMHFGFSISIDDESIPREAEEQINDAIDSAYDRVEELIETYDRGELESLPGRTVD
105|gene|hla        LAMRAIMHFGFSIGIDDESIPPEAEEQVDDAIESAYDRVQELIATYEAGELESLPGRGVD
3394|gene|hwa       LAMRAIMHFGFSIGIDDESIPPAANEQVDEAIEGAYDQVNEFIDVYEDGELESLPGRTVD
28220|gene|hbo      LAMRAIMHFGFSIGIDDESIPQEAGEQVDEAIDSAYERVEELIEIYEAGELESLPGRTVD
                    **:*:********.****::   * *.:::** .**:.*: :*  *  *:******* **

114|gene|nph        ETLEMKIMQTLGKARDSAGDIADDHFDDDNPAVVMAESGARGSMLNLTQMAGCVGQQAVR
782|gene|hut        ETLEMKIMQTLGQARDSAGDIAEDHFAEDNPAVVMADSGARGSMLNLTQMAGCVGQQAVR
2664|gene|nrc1      ETLEMKIMQTLGKARDSAGDVAEENFDEDNPAVVMANSGARGSMLNLTQMAGCVGQQAVR
12428|gene|hma      ETLEMKIMQTLGKARDSAGDIAEDHFGEDNPAVIMAESGARGSMLNLTQMAGCVGQQAVR
2860|gene|hhi       ETLEMKIMQTLGKARDSAGDIAEDHFGEDNPAVIMAESGARGSMLNLTQMAGCVGQQAVR
105|gene|hla        ETLEMKIMQTLGKARDSAGDIADQHFGDDNPAVVMARSGARGSMLNLTQMAGSVGQQAVR
3394|gene|hwa       ETLEMKIMQQLGKARDSAGEIAEDHFEDDNPAVVMARSGARGSMLNLTQMAGCVGQQAVR
28220|gene|hbo      ETLEMKIMQQLGKARDSAGEIADDHFTDDNPAVVMARSGARGSMLNLTQMAGCVGQQAVR
                    ********* **:******::*:::* :*****:** ***************.*******

114|gene|nph        GERINRGYEDRTLSHFKPNDLSADAHGFVENSYRNGLTPKEFFFHAMGGREGLVDTAVRT
782|gene|hut        GERINRGYEDRTLSHFEENDLSAEAHGFVEHSYREGLGPKEFFFHAMGGREGLVDTAVRT
2664|gene|nrc1      GERINRGYEDRTLSHFAPNDLSSEAHGFVENSYTSGLTPKEFFFHAMGGREGLVDTAVRT
12428|gene|hma      GERINRGYENRTLSHFEENDLSADAHGFVEASYRSGLGPKEFFFHAMGGREGLVDTAVRT
2860|gene|hhi       GERINRGYENRTLSHFEENDLSADAHGFVEASYRSGLGPKEFFFHAMGGREGLVDTAVRT
105|gene|hla        GERINRGYEDRTLSHYRPNDLSSEAHGFVENSYRSGLTPEEFFFHAMGGREGLVDTAVRT
3394|gene|hwa       GERINRGYEGRTLSHYQPDDLSAEAHGFVENSYRGGLTPREFFFHAMGGREGLVDTAVRT
28220|gene|hbo      GERINRGYEGRTLSHYQKEDLSAEAHGFVENSYRGGLTPREFFFHAMGGREGLVDTAVRT
                    *********.*****:  :***::****** **  ** * ********************

114|gene|nph        SKSGYLQRRLINALSELEAQYDGTVRDTSDTVVQFEFGEDGTSPVRVSSSEEFDI-DVES
782|gene|hut        SKSGYLQRRLINALSELETQYDGTVRDTSDTVVQFEFGEDGTSPVDVSSNEDFDI-DVEA
2664|gene|nrc1      SKSGYLQRRLINALSELETQYDGTVRDTSDTIVQFEFGEDGTSPVQVSSNEEVDI-DVEH
12428|gene|hma      SKSGYLQRRLINALSELETQYDGTVRDTSDTIVQFEFGEDGTSPVDVSSNQDGDIVDVEQ
2860|gene|hhi       SKSGYLQRRLINALSELETQYDGTVRDTSDTIVQFEFGEDGTSPVDVSSNQEGDIVDVEQ
105|gene|hla        SKSGYLQRRLINALSELEAQYDGSVRDTSGRIVQFEFGEDGTSPVKVSSGEGDGI-DVDD
3394|gene|hwa       SKSGYLQRRLINALSELEAQYDGTVRDTSGNVVQFEFGEDGTSPVKVSSNKDTPI-DVES
28220|gene|hbo      SKSGYLQRRLINALSELEAQYDGTVRDTSGTIVQFEFGEDGTSPVKVSSGEGDGI-DVEN
                    ******************:****:*****. :************* ***.:   * **: 

114|gene|nph        ITDRILDEEFESELEREEFLGIEEPKTNLSEHGESWRADTVQ-EVESDD
782|gene|hut        ITERIVEAEFDDESEKATFLEREASPTNLSEHADEWWH------AEAGD
2664|gene|nrc1      VADRILNSEFDSDTQKAEFLEVEEPPTNLSEHGAAW-------EVESDD
12428|gene|hma      IADSVLDEEFESEKQKESFLGTRTERTNLSEHADDWWM------AEGDD
2860|gene|hhi       IADSVLDEEFESEKQKESFLGTRTERTNLSEHADDWWM------AEGDD
105|gene|hla        IVDRVVDSEFDSDDEKERFLGERTPPTNLSEHSGPGLNKAS--GVESDD
3394|gene|hwa       VADRILESEFESDEEKAQFLGRQERPTNISEYAGPGLDKARNLGVESDD
28220|gene|hbo      ITDRVLDAEFSSSEEKERFLGRREPPTNLSEYAGPGLDKAQDTGVQSDD
                    :.: ::: **... :.  **      **:**:.           .:..*