HOG       COG      KOG  arCOG       COG description            NRC-1 annotation
cHOG0333  COG1685  -    arCOG01025  Archaeal shikimate kinase  hypothetical protein

HOG       Organism  Gene ID    Name        Annotation
cHOG0333  nrc1      VNG1245    vng1245     no entry
cHOG0333  hmu       HMUK2348   Hmuk_2348   shikimate kinase (EC:2.7.1.71)
cHOG0333  hma       RRNAC0753  aroK        shikimate kinase
cHOG0333  hwa       HQ2762A    aroK        shikimate kinase
cHOG0333  hla       HLAC1876   Hlac_1876   shikimate kinase
cHOG0333  hbo       HBOR19080  Hbor_19080  shikimate kinase (EC:2.7.1.71)
cHOG0333  hut       HUTA3025   Huta_3025   shikimate kinase
cHOG0333  hvo       HVO_1323   HVO_1323    shikimate kinase
cHOG0333  nph       NP3982A    aroK        shikimate kinase
cHOG0333  hhi       1403       aroK        shikimate kinase

MUSCLE (3.6) multiple sequence alignment


1245|chp|nrc1       -------------------MDGRAVAPAAGTVLNALATGVGAAFALDIDTEASVSVTP--
2762|gene|hwa       -------------------MHGRAVALGAGTVLNALATGIGAAFALDVETTATVDITPEE
3982|gene|nph       ----------------METMEGRAAAPAAGTVLNALANGYGSAFAIDAYTEATVTLD---
10753|gene|hma      MTATGVSQLVNRSVWEYTSMEGKAAAPAAGTVLNALATGRGAAFAIDEYTTATVTLST--
1403|gene|hhi       -------------------MEGKAAAPAAGTVLNALATGRGAAFAIDEYTTATVTLST--
3025|gene|hut       -------------------MEGTAVAPAAGTVLNALATGTGSAFAIDAELRAEVTLE---
1876|gene|hla       -------------------MEGRAAALGAGTVLNALATGTGAAFGIDVETRATVTLNP--
19080|gene|hbo      -------------------MHGRAVALGAGTILNALSTGVGSAFAIDAETTATVELD---
1323|gene|hvo       -------------------MHGRATALGAGTVLNALATATGSAFAIDAETTASVELD---
                                       * * *.* .***:****:.. *:**.:*    * * :    

1245|chp|nrc1       --SESGVSGEIAGHPEADTALVERCVSRVIDRYGDGQ--------GGHVRTESEVPLAAG
2762|gene|hwa       TLDSTRVNGTIAGVENGDTRLIERCVELAIETYGPGD-----MQYIAHIQTESDIPMAAG
3982|gene|nph       --DTGSVTGEIADAPDADTRLIERCVEAVVERFGDGQ--------GGHVRTESDVPMASG
10753|gene|hma      --ETDGVTGEVDGAPDADTRLIERCVEYVIDAHGGPQNVGVPAVSGGTVRTESEVPMASG
1403|gene|hhi       --ETDGVTGEVDGAPDADTRLIERCVEYVIDAHGGPQNVGVPAVSGGTVRTESEVPMASG
3025|gene|hut       --ADGGVSGTIAGEPDADTALIEHCVELVTERFGDGE--------GGSVRTESDIPLAAG
1876|gene|hla       --DADGVRGTIAEDPDGDTALIEHCVALATERWGDGE--------GGTVRTDSDVPLAAG
19080|gene|hbo      --DSESVRGSVAEDADADTRLIERCVELAVERYGDDE-------YGGTVRTESDVPMAAG
1323|gene|hvo       --DSGEVRGSIAEAPDADAHLVERCVELAIEAFGDGE--------GGTVETESDLPMAAG
                          * * :    :.*: *:*.**  . :  *  :         . : *:*::*:*:*

1245|chp|nrc1       LKSSSAAANATVLATLDALG---------VADEVDRVDAARLGVQAARDAGVTVTGAFDD
2762|gene|hwa       LKSSSAAANASILATADAFD-----------ITPEPISACRLGVRAARDVGVTVTGAFDD
3982|gene|nph       LKSSSAAANATVLATLAALD-----------VDLDREAACRIGVDAARDVGVTVTGAFDD
10753|gene|hma      LKSSSAAANATVMATLDALD---------ATERMSREDMARLGVMAARDVGVTVTGAFDD
1403|gene|hhi       LKSSSAAANATVMATLDALD---------ATERMSREDMARLGVMAARDVGVTVTGAFDD
3025|gene|hut       LKSSSAAANATVLATLSALD---------ADSAVSRVDAARIGVQAARDAGVTATGAFDD
1876|gene|hla       LKSSSAAANATVLATCDALGLAVGDDDADPSVDVTRLDACRLGVRAAREAGVTVTGAFDD
19080|gene|hbo      LKSSSAAANATVLATADALG-----------VEPDREEACRLGVQAARDVGVTITGAFDD
1323|gene|hvo       LKSSSAAANATVLATLSALGRSVG---PGPDADISRLDACRMGVRAAREVGVTATGAFDD
                    **********:::**  *:.                   .*:** ***:.*** ******

1245|chp|nrc1       AAASMLGGVAMTDNREDDLLFRDAVEWHAAVWTPPERAYSADADVARCERVSGLAEHVAA
2762|gene|hwa       ASASMLGGVTVTNNNEDKLLAHDTVEWDVVVWTPPERAYSADADAERCANIAPMAELVED
3982|gene|nph       ASASMLGGVTVTDNTSDELLQRDEPDWDVLVYTPDERAFSADADADRCERIAPMADVVYE
10753|gene|hma      ASASMLGGVTVTDNEDDALLAREEIDWDVLVWTPPEQSFSADADVERCRQIAPMARLVED
1403|gene|hhi       ASASMLGGVTVTDNEDDALLAREEIDWDVLVWTPPEQSFSADADVERCRQIAPMARLVED
3025|gene|hut       ASASMLGGVTVTNNHEDELLARETVEWDVLVWTPPERAYSAEADIERCQQVAPMAELVAD
1876|gene|hla       ATASMLGGVTVTDNDADELRSRETVDWDVLVWTPPERAYSADADVTRCEAVAPMADLVAD
19080|gene|hbo      ASASMLGGATVTNNDDDELLSREPVEWDVLVWTPDERAYSADADVESCANVAPMAELVAE
1323|gene|hvo       AAASMVGGVVVTDNTEDGLIARDEVDWDVLVWTPPERAYSADADVSRCENVAPMADLVAD
                    *:***:**..:*:*  * *  .:  :* . *:** *.::**:**   *  :: :*  *  

1245|chp|nrc1       LAAAGDYGTAMTVNGLAFCAALDFPTAPAVTALPHAAGVSLSGTGPSYVAV---------
2762|gene|hwa       LALTNQYTEAMTVNGLAFSAALGFDPAPAVEVMPYASGVSLSGTGPSVVAVADRNSESK-
3982|gene|nph       LARDGDYERAMCVNGFAFCAALGYPTDPLVEALP-AAAASLSGTGPSYTAV---------
10753|gene|hma      LALDGDYQRAMTVNGLAFSAALDFETDPVLDALQHVEGVSLSGTGPSFTAV---------
1403|gene|hhi       LALDGDYQRAMTVNGLAFSAALDFETDPVLDALQHVEGVSLSGTGPSFTAV---------
3025|gene|hut       LALDGAYERAMTVNGLAFTAALGFSAEPLLEAMPHVAGVSLSGTGPSVTAV---------
1876|gene|hla       LALDGRYAEAMTVNGLAFSAALGFDADPAVEAMPHATGVSLSGTGPSVVAVAD-------
19080|gene|hbo      LALDGRYAEAMTVNGLAFSAALDFPTDPAVEAMPHAAGVSLSGTGPSVVAVAERGSAAHQ
1323|gene|hvo       LALEGRYAEAMTVNGLAFSAALDFPTDPAVEAMPIADGVSLSGTGPSVVAV---------
                    **  . *  ** ***:** ***.: . * : .:  . ..******** .**         

1245|chp|nrc1       --------GDEDGIEEVSTRWHENPGTVRETTTQLAGARTT------
2762|gene|hwa       ------GESDEVDLAVVTRRWRSRGGTVWKTTTRATREESSTRPDET
3982|gene|nph       --------GDRETLESLETTWSNREGHTWLTTTQQTGATRK------
10753|gene|hma      --------GERAALEAVQDAWDKRPGATWLTTTQTEGTHTV------
1403|gene|hhi       --------GERAALEAVQDAWDERPGATWLTTTQTKGTHTV------
3025|gene|hut       --------GNRDDLERVRAIWEQREGTTWLTTTQTDGTRTR------
1876|gene|hla       ------PDDPETDLDAVADAWSNRPGTLRRTTTRNDGAAVE------
19080|gene|hbo      TESGDVDDGDESPLKRLRDLWAKRDGRTWLTTTRNDGASIR------
1323|gene|hvo       --------GDRTDLERVKELWDAREGETRLTTTRTDGARIQ------
                            .    :  :   *  . *  . ***.