HOG COG KOG arCOG COG description NRC-1 annotation
cHOG0331COG0031KOG1481arCOG01430Cysteine synthasecysteine synthase

HOG Organism Gene ID Name Annotation
cHOG0331 nrc1 VNG1301 cysK cysteine synthase
cHOG0331 hmu HMUK2310 Hmuk_2310 Pyridoxal-5'-phosphate-dependent protein beta subunit( EC:2.5.1.47 )
cHOG0331 hma RRNAC1236 cysK1 cysteine synthase
cHOG0331 hwa HQ2556A cysK cysteine synthase
cHOG0331 hla HLAC1358 Hlac_1358 Pyridoxal-5'-phosphate-dependent protein beta subunit
cHOG0331 hbo HBOR15620 Hbor_15620 cysteine synthase( EC:2.5.1.47 )
cHOG0331 hut HUTA0509 Huta_0509 cysteine synthase
cHOG0331 hvo HVO_1654 HVO_1654 cysK Cysteine synthase
cHOG0331 nph NP3116A cysK_2 cysteine synthase 2
cHOG0331 hhi 1830 cysK2 cysteine synthase A
MUSCLE (3.6) multiple sequence alignment


1301|gene|nrc1      MKDSILDAIGTPLVRLTAPEGATIAAKLEGFNPAGSAKDRPAREMVLAAERAGEIQPGDQ
3116|gene|nph       MKRGILDTIGSPLVQVESPPGSVIAAKIESKNPGGSAKDRPALAMVEAAEAAGELEPGDR
509|gene|hut        MDDSILETIGSPLVEIDSPPGATVAAKVESFNPGGSAKDRPALGMVEAAERDGDLSPGDR
1358|gene|hla       MDEDVLDTLGSPLVRVDAPTETTVAAKVESRNPGGSAKDRPALYMVEAAEEAGDLTPGDG
2556|gene|hwa       MDANVLETIGSPLVRVNSPPGVTIAAKIESKNPGGSAKDRPALAMVERAERAGDIEPGDA
15620|gene|hbo      MDANILETIGSPLVEVSSPEGATVAAKVESKNPGGSAKDRPALAMIEAAERAGEISPGDE
11236|gene|hma      MKDSILDTIGSPLVSVRAPEGATVAAKIESFNPGGSAKDRPAKYMIDDAERNGSLQPDDT
1830|gene|hhi       MKDSILDTIGSPLVSVRAPEGATVAAKIESFNPGGSAKDRPAKYMIDDAERNGTLEPGDT
                    *. .:*:::*:*** : :*   .:***:*. **.********  *:  **  * : *.* 

1301|gene|nrc1      LVEATSGNTGIGLALTAAARGYDLTIVMPASMSTERKQLLRAYGADLELVDAGMETANQV
3116|gene|nph       IVEPTSGNTGIGLAVVAAAKGYDLTVVMPESKSPERRALMRAYGADLELVEGTISDAKDC
509|gene|hut        IVEPTSGNTGIGISLVAAAKGYDVTIVMPADMSVERRRLMEAYGADLELIEGDMTDARDR
1358|gene|hla       IVEPTSGNTGIGLAMVGAVKGYDVTLVMPEGKSIERRRLMHAYGADVELVDGDISEAKDR
2556|gene|hwa       LVEPTSGNTGIGLSVVAAAKGYDLTVVMPASQSPERRELMRAYGTKIELIDGTISDAKDR
15620|gene|hbo      LVEPTSGNTGIGLSMVAAAKGYDMTIIMPSSQSPERRDIMRAYGATIELVDGDISDAKDR
11236|gene|hma      LVEPTSGNTGIGMAMVGATKGYDVVLVMPSSKSPERRQIMKAYGAEIELVEGDISDAKER
1830|gene|hhi       LVEPTSGNTGIGMAMVGATKGYDVVLVMPSSKSPERRQIMKAYGAEIELVEGDISDAKER
                    :**.********:::..*..***:.::** . * **. :: ***: :**::. :  *.: 

1301|gene|nrc1      ADRVAADTGAFRVGQFENPANPRAHYRTTAEEILDQVEGREIDALVAGVGTGGTITGTAT
3116|gene|nph       ADEL-EAEGMVQIRQFENPANPEAHYRTTGEEILRQVGDRDIDALVAGIGTGGTISGTGR
509|gene|hut        ADALEADEGMVQLRQFENPANPQAHYETTGPEILEQVGDREIDAFVAGVGTGGTISGTAR
1358|gene|hla       ADALERDAGMVQLRQFENPANPQAHYETTGPEILDQVDDRTVDAFVAGVGTGGTLTGIGR
2556|gene|hwa       ADEL-EDTGMKQLRQFENTANPDAHYQTTAEEILEQVEGRTVDALVAGVGTGGTISGIGR
15620|gene|hbo      ADEL-EAEGMTQLRQFENEANPRAHYRTTAEEILEQIGDRTVDALVAGVGTGGTISGIGS
11236|gene|hma      ANELCERDDYVQLRQFENPANPTAHYETTGEEILEQVGDRTVDALVAGVGTGGTLTGTGR
1830|gene|hhi       ADELCERDDYVQLRQFENPANPKAHYETTGEEILEQVGDRTVDALVAGVGTGGTLTGTGR
                    *: :    .  .: **** *** *** **. *** *: .* :**:***:*****::* . 

1301|gene|nrc1      RLREAHPEMSVVAVEPEANAVLSTGES--GDDDFQGMGPGFVSDNLDRELIDTVETVAVD
3116|gene|nph       RLKEVFPEMEVIGVEPEGNAVLSGDEP--GDDDFQGMGPGFVAPNLDRELLDGVEVVDIE
509|gene|hut        RLREAFHDVDVIGVEPAENAVLSTGES--GSDDFQGMGPGFVSDNLDRDVIDEVRTIELA
1358|gene|hla       RLREAFPDVRIDAVEPSDNAVLSGGEP--GIDDFQGMGPGFVSPNLDTDLLDHVHTVDIE
2556|gene|hwa       RLREEFPDMSVIAVEPAENAVISGSEP--GNDDFQGMGPGFISPNLDTELIDDVETVTIG
15620|gene|hbo      RLREEFPEMEIVAVEPAENAVLSTGES--GSDDYQGMGPGFVSPNLDRDLLDDVIPVPLE
11236|gene|hma      RLQDAFPGMDIVAVEPADNAVLSGMEPGTGEDSFQGMGPGFVSDNLDTDLLDDVMTVELP
1830|gene|hhi       RLREAFPEMDIVAVEPADNAVLSGMEPGTGEDSFQGMGPGFVSDNLDTDLLDDVLTVELP
                    **.: .  : : .***  ***:*  *.  * *.:*******:: *** :::* *  : : 

1301|gene|nrc1      AAEAETRRLAREEGVLVGQSSGAAALAATRTAERIADP---SLDCP----DVSFDADAVG
3116|gene|nph       TAEAECRRLAEEEGVLVGQSSGGSLVAAKRVARRLADERGIEVPCPGPVNDIGLLDDEPP
509|gene|hut        DAEAECRRLARAEGLLVGQSSGAMGVIAREVAAERAAP---DAE----------------
1358|gene|hla       DAEAECRRLAREEGLLVGQSSGASNLAAKNAAAELRES---DTFVG--------------
2556|gene|hwa       VAETECRRLAREEGILVGQSSAASNVAAKRIAKQLADP---AQPCNER--EAGYVIEDAP
15620|gene|hbo      DAEAECRRLAREEGILVGQSSGASNLAARRVAERLATP---EANCPEP--PERFVIEDMS
11236|gene|hma      DAEDECRRLAHEEGILVGQSSGASNLAAREFAEQLIRD---GV-----------------
1830|gene|hhi       DAEDECRRLAHEEGILVGQSSGASNLAAREVAEQLVAD---GV-----------------
                     ** * ****  **:******..  : *   *                            

1301|gene|nrc1      DPDAIADGGDGPD-----------------------------------------DCPLVV
3116|gene|nph       ERQAVPD-----------------------------------------------DCPLVV
509|gene|hut        ------------------------------------------------------EPPLIV
1358|gene|hla       ------------------------------------------------------DEPLVV
2556|gene|hwa       KSESVMKSSSTSTDDDTADTATPTSTSNTSTNTNTKAEYETGIAPSSEDRNTQPDCPLVI
15620|gene|hbo      DDP---RGDDGRTDDARADGGVPGGDTD--------------------------DCPLVI
11236|gene|hma      ------------------------------------------------------DDPLVV
1830|gene|hhi       ------------------------------------------------------DDPLVV
                                                                          : **::

1301|gene|nrc1      TILPDTGERYLSAGVFD-------
3116|gene|nph       TVFWDSGERYMSTGMFD-------
509|gene|hut        TVFWDSGERYLSTGLFD-------
1358|gene|hla       TVFWDSGERYLTAGTFDE------
2556|gene|hwa       TVFWDSGERYMSTGLFDDPDQSHE
15620|gene|hbo      TVFWDSGERYMSTGMFD-------
11236|gene|hma      TVYWDSGERYMSTGMFD-------
1830|gene|hhi       TVYWDSGERYMSTGMFD-------
                    *:  *:****:::* **