Web document 5.7. Alignment of bacterial and other globins for input into HMMER.

 

 

 

 

>gi|108465788|gb|ABF90973.1|  protozoan/cyanobacterial globin family protein [Myxococcus xanthus DK 1622]

------------------------------------------------------------

---------------------------------------M--PSP-D-------------

--DLPYHRLGGTDAAMALAEAFYDAMDAHEPELARLH---ELDAEGRVNRGTRERFGLFL

--------------------AGWLGGP--------QDYTERHGHPRLRMRHGHLSIGVAM

RDAWVRSMQR---------AMDARGISG----------------GLRRFLD-ARFAHVAD

FLRNV-------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------EE-----------------------

>gi|118757595|ref|ZP_01605344.1|    globin [Shewanella pealeana ATCC 700345]

------------------------------------------------------------

----------------------------MNWLKKIIGNRD--SNK-DRD----------P

NQSNAYDRIGGEIVIRAIAHQFYLQMQSNEATQALLS---IHRAP--I-DESEQKLFEFL

--------------------SGWLGGP--------QLYQQKHGHPALRARHMPFKVDEAM

RDQWLLCMKA---------AIEIEVTEP----------------QHQQAIY-SAIATLAD

HMRNQ-------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

----------------------------------

>gi|118719249|ref|ZP_01571780.1|    globin [Burkholderia multivorans ATCC 17616]

-----------------MCCG---------------------------------------

------------APRCSRCVTRERGRRLPTILIQTIRMT---DVN-DDA----------P

SQPTAFELVGGEARVREMVDRFYDLMDLEPEFAQIRA---LHPAS--L-DGSRDKLFWFL

--------------------CGWLGGP--------DHYISRFGHPRLRARHLPFPIASVE

RDQWLRCMAW---------AMEDVGLPE----------------PLRERLM-HSFYDTAD

WMRNR-------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------PG-----------------------

>gi|124548961|ref|ZP_01707405.1|    globin [Shewanella putrefaciens 200]

------------------------------------------------------------

----------------------------MNWLKKIFSKHT--PPQDDRD----------P

IQSNAYDLIGGEKVIRAITKCFYQKMASSAETTTLLA---IHRAP--I-AESEQKLFEFL

--------------------SGWLGGP--------QLYQQKYGHPALRARHMHFAVDEAM

RDQWLFCMKF---------AIEENINKP----------------EHRAAIF-EAISTLAD

HMRNQ-------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

----------------------------------

>gi|6324095|ref|NP_014165.1|  Similar to globins and has a functional heme-binding domain; involved in glucose signaling or metabolism; regulated by Rgt1p [Saccharomyces cerevisiae]

-MTGEKILHSQLLTNSDMSSGNVHHTKPMMYNVTLPSYNSSSIGPVDNLKINERPGSHDH

SMRSEMSSKNSGSDFMPQSISRSEGSVYQV---KIDRGDS--PNTEGFD-----------

FKVNARDLL-----------------------LLRMSWDILLREY--LT-PKELKVFQAL

-----LYSNKHITSTER---PYLNTAP--------DGMISKTIDPTARPRKTKQRDNDNK

VDTALFCSQFYDNLIAMDPLLEEYFPSL----------------KHQAVSFCKVLDSAID

NLENV-HVLDDYIVKL-----GKRHSRILGIKTVGFEVMG--------------------

---KAFMTTLQDRFGSFLT-LELKNLWGQLYSYLANCMITAGKD----------------

----------------------------------------------------------PM

EKIQPD------------------------------------------------------

---------------FSYNGD-----SVVLNFSIPKLAMHDISTVNKLQMVKTKNATIPH

NITQVPTNKIPT----EILL---D--NSSTPIKS-DRESTPPISPKGSGSTKPSIGSSTV

VESNTKKNNYD---EKIHLLQKTAQQKNCSIM--

>gi|115387639|ref|XP_001211325.1|   bacterial hemoglobin [Aspergillus terreus NIH2624]

------------------------------------------------------------

--------MSLSPEQVQII-----------------------------------------

----KATVPVLAEHGTTITTVF--------------------------------------

------------------------------------------------------------

----------YKNMLAAHPELNTVFNTS------------NQVNGHQPRSLAGALYAYAS

NIDNL-GALGPAVELI-----CNKHAS-LYIQPEQYKIVG--------------------

---KFLLEAMGEVLGDALT-PEIHDAWATAYWMLANLMIQREADLYKQADGWTDFRDFRV

TKKTVESSEITSFYLAPVD-GKPLPTFQPGQYISVQVFVPGLNYPQTRQYSLSDAPRPDY

YRISVRKEPGLNPAEPGAKAHPGYVSNILHDTINEGDQIKVSHPFGDFYNKEPESPRPVV

LLAAGVGLTPLLSILNTIVSTPSAAGERKIHFVH-GARTAGARAFKDHVL----SLKEKI

-PGLQATYFTSHPGAEEKQGEDYDFAGRIDLAKL-ADKDLFLDEPSAEYYVCGPEGFMTD

IRAALVARGVSADRIKMELFGTG------G-VPA

>gi|70981999|ref|XP_746528.1| flavohemoprotein [Aspergillus fumigatus Af293]

------------------------------------------------------------

--------MPLTPEQVQFI-----------------------------------------

----KATVPVLAEHGTTITTVF--------------------------------------

------------------------------------------------------------

----------YKNMLTAHPELNNVFNTT------------HQVTGHQARALAGALFAYAS

NIDNL-GALGPAVELM-----CHKHAS-LYIKPDDYKIVG--------------------

---KFLLEAMGQVLGDALT-PEILDAWATAYWQLADIMIGREAQLYEQAEGWTDFRDFVV

ALKVPESSEITSFYLKPAD-GKPLPAFQPGQYISVQVHVPELNYLQARQYSLSDMPRSDY

YRISVKKESGLNPAEPGAKAHPGHVSNILHASVNEGDTIKVSHPFGDFFLSDAKAAHPVV

LLSAGVGLTPMTSILNTLTS---QAPERKVSFIH-GARNARARAFKNHIT----SLEQKL

-PNLKSTFFTSHPTEEDKEGDDYQFRGRVDLSQLDSNRDLFLDDATTEYYVCGPDTFMTD

MLNVLKSKGVSEDRVKLELFGTG------G-VPH

>gi|126134831|ref|XP_001383940.1|   flavohemoglobin [Pichia stipitis CBS 6054]

--------MMSTAPQI--------------------------------------------

-----YTIQELTDSQKKIV-----------------------------------------

----LDTVPTLELAGETLTAQF--------------------------------------

------------------------------------------------------------

----------YQNMFVDFPEVRPFFNQT------------DQKFLRQPRILAFALLNYAK

NIENL-EPLTAFVKQI-----VSKHVG-LQVKAEHYPCVG--------------------

---NSLIKTMKELLGPEVANEAFIDAWATAYGNLAQLLIDMEDAEYQKAP-WRGFREFTV

TKIQDECTDVKSIYFKPTNEGDEISLPKRGQYLCFRWSLPGEEQEISREYSISEYPSEKE

YRISVRKLEG------------GKISGYIHNTLKVGDSLKVAPPCGKFVYV--PSEKDIV

LLVGGIGITPIVSILEKALQL-----GRNVTMLY-SNKTVESRPFGNWLK----ELKEKY

GEKFKLTEFFSN----EKNVTAKDVIDAVETRTL-DSRDLDQISKDSDVYLLGPREYMKY

VKGYLGAKGVE--DIKLEYFGPL------E-V--

>gi|134295880|ref|YP_001119615.1|   globin [Burkholderia vietnamiensis G4]

------------------------------------------------------------

-----MTSLPASSDTAPAR-----------------------------------------

----IRDAEPTEANIRDLVYAF--------------------------------------

------------------------------------------------------------

----------YDRVRA-DPLLGPVFDAK------------LD----GRWDTHLPKMVSFW

SSLVL-GTRGYRGNVQ-----QAHQPL-DGIEPAHFSRWL--------------------

---SLFLKT---------------------------------------------------

------------------------------------------------------------

--VEARYTPA------------AAV-----------------------------------

------------------------------RFMEPALRIAQSLQLSRFGW----DYRIPA

EQQ--------------------ALLDAIAPRRR-DA--------DDDGHALPSRARGEP

FPAKIIGRGVE------PEAGPD--------A--

>gi|68490256|ref|XP_711049.1| putative flavohemoglobin [Candida albicans SC5314]

MTV----ASASIINNY--------------------------------------------

-----FESKPLTPEHIQII-----------------------------------------

----IDSVPILEHLDVQLTEKF--------------------------------------

------------------------------------------------------------

----------YKRLLKQNPEFKPFFNET------------HQKLLRQPRIMIHFLIQYAK

NIQDL-TPMIDFIKKI-----ASKHVG-LQVKPEHYPKLG--------------------

---QVLINVIINLFPKQLVHDEFIEAWTLAYQNLANLLIKLESEQYVEKP-WYGFKQFKV

TRLQRECSDVKSLYITPVD-GSPIPKPKRGQYLCMRWLLPGEKHEITREYSISEYPKNNE

YRITVRYIPG------------GKVSNYIHNNINVGDIVYSGPPCGDCVYE--SSSKNLV

FLAGGNGVTALLPMIEAGLTE-----GRQVKLLY-SNRSTDSRSFGKLFQ----SYKLQY

GDRFQVVEFLSR----GRTI---DPIDKFYRRSL-TLEDLDFIVPEDDVYLIGPRTYMKM

IEDYLKDRNIT---VKLDYFGPR------E-I--

>gi|4504347|ref|NP_000549.1|  alpha 1 globin [Homo sapiens]

------------------------------------------------------------

--------MVLSPADKTN------------------------------------------

------------------------------------------------------------

---------------------------VKAAWGKVGAHAGEYG-----------------

-------AEALERMFLSFPTTKTYFPHF------DLSHGSAQVKGHGKKVAD-ALTNAVA

HVDDMPNALSALSDLH-----AH----KLRVDPVNFKLLS--------------------

---HCLLVTLAAHLPAEFT-PAVHASLDKFLASVSTVL----------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------------------------------------------TSKY-----------

-----------------R----------------

>gi|52138655|ref|NP_001004376.1|    alpha 1 globin [Gallus gallus]

------------------------------------------------------------

--------MVLSAADKNN------------------------------------------

------------------------------------------------------------

---------------------------VKGIFTKIAGHAEEYG-----------------

-------AETLERMFTTYPPTKTYFPHF------DLSHGSAQIKGHGKKVVA-ALIEAAN

HIDDIAGTLSKLSDLH-----AH----KLRVDPVNFKLLG--------------------

---QCFLVVVAIHHPAALT-PEVHASLDKFLCAVGTVL----------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------------------------------------------TAKY-----------

-----------------R----------------

>gi|4885477|ref|NP_005359.1|  myoglobin [Homo sapiens]

------------------------------------------------------------

--------MGLSDGEWQL------------------------------------------

------------------------------------------------------------

---------------------------VLNVWGKVEADIPGHG-----------------

-------QEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVLT-ALGGILK

KKGHHEAEIKPLAQSH-----AT----KHKIPVKYLEFIS--------------------

---ECIIQVLQSKHPGDFG-ADAQGAMNKALELFRKDM----------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------------------------------------------ASNY-----------

-----------------KELGFQ--------G--

>gi|50728806|ref|XP_416292.1| PREDICTED: similar to Myoglobin [Gallus gallus]

------------------------------------------------------------

--------MGLSDQEWQQ------------------------------------------

------------------------------------------------------------

---------------------------VLTIWGKVEADIAGHG-----------------

-------HEVLMRLFHDHPETLDRFDKFKGLKTPDQMKGSEDLKKHGATVLT-QLGKILK

QKGNHESELKPLAQTH-----AT----KHKIPVKYLEFIS--------------------

---EVIIKVIAEKHAADFG-ADSQAAMKKALELFRNDM----------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------------------------------------------ASKY-----------

-----------------KEFGFQ--------G--

>gi|4504349|ref|NP_000509.1|  beta globin [Homo sapiens]

-----------------M------------------------------------------

--------VHLTPEEKSA------------------------------------------

------------------------------------------------------------

---------------------------VTALWGKV--NVDEVG-----------------

-------GEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLG-AFSDGLA

HLDNLKGTFATLSELH-----CD----KLHVDPENFRLLG--------------------

---NVLVCVLAHHFGKEFT-PPVQAAYQKVVAGVANAL----------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------------------------------------------AHKY-----------

-----------------H----------------

>gi|71895591|ref|NP_001026660.1|    hypothetical protein LOC428114 [Gallus gallus]

-----------------M------------------------------------------

--------VHWTAEEKQL------------------------------------------

------------------------------------------------------------

---------------------------ITGLWGKV--NVAECG-----------------

-------AEALARLLIVYPWTQRFFASFGNLSSATAIIGNPMVRAHGKKVLS-SFGEAVK

NLDNIKKSFAQLSKLH-----CD----KLHVDPENFRLLG--------------------

---DILIIVLASHFSKDFT-PASQAAWQKMVRVVAHAL----------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------------------------------------------AHEY-----------

-----------------H----------------

>gi|94984762|ref|YP_604126.1| globin [Deinococcus geothermalis DSM 11300]

-----------------MTSGPL-------------------------------------

--------LTTSPLSGFGVEVVMASSAALPGAAGLLVPHDGEPVADVRDRPDRWTLLTLL

AEAVRRGVPVLA-------------------------WGSGAALAGRV-LGARVRPGEGA

ADWAEAPRGATVERWQGEVPLLWRAGPVTAWAGET--LPEDLR-----------------

-------SEFLARLMQAEPRA----------------PGSPLEVVGGEAALR-TMLADFY

ARARADTLLGPVFAAHVQDWEAH----LDRVTAFWVTMLGGGPAWRGNLNSVHAGLGLRG

THLRRWLALFREAAEDCLG-PEAAAPLTARAEAMGHRL----------------------

------------------------------------------------------------

------------------------------------------------------------

------------------------------------------------------------

---------------------------------------------GQRN-----------

----------------APHVGRV--------P--