Web document 5.5. A set of bacterial, fungal, and several vertebrate globin sequences in FASTA format. These sequences were subsequently aligned and used to build a hidden Markov model (HMM).

>gi|4504349|ref|NP_000509.1| beta globin [Homo sapiens]

MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLG

AFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVAN

ALAHKYH

>gi|71895591|ref|NP_001026660.1| hypothetical protein LOC428114 [Gallus gallus]

MVHWTAEEKQLITGLWGKVNVAECGAEALARLLIVYPWTQRFFASFGNLSSATAIIGNPMVRAHGKKVLS

SFGEAVKNLDNIKKSFAQLSKLHCDKLHVDPENFRLLGDILIIVLASHFSKDFTPASQAAWQKMVRVVAH

ALAHEYH

>gi|4504347|ref|NP_000549.1| alpha 1 globin [Homo sapiens]

MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNA

VAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSK

YR

>gi|52138655|ref|NP_001004376.1| alpha 1 globin [Gallus gallus]

MVLSAADKNNVKGIFTKIAGHAEEYGAETLERMFTTYPPTKTYFPHFDLSHGSAQIKGHGKKVVAALIEA

ANHIDDIAGTLSKLSDLHAHKLRVDPVNFKLLGQCFLVVVAIHHPAALTPEVHASLDKFLCAVGTVLTAK

YR

>gi|4885477|ref|NP_005359.1| myoglobin [Homo sapiens]

MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL

TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR

KDMASNYKELGFQG

>gi|50728806|ref|XP_416292.1| PREDICTED: similar to Myoglobin [Gallus gallus]

MGLSDQEWQQVLTIWGKVEADIAGHGHEVLMRLFHDHPETLDRFDKFKGLKTPDQMKGSEDLKKHGATVL

TQLGKILKQKGNHESELKPLAQTHATKHKIPVKYLEFISEVIIKVIAEKHAADFGADSQAAMKKALELFR

NDMASKYKEFGFQG

>gi|134295880|ref|YP_001119615.1| globin [Burkholderia vietnamiensis G4]

MTSLPASSDTAPARIRDAEPTEANIRDLVYAFYDRVRADPLLGPVFDAKLDGRWDTHLPKMVSFWSSLVL

GTRGYRGNVQQAHQPLDGIEPAHFSRWLSLFLKTVEARYTPAAAVRFMEPALRIAQSLQLSRFGWDYRIP

AEQQALLDAIAPRRRDADDDGHALPSRARGEPFPAKIIGRGVEPEAGPDA

>gi|124548961|ref|ZP_01707405.1| globin [Shewanella putrefaciens 200]

MNWLKKIFSKHTPPQDDRDPIQSNAYDLIGGEKVIRAITKCFYQKMASSAETTTLLAIHRAPIAESEQKL

FEFLSGWLGGPQLYQQKYGHPALRARHMHFAVDEAMRDQWLFCMKFAIEENINKPEHRAAIFEAISTLAD

HMRNQ

>gi|118757595|ref|ZP_01605344.1| globin [Shewanella pealeana ATCC 700345]

MNWLKKIIGNRDSNKDRDPNQSNAYDRIGGEIVIRAIAHQFYLQMQSNEATQALLSIHRAPIDESEQKLF

EFLSGWLGGPQLYQQKHGHPALRARHMPFKVDEAMRDQWLLCMKAAIEIEVTEPQHQQAIYSAIATLADH

MRNQ

>gi|118719249|ref|ZP_01571780.1| globin [Burkholderia multivorans ATCC 17616]

MCCGAPRCSRCVTRERGRRLPTILIQTIRMTDVNDDAPSQPTAFELVGGEARVREMVDRFYDLMDLEPEF

AQIRALHPASLDGSRDKLFWFLCGWLGGPDHYISRFGHPRLRARHLPFPIASVERDQWLRCMAWAMEDVG

LPEPLRERLMHSFYDTADWMRNRPG

>gi|94984762|ref|YP_604126.1| globin [Deinococcus geothermalis DSM 11300]

MTSGPLLTTSPLSGFGVEVVMASSAALPGAAGLLVPHDGEPVADVRDRPDRWTLLTLLAEAVRRGVPVLA

WGSGAALAGRVLGARVRPGEGAADWAEAPRGATVERWQGEVPLLWRAGPVTAWAGETLPEDLRSEFLARL

MQAEPRAPGSPLEVVGGEAALRTMLADFYARARADTLLGPVFAAHVQDWEAHLDRVTAFWVTMLGGGPAW

RGNLNSVHAGLGLRGTHLRRWLALFREAAEDCLGPEAAAPLTARAEAMGHRLGQRNAPHVGRVP

>gi|108465788|gb|ABF90973.1| protozoan/cyanobacterial globin family protein [Myxococcus xanthus DK 1622]

MPSPDDLPYHRLGGTDAAMALAEAFYDAMDAHEPELARLHELDAEGRVNRGTRERFGLFLAGWLGGPQDY

TERHGHPRLRMRHGHLSIGVAMRDAWVRSMQRAMDARGISGGLRRFLDARFAHVADFLRNVEE

>gi|115387639|ref|XP_001211325.1| bacterial hemoglobin [Aspergillus terreus NIH2624]

MSLSPEQVQIIKATVPVLAEHGTTITTVFYKNMLAAHPELNTVFNTSNQVNGHQPRSLAGALYAYASNID

NLGALGPAVELICNKHASLYIQPEQYKIVGKFLLEAMGEVLGDALTPEIHDAWATAYWMLANLMIQREAD

LYKQADGWTDFRDFRVTKKTVESSEITSFYLAPVDGKPLPTFQPGQYISVQVFVPGLNYPQTRQYSLSDA

PRPDYYRISVRKEPGLNPAEPGAKAHPGYVSNILHDTINEGDQIKVSHPFGDFYNKEPESPRPVVLLAAG

VGLTPLLSILNTIVSTPSAAGERKIHFVHGARTAGARAFKDHVLSLKEKIPGLQATYFTSHPGAEEKQGE

DYDFAGRIDLAKLADKDLFLDEPSAEYYVCGPEGFMTDIRAALVARGVSADRIKMELFGTGGVPA

>gi|6324095|ref|NP_014165.1| Similar to globins and has a functional heme-binding domain; involved in glucose signaling or metabolism; regulated by Rgt1p [Saccharomyces cerevisiae]

MTGEKILHSQLLTNSDMSSGNVHHTKPMMYNVTLPSYNSSSIGPVDNLKINERPGSHDHSMRSEMSSKNS

GSDFMPQSISRSEGSVYQVKIDRGDSPNTEGFDFKVNARDLLLLRMSWDILLREYLTPKELKVFQALLYS

NKHITSTERPYLNTAPDGMISKTIDPTARPRKTKQRDNDNKVDTALFCSQFYDNLIAMDPLLEEYFPSLK

HQAVSFCKVLDSAIDNLENVHVLDDYIVKLGKRHSRILGIKTVGFEVMGKAFMTTLQDRFGSFLTLELKN

LWGQLYSYLANCMITAGKDPMEKIQPDFSYNGDSVVLNFSIPKLAMHDISTVNKLQMVKTKNATIPHNIT

QVPTNKIPTEILLDNSSTPIKSDRESTPPISPKGSGSTKPSIGSSTVVESNTKKNNYDEKIHLLQKTAQQ

KNCSIM

>gi|70981999|ref|XP_746528.1| flavohemoprotein [Aspergillus fumigatus Af293]

MPLTPEQVQFIKATVPVLAEHGTTITTVFYKNMLTAHPELNNVFNTTHQVTGHQARALAGALFAYASNID

NLGALGPAVELMCHKHASLYIKPDDYKIVGKFLLEAMGQVLGDALTPEILDAWATAYWQLADIMIGREAQ

LYEQAEGWTDFRDFVVALKVPESSEITSFYLKPADGKPLPAFQPGQYISVQVHVPELNYLQARQYSLSDM

PRSDYYRISVKKESGLNPAEPGAKAHPGHVSNILHASVNEGDTIKVSHPFGDFFLSDAKAAHPVVLLSAG

VGLTPMTSILNTLTSQAPERKVSFIHGARNARARAFKNHITSLEQKLPNLKSTFFTSHPTEEDKEGDDYQ

FRGRVDLSQLDSNRDLFLDDATTEYYVCGPDTFMTDMLNVLKSKGVSEDRVKLELFGTGGVPH

>gi|126134831|ref|XP_001383940.1| flavohemoglobin [Pichia stipitis CBS 6054]

MMSTAPQIYTIQELTDSQKKIVLDTVPTLELAGETLTAQFYQNMFVDFPEVRPFFNQTDQKFLRQPRILA

FALLNYAKNIENLEPLTAFVKQIVSKHVGLQVKAEHYPCVGNSLIKTMKELLGPEVANEAFIDAWATAYG

NLAQLLIDMEDAEYQKAPWRGFREFTVTKIQDECTDVKSIYFKPTNEGDEISLPKRGQYLCFRWSLPGEE

QEISREYSISEYPSEKEYRISVRKLEGGKISGYIHNTLKVGDSLKVAPPCGKFVYVPSEKDIVLLVGGIG

ITPIVSILEKALQLGRNVTMLYSNKTVESRPFGNWLKELKEKYGEKFKLTEFFSNEKNVTAKDVIDAVET

RTLDSRDLDQISKDSDVYLLGPREYMKYVKGYLGAKGVEDIKLEYFGPLEV

>gi|68490256|ref|XP_711049.1| putative flavohemoglobin [Candida albicans SC5314]

MTVASASIINNYFESKPLTPEHIQIIIDSVPILEHLDVQLTEKFYKRLLKQNPEFKPFFNETHQKLLRQP

RIMIHFLIQYAKNIQDLTPMIDFIKKIASKHVGLQVKPEHYPKLGQVLINVIINLFPKQLVHDEFIEAWT

LAYQNLANLLIKLESEQYVEKPWYGFKQFKVTRLQRECSDVKSLYITPVDGSPIPKPKRGQYLCMRWLLP

GEKHEITREYSISEYPKNNEYRITVRYIPGGKVSNYIHNNINVGDIVYSGPPCGDCVYESSSKNLVFLAG

GNGVTALLPMIEAGLTEGRQVKLLYSNRSTDSRSFGKLFQSYKLQYGDRFQVVEFLSRGRTIDPIDKFYR

RSLTLEDLDFIVPEDDVYLIGPRTYMKMIEDYLKDRNITVKLDYFGPREI
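
The listing above follows the FASTA convention: each record starts with a `>` header line, followed by sequence lines (here separated by blank lines). Before aligning the sequences, it can help to read them into memory and check their lengths. The following is a minimal sketch in Python, assuming the records are available as plain text; `parse_fasta` is a hypothetical helper written for this illustration, not a function from any library, and it only joins sequence lines and skips blank ones. The alignment and HMM-building steps themselves are not shown.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text.

    Blank or whitespace-only lines are skipped, so the double-spaced
    layout used in this listing is handled.
    """
    records = []
    header = None
    chunks = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # ignore blank separator lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    # Flush the final record.
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# One record copied from the listing above (human alpha 1 globin).
example = """>gi|4504347|ref|NP_000549.1| alpha 1 globin [Homo sapiens]

MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNA

VAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSK

YR
"""

records = parse_fasta(example)
for header, sequence in records:
    print(header, len(sequence))
```

Applied to the full listing, the same function would return one tuple per `>` header, which makes it easy to spot the much longer fungal flavohemoglobin entries before choosing how to align them.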