Web document 5.5. A set of bacterial and fungal (and several vertebrate) globins that were subsequently aligned and used to make an HMM.
>gi|4504349|ref|NP_000509.1| beta globin [Homo sapiens]
MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLG
AFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVAN
ALAHKYH
>gi|71895591|ref|NP_001026660.1| hypothetical protein LOC428114 [Gallus gallus]
MVHWTAEEKQLITGLWGKVNVAECGAEALARLLIVYPWTQRFFASFGNLSSATAIIGNPMVRAHGKKVLS
SFGEAVKNLDNIKKSFAQLSKLHCDKLHVDPENFRLLGDILIIVLASHFSKDFTPASQAAWQKMVRVVAH
ALAHEYH
>gi|4504347|ref|NP_000549.1| alpha 1 globin [Homo sapiens]
MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNA
VAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSK
YR
>gi|52138655|ref|NP_001004376.1| alpha 1 globin [Gallus gallus]
MVLSAADKNNVKGIFTKIAGHAEEYGAETLERMFTTYPPTKTYFPHFDLSHGSAQIKGHGKKVVAALIEA
ANHIDDIAGTLSKLSDLHAHKLRVDPVNFKLLGQCFLVVVAIHHPAALTPEVHASLDKFLCAVGTVLTAK
YR
>gi|4885477|ref|NP_005359.1| myoglobin [Homo sapiens]
MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL
TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR
KDMASNYKELGFQG
>gi|50728806|ref|XP_416292.1| PREDICTED: similar to Myoglobin [Gallus gallus]
MGLSDQEWQQVLTIWGKVEADIAGHGHEVLMRLFHDHPETLDRFDKFKGLKTPDQMKGSEDLKKHGATVL
TQLGKILKQKGNHESELKPLAQTHATKHKIPVKYLEFISEVIIKVIAEKHAADFGADSQAAMKKALELFR
NDMASKYKEFGFQG
>gi|134295880|ref|YP_001119615.1| globin [Burkholderia vietnamiensis G4]
MTSLPASSDTAPARIRDAEPTEANIRDLVYAFYDRVRADPLLGPVFDAKLDGRWDTHLPKMVSFWSSLVL
GTRGYRGNVQQAHQPLDGIEPAHFSRWLSLFLKTVEARYTPAAAVRFMEPALRIAQSLQLSRFGWDYRIP
AEQQALLDAIAPRRRDADDDGHALPSRARGEPFPAKIIGRGVEPEAGPDA
>gi|124548961|ref|ZP_01707405.1| globin [Shewanella putrefaciens 200]
MNWLKKIFSKHTPPQDDRDPIQSNAYDLIGGEKVIRAITKCFYQKMASSAETTTLLAIHRAPIAESEQKL
FEFLSGWLGGPQLYQQKYGHPALRARHMHFAVDEAMRDQWLFCMKFAIEENINKPEHRAAIFEAISTLAD
HMRNQ
>gi|118757595|ref|ZP_01605344.1| globin [Shewanella pealeana ATCC 700345]
MNWLKKIIGNRDSNKDRDPNQSNAYDRIGGEIVIRAIAHQFYLQMQSNEATQALLSIHRAPIDESEQKLF
EFLSGWLGGPQLYQQKHGHPALRARHMPFKVDEAMRDQWLLCMKAAIEIEVTEPQHQQAIYSAIATLADH
MRNQ
>gi|118719249|ref|ZP_01571780.1| globin [Burkholderia multivorans ATCC 17616]
MCCGAPRCSRCVTRERGRRLPTILIQTIRMTDVNDDAPSQPTAFELVGGEARVREMVDRFYDLMDLEPEF
AQIRALHPASLDGSRDKLFWFLCGWLGGPDHYISRFGHPRLRARHLPFPIASVERDQWLRCMAWAMEDVG
LPEPLRERLMHSFYDTADWMRNRPG
>gi|94984762|ref|YP_604126.1| globin [Deinococcus geothermalis DSM 11300]
MTSGPLLTTSPLSGFGVEVVMASSAALPGAAGLLVPHDGEPVADVRDRPDRWTLLTLLAEAVRRGVPVLA
WGSGAALAGRVLGARVRPGEGAADWAEAPRGATVERWQGEVPLLWRAGPVTAWAGETLPEDLRSEFLARL
MQAEPRAPGSPLEVVGGEAALRTMLADFYARARADTLLGPVFAAHVQDWEAHLDRVTAFWVTMLGGGPAW
RGNLNSVHAGLGLRGTHLRRWLALFREAAEDCLGPEAAAPLTARAEAMGHRLGQRNAPHVGRVP
>gi|108465788|gb|ABF90973.1| protozoan/cyanobacterial globin family protein [Myxococcus xanthus DK 1622]
MPSPDDLPYHRLGGTDAAMALAEAFYDAMDAHEPELARLHELDAEGRVNRGTRERFGLFLAGWLGGPQDY
TERHGHPRLRMRHGHLSIGVAMRDAWVRSMQRAMDARGISGGLRRFLDARFAHVADFLRNVEE
>gi|115387639|ref|XP_001211325.1| bacterial hemoglobin [Aspergillus terreus NIH2624]
MSLSPEQVQIIKATVPVLAEHGTTITTVFYKNMLAAHPELNTVFNTSNQVNGHQPRSLAGALYAYASNID
NLGALGPAVELICNKHASLYIQPEQYKIVGKFLLEAMGEVLGDALTPEIHDAWATAYWMLANLMIQREAD
LYKQADGWTDFRDFRVTKKTVESSEITSFYLAPVDGKPLPTFQPGQYISVQVFVPGLNYPQTRQYSLSDA
PRPDYYRISVRKEPGLNPAEPGAKAHPGYVSNILHDTINEGDQIKVSHPFGDFYNKEPESPRPVVLLAAG
VGLTPLLSILNTIVSTPSAAGERKIHFVHGARTAGARAFKDHVLSLKEKIPGLQATYFTSHPGAEEKQGE
DYDFAGRIDLAKLADKDLFLDEPSAEYYVCGPEGFMTDIRAALVARGVSADRIKMELFGTGGVPA
>gi|6324095|ref|NP_014165.1| Similar to globins and has a functional heme-binding domain; involved in glucose signaling or metabolism; regulated by Rgt1p [Saccharomyces cerevisiae]
MTGEKILHSQLLTNSDMSSGNVHHTKPMMYNVTLPSYNSSSIGPVDNLKINERPGSHDHSMRSEMSSKNS
GSDFMPQSISRSEGSVYQVKIDRGDSPNTEGFDFKVNARDLLLLRMSWDILLREYLTPKELKVFQALLYS
NKHITSTERPYLNTAPDGMISKTIDPTARPRKTKQRDNDNKVDTALFCSQFYDNLIAMDPLLEEYFPSLK
HQAVSFCKVLDSAIDNLENVHVLDDYIVKLGKRHSRILGIKTVGFEVMGKAFMTTLQDRFGSFLTLELKN
LWGQLYSYLANCMITAGKDPMEKIQPDFSYNGDSVVLNFSIPKLAMHDISTVNKLQMVKTKNATIPHNIT
QVPTNKIPTEILLDNSSTPIKSDRESTPPISPKGSGSTKPSIGSSTVVESNTKKNNYDEKIHLLQKTAQQ
KNCSIM
>gi|70981999|ref|XP_746528.1| flavohemoprotein [Aspergillus fumigatus Af293]
MPLTPEQVQFIKATVPVLAEHGTTITTVFYKNMLTAHPELNNVFNTTHQVTGHQARALAGALFAYASNID
NLGALGPAVELMCHKHASLYIKPDDYKIVGKFLLEAMGQVLGDALTPEILDAWATAYWQLADIMIGREAQ
LYEQAEGWTDFRDFVVALKVPESSEITSFYLKPADGKPLPAFQPGQYISVQVHVPELNYLQARQYSLSDM
PRSDYYRISVKKESGLNPAEPGAKAHPGHVSNILHASVNEGDTIKVSHPFGDFFLSDAKAAHPVVLLSAG
VGLTPMTSILNTLTSQAPERKVSFIHGARNARARAFKNHITSLEQKLPNLKSTFFTSHPTEEDKEGDDYQ
FRGRVDLSQLDSNRDLFLDDATTEYYVCGPDTFMTDMLNVLKSKGVSEDRVKLELFGTGGVPH
>gi|126134831|ref|XP_001383940.1| flavohemoglobin [Pichia stipitis CBS 6054]
MMSTAPQIYTIQELTDSQKKIVLDTVPTLELAGETLTAQFYQNMFVDFPEVRPFFNQTDQKFLRQPRILA
FALLNYAKNIENLEPLTAFVKQIVSKHVGLQVKAEHYPCVGNSLIKTMKELLGPEVANEAFIDAWATAYG
NLAQLLIDMEDAEYQKAPWRGFREFTVTKIQDECTDVKSIYFKPTNEGDEISLPKRGQYLCFRWSLPGEE
QEISREYSISEYPSEKEYRISVRKLEGGKISGYIHNTLKVGDSLKVAPPCGKFVYVPSEKDIVLLVGGIG
ITPIVSILEKALQLGRNVTMLYSNKTVESRPFGNWLKELKEKYGEKFKLTEFFSNEKNVTAKDVIDAVET
RTLDSRDLDQISKDSDVYLLGPREYMKYVKGYLGAKGVEDIKLEYFGPLEV
>gi|68490256|ref|XP_711049.1| putative flavohemoglobin [Candida albicans SC5314]
MTVASASIINNYFESKPLTPEHIQIIIDSVPILEHLDVQLTEKFYKRLLKQNPEFKPFFNETHQKLLRQP
RIMIHFLIQYAKNIQDLTPMIDFIKKIASKHVGLQVKPEHYPKLGQVLINVIINLFPKQLVHDEFIEAWT
LAYQNLANLLIKLESEQYVEKPWYGFKQFKVTRLQRECSDVKSLYITPVDGSPIPKPKRGQYLCMRWLLP
GEKHEITREYSISEYPKNNEYRITVRYIPGGKVSNYIHNNINVGDIVYSGPPCGDCVYESSSKNLVFLAG
GNGVTALLPMIEAGLTEGRQVKLLYSNRSTDSRSFGKLFQSYKLQYGDRFQVVEFLSRGRTIDPIDKFYR
RSLTLEDLDFIVPEDDVYLIGPRTYMKMIEDYLKDRNITVKLDYFGPREI