Web Document 2.1. Human myoglobin has three RefSeq mRNA accession numbers, one for each of three transcript (splice) variants. The nucleotide entries are as follows:

 
1:  NM_005368
Homo sapiens myoglobin (MB), transcript variant 1, mRNA
gi|44955876|ref|NM_005368.2|[44955876]
 
2:  NM_203377
Homo sapiens myoglobin (MB), transcript variant 2, mRNA
gi|44955884|ref|NM_203377.1|[44955884]
 
3:  NM_203378
Homo sapiens myoglobin (MB), transcript variant 3, mRNA
gi|44955887|ref|NM_203378.1|[44955887]
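
The identifier lines in the three entries above follow a regular "gi|number|ref|accession.version|" pattern, so they can be taken apart programmatically. The following is a minimal Python sketch, not a definitive tool: the function names parse_defline and efetch_fasta_url are hypothetical, and the URL assumes the NCBI E-utilities EFetch service with the nuccore database. The sketch only builds the request URL; it does not contact NCBI.

```python
from urllib.parse import urlencode

# Identifier strings copied from the three entries above.
DEFLINES = [
    "gi|44955876|ref|NM_005368.2|",
    "gi|44955884|ref|NM_203377.1|",
    "gi|44955887|ref|NM_203378.1|",
]

# Assumed endpoint: the NCBI E-utilities EFetch service.
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def parse_defline(defline):
    """Split 'gi|<gi>|ref|<accession.version>|' into its fields."""
    fields = defline.strip("|").split("|")
    gi, accession_version = fields[1], fields[3]
    accession, version = accession_version.split(".")
    return {"gi": gi, "accession": accession, "version": int(version)}


def efetch_fasta_url(accessions):
    """Build (but do not send) a request for the records in FASTA format."""
    params = {
        "db": "nuccore",
        "id": ",".join(accessions),
        "rettype": "fasta",
        "retmode": "text",
    }
    return EFETCH + "?" + urlencode(params)


records = [parse_defline(d) for d in DEFLINES]
url = efetch_fasta_url([r["accession"] for r in records])
for r in records:
    print(r["accession"], "version", r["version"], "gi", r["gi"])
print(url)
```

Opening the printed URL in a browser should be equivalent to the manual Entrez query, provided the service behaves as assumed here.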

 

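The DNA sequences are aligned below. To reproduce the alignment, enter the query “NM_005368, NM_203377, NM_203378” into Entrez Nucleotide, select the FASTA format, send the result to text, and then use a program such as MUSCLE (Chapter 6) to perform a multiple sequence alignment. The alignment shows that only the 5′ ends of the sequences vary; from just upstream of the start codon onward the three transcripts are identical. The start ATG spans the last two columns (AT) of the third alignment block and the first column (G) of the fourth.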

 

MUSCLE (3.6) multiple sequence alignment
 
 
gi|44955887|ref|NM_203378.1|      -----AATGGCACCTGCCCTAAAATAGCTTCCCATGTGAGGGCTAGAGAAAGGAAA----
gi|44955876|ref|NM_005368.2|      ------GCAGCCTCAAACC-----------------------------------------
gi|44955884|ref|NM_203377.1|      GAGCATGTTGGCCTGGTCCTTTGCTAGGTACTGTAGAGCAGGTGAGAGAGTGAGGGGGAA
                                           *       **                                         
 
gi|44955887|ref|NM_203378.1|      -------AGATTAGACCCTCCCT---GGATGAGAGAGAGAAAGTGAAGGAGGGCAGGGGA
gi|44955876|ref|NM_005368.2|      ---------------CCAGCTGTT------------------------GGGGCCAGGACA
gi|44955884|ref|NM_203377.1|      GGACTCCAAATTAGACCAGTTCTTAGCCATGAAGCAGAGACTCT----GAAGCCAGACTA
                                                 **     *                         *  * ***   *
 
gi|44955887|ref|NM_203378.1|      GGGGGACAGCGAGCCATTGAGC----GATCTTTGTCAAGCATCCCAGAAGACTGCGCCAT
gi|44955876|ref|NM_005368.2|      CCCAGTGAGCCCATACTTGCTCTTTTTGTCTTCTTC------------AGACTGCGCCAT
gi|44955884|ref|NM_203377.1|      CCTGGGT--CCCAATCTTGGGCTTGGTATTTCCTCGCTGTGTGACTCTGGACTGCGCCAT
                                      *    *      ***  *      * *                  ***********
 
gi|44955887|ref|NM_203378.1|      GGGGCTCAGCGACGGGGAATGGCAGTTGGTGCTGAACGTCTGGGGGAAGGTGGAGGCTGA
gi|44955876|ref|NM_005368.2|      GGGGCTCAGCGACGGGGAATGGCAGTTGGTGCTGAACGTCTGGGGGAAGGTGGAGGCTGA
gi|44955884|ref|NM_203377.1|      GGGGCTCAGCGACGGGGAATGGCAGTTGGTGCTGAACGTCTGGGGGAAGGTGGAGGCTGA
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      CATCCCAGGCCATGGGCAGGAAGTCCTCATCAGGCTCTTTAAGGGTCACCCAGAGACTCT
gi|44955876|ref|NM_005368.2|      CATCCCAGGCCATGGGCAGGAAGTCCTCATCAGGCTCTTTAAGGGTCACCCAGAGACTCT
gi|44955884|ref|NM_203377.1|      CATCCCAGGCCATGGGCAGGAAGTCCTCATCAGGCTCTTTAAGGGTCACCCAGAGACTCT
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      GGAGAAGTTTGACAAGTTCAAGCACCTGAAGTCAGAGGACGAGATGAAGGCGTCTGAGGA
gi|44955876|ref|NM_005368.2|      GGAGAAGTTTGACAAGTTCAAGCACCTGAAGTCAGAGGACGAGATGAAGGCGTCTGAGGA
gi|44955884|ref|NM_203377.1|      GGAGAAGTTTGACAAGTTCAAGCACCTGAAGTCAGAGGACGAGATGAAGGCGTCTGAGGA
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      CTTAAAGAAGCATGGTGCCACCGTGCTCACCGCCCTGGGTGGCATCCTTAAGAAGAAGGG
gi|44955876|ref|NM_005368.2|      CTTAAAGAAGCATGGTGCCACCGTGCTCACCGCCCTGGGTGGCATCCTTAAGAAGAAGGG
gi|44955884|ref|NM_203377.1|      CTTAAAGAAGCATGGTGCCACCGTGCTCACCGCCCTGGGTGGCATCCTTAAGAAGAAGGG
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      GCATCATGAGGCAGAGATTAAGCCCCTGGCACAGTCGCATGCCACCAAGCACAAGATCCC
gi|44955876|ref|NM_005368.2|      GCATCATGAGGCAGAGATTAAGCCCCTGGCACAGTCGCATGCCACCAAGCACAAGATCCC
gi|44955884|ref|NM_203377.1|      GCATCATGAGGCAGAGATTAAGCCCCTGGCACAGTCGCATGCCACCAAGCACAAGATCCC
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      CGTGAAGTACCTGGAGTTCATCTCGGAATGCATCATCCAGGTTCTGCAGAGCAAGCATCC
gi|44955876|ref|NM_005368.2|      CGTGAAGTACCTGGAGTTCATCTCGGAATGCATCATCCAGGTTCTGCAGAGCAAGCATCC
gi|44955884|ref|NM_203377.1|      CGTGAAGTACCTGGAGTTCATCTCGGAATGCATCATCCAGGTTCTGCAGAGCAAGCATCC
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      CGGGGACTTTGGTGCTGATGCCCAGGGGGCCATGAACAAGGCCCTGGAGCTGTTCCGGAA
gi|44955876|ref|NM_005368.2|      CGGGGACTTTGGTGCTGATGCCCAGGGGGCCATGAACAAGGCCCTGGAGCTGTTCCGGAA
gi|44955884|ref|NM_203377.1|      CGGGGACTTTGGTGCTGATGCCCAGGGGGCCATGAACAAGGCCCTGGAGCTGTTCCGGAA
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      GGACATGGCCTCCAACTACAAGGAGCTGGGCTTCCAGGGCTAGGCCCCTGCCGCTCCCAC
gi|44955876|ref|NM_005368.2|      GGACATGGCCTCCAACTACAAGGAGCTGGGCTTCCAGGGCTAGGCCCCTGCCGCTCCCAC
gi|44955884|ref|NM_203377.1|      GGACATGGCCTCCAACTACAAGGAGCTGGGCTTCCAGGGCTAGGCCCCTGCCGCTCCCAC
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      CCCCACCCATCTGGGCCCCGGGTTCAAGAGAGAGCGGGGTCTGATCTCGTGTAGCCATAT
gi|44955876|ref|NM_005368.2|      CCCCACCCATCTGGGCCCCGGGTTCAAGAGAGAGCGGGGTCTGATCTCGTGTAGCCATAT
gi|44955884|ref|NM_203377.1|      CCCCACCCATCTGGGCCCCGGGTTCAAGAGAGAGCGGGGTCTGATCTCGTGTAGCCATAT
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      AGAGTTTGCTTCTGAGTGTCTGCTTTGTTTAGTAGAGGTGGGCAGGAGGAGCTGAGGGGC
gi|44955876|ref|NM_005368.2|      AGAGTTTGCTTCTGAGTGTCTGCTTTGTTTAGTAGAGGTGGGCAGGAGGAGCTGAGGGGC
gi|44955884|ref|NM_203377.1|      AGAGTTTGCTTCTGAGTGTCTGCTTTGTTTAGTAGAGGTGGGCAGGAGGAGCTGAGGGGC
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      TGGGGCTGGGGTGTTGAAGTTGGCTTTGCATGCCCAGCGATGCGCCTCCCTGTGGGATGT
gi|44955876|ref|NM_005368.2|      TGGGGCTGGGGTGTTGAAGTTGGCTTTGCATGCCCAGCGATGCGCCTCCCTGTGGGATGT
gi|44955884|ref|NM_203377.1|      TGGGGCTGGGGTGTTGAAGTTGGCTTTGCATGCCCAGCGATGCGCCTCCCTGTGGGATGT
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      CATCACCCTGGGAACCGGGAGTGGCCCTTGGCTCACTGTGTTCTGCATGGTTTGGATCTG
gi|44955876|ref|NM_005368.2|      CATCACCCTGGGAACCGGGAGTGGCCCTTGGCTCACTGTGTTCTGCATGGTTTGGATCTG
gi|44955884|ref|NM_203377.1|      CATCACCCTGGGAACCGGGAGTGGCCCTTGGCTCACTGTGTTCTGCATGGTTTGGATCTG
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      AATTAATTGTCCTTTCTTCTAAATCCCAACCGAACTTCTTCCAACCTCCAAACTGGCTGT
gi|44955876|ref|NM_005368.2|      AATTAATTGTCCTTTCTTCTAAATCCCAACCGAACTTCTTCCAACCTCCAAACTGGCTGT
gi|44955884|ref|NM_203377.1|      AATTAATTGTCCTTTCTTCTAAATCCCAACCGAACTTCTTCCAACCTCCAAACTGGCTGT
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      AACCCCAAATCCAAGCCATTAACTACACCTGACAGTAGCAATTGTCTGATTAATCACTGG
gi|44955876|ref|NM_005368.2|      AACCCCAAATCCAAGCCATTAACTACACCTGACAGTAGCAATTGTCTGATTAATCACTGG
gi|44955884|ref|NM_203377.1|      AACCCCAAATCCAAGCCATTAACTACACCTGACAGTAGCAATTGTCTGATTAATCACTGG
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      CCCCTTGAAGACAGCAGAATGTCCCTTTGCAATGAGGAGGAGATCTGGGCTGGGCGGGCC
gi|44955876|ref|NM_005368.2|      CCCCTTGAAGACAGCAGAATGTCCCTTTGCAATGAGGAGGAGATCTGGGCTGGGCGGGCC
gi|44955884|ref|NM_203377.1|      CCCCTTGAAGACAGCAGAATGTCCCTTTGCAATGAGGAGGAGATCTGGGCTGGGCGGGCC
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      AGCTGGGGAAGCATTTGACTATCTGGAACTTGTGTGTGCCTCCTCAGGTATGGCAGTGAC
gi|44955876|ref|NM_005368.2|      AGCTGGGGAAGCATTTGACTATCTGGAACTTGTGTGTGCCTCCTCAGGTATGGCAGTGAC
gi|44955884|ref|NM_203377.1|      AGCTGGGGAAGCATTTGACTATCTGGAACTTGTGTGTGCCTCCTCAGGTATGGCAGTGAC
                                  ************************************************************
 
gi|44955887|ref|NM_203378.1|      TCACCTGGTTTTAATAAAACAACCTGCAACATCTCA
gi|44955876|ref|NM_005368.2|      TCACCTGGTTTTAATAAAACAACCTGCAACATCTCA
gi|44955884|ref|NM_203377.1|      TCACCTGGTTTTAATAAAACAACCTGCAACATCTCA
                                  ************************************
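
The asterisk line under each block marks columns in which all three sequences carry the same base. As a rough illustration, the Python sketch below recomputes that line for the third block of the alignment above and counts the unbroken run of conserved columns at its right-hand end. The names conservation_line and conserved_tail_length are hypothetical helpers, and the rule used here (identical in every row, with no gap) is an assumption about how the marks are assigned.

```python
# Third block of the MUSCLE alignment above (60 columns per row).
BLOCK = {
    "NM_203378.1": "GGGGGACAGCGAGCCATTGAGC----GATCTTTGTCAAGCATCCCAGAAGACTGCGCCAT",
    "NM_005368.2": "CCCAGTGAGCCCATACTTGCTCTTTTTGTCTTCTTC------------AGACTGCGCCAT",
    "NM_203377.1": "CCTGGGT--CCCAATCTTGGGCTTGGTATTTCCTCGCTGTGTGACTCTGGACTGCGCCAT",
}


def conservation_line(rows):
    """Return '*' for columns where every row has the same base, else ' '."""
    marks = []
    for column in zip(*rows):
        identical = len(set(column)) == 1 and column[0] != "-"
        marks.append("*" if identical else " ")
    return "".join(marks)


def conserved_tail_length(line):
    """Count the unbroken run of '*' at the right-hand end of the line."""
    return len(line) - len(line.rstrip("*"))


line = conservation_line(BLOCK.values())
for name, row in BLOCK.items():
    print(f"{name:<14}{row}")
print(f"{'':<14}{line}")
print("conserved columns at the end of this block:", conserved_tail_length(line))
```

For this block the run of conserved columns ends with the AT of the start codon, which is consistent with the variation being confined to the 5′ ends.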

The three transcripts encode the identical myoglobin protein, which is recorded under three protein accession numbers, as follows:

 
>gi|4885477|ref|NP_005359.1| myoglobin [Homo sapiens]
MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL
TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR
KDMASNYKELGFQG
 
>gi|44955885|ref|NP_976311.1| myoglobin [Homo sapiens]
MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL
TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR
KDMASNYKELGFQG
 
>gi|44955888|ref|NP_976312.1| myoglobin [Homo sapiens]
MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL
TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR
KDMASNYKELGFQG
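
Both claims can be checked with a short script. The Python sketch below parses the three protein records above, confirms that they are identical, and translates the shared start of the coding region (the end of the third alignment block plus the fourth block) to compare it with the N-terminus of the protein. The helper names are hypothetical. The codon table is built from the standard genetic code, and starting at the first ATG is a simplification that happens to work for this short excerpt.

```python
FASTA = """\
>gi|4885477|ref|NP_005359.1| myoglobin [Homo sapiens]
MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL
TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR
KDMASNYKELGFQG
>gi|44955885|ref|NP_976311.1| myoglobin [Homo sapiens]
MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL
TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR
KDMASNYKELGFQG
>gi|44955888|ref|NP_976312.1| myoglobin [Homo sapiens]
MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVL
TALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFR
KDMASNYKELGFQG
"""

# Shared sequence around the start codon: the conserved end of the third
# alignment block followed by the fourth block.
SHARED_DNA = ("GACTGCGCCAT"
              "GGGGCTCAGCGACGGGGAATGGCAGTTGGTGCTGAACGTCTGGGGGAAGGTGGAGGCTGA")

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def read_fasta(text):
    """Return {first word of header: sequence} for each FASTA record."""
    records, name = {}, None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif line and name is not None:
            records[name] += line
    return records


def translate_from_first_atg(dna):
    """Translate complete codons from the first ATG until a stop codon."""
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


proteins = read_fasta(FASTA)
all_identical = len(set(proteins.values())) == 1
n_terminus = translate_from_first_atg(SHARED_DNA)
reference = next(iter(proteins.values()))
print("records:", len(proteins), "identical:", all_identical)
print("translated start:", n_terminus)
print("matches protein N-terminus:", reference.startswith(n_terminus))
```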