Phylogeny page for 3-week course on Bioinformatics
Links for Phylogeny lecture and exercises
Acknowledgement
This page is a revised version of the page made for the 2001 course by Henrik Stryhn.
Data sets for Phylogeny exercises
Exercise 1
4 3
seq1 AAG
seq2 AAA
seq3 GGA
seq4 AGA
5 846
chimpanzeeAAGCTTCACC GGCGCAATTA TCCTCATAAT CGCCCACGGA CTTACATCCT
gibbonxxxxAAGCTTTACA GGTGCAACCG TCCTCATAAT CGCCCACGGA CTAACCTCTT
gorillaexxAAGCTTCACC GGCGCAGTTG TTCTTATAAT TGCCCACGGA CTTACATCAT
homosapienAAGCTTCACC GGCGCAGTCA TTCTCATAAT CGCCCACGGA CTTACATCCT
orangutangAAGCTTCACC GGCGCAACCA CCCTCATGAT TGCCCATGGA CTCACATCCT
CATTATTATT CTGCCTAGCA AACTCAAATT ATGAACGCAC CCACAGTCGC
CCCTGCTATT CTGCCTTGCA AACTCAAACT ACGAACGAAC TCACAGCCGC
CATTATTATT CTGCCTAGCA AACTCAAACT ACGAACGAAC CCACAGCCGC
CATTACTATT CTGCCTAGCA AACTCAAACT ACGAACGCAC TCACAGTCGC
CCCTACTGTT CTGCCTAGCA AACTCAAACT ACGAACGAAC CCACAGCCGC
ATCATAATTC TCTCCCAAGG ACTTCAAACT CTACTCCCAC TAATAGCCTT
ATCATAATCC TATCTCGAGG GCTCCAAGCC TTACTCCCAC TGATAGCCTT
ATCATAATTC TCTCTCAAGG ACTCCAAACC CTACTCCCAC TAATAGCCCT
ATCATAATCC TCTCTCAAGG ACTTCAAACT CTACTCCCAC TAATAGCTTT
ATCATAATCC TCTCTCAAGG CCTTCAAACT CTACTCCCCC TAATAGCCCT
TTGATGACTC CTAGCAAGCC TCGCTAACCT CGCCCTACCC CCTACCATTA
CTGATGACTC GCAGCAAGCC TCGCTAACCT CGCCCTACCC CCCACTATTA
TTGATGACTT CTGGCAAGCC TCGCCAACCT CGCCTTACCC CCCACCATTA
TTGATGACTT CTAGCAAGCC TCGCTAACCT CGCCTTACCC CCCACTATTA
CTGATGACTT CTAGCAAGCC TCACTAACCT TGCCCTACCA CCCACCATCA
ATCTCCTAGG GGAACTCTCC GTGCTAGTAA CCTCATTCTC CTGATCAAAT
ACCTCCTAGG TGAACTCTTC GTACTAATGG CCTCCTTCTC CTGGGCAAAC
ACCTACTAGG AGAGCTCTCC GTACTAGTAA CCACATTCTC CTGATCAAAT
ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACATTCTC CTGATCAAAT
ACCTTCTAGG AGAACTCTCC GTACTAATAG CCATATTCTC TTGATCTAAC
ACCACTCTCC TACTCACAGG ATTCAACATA CTAATCACAG CCCTGTACTC
ACTACTATTA CACTCACCGG GCTCAACGTA CTAATCACGG CCCTATACTC
ACCACCCTTT TACTTACAGG ATCTAACATA CTAATCACAG CCCTGTACTC
ATCACTCTCC TACTTACAGG ACTCAACATA CTAGTCACAG CCCTATACTC
ATCACCATCC TACTAACAGG ACTCAACATA CTAATCACAA CCCTATACTC
CCTCTACATG TTTACCACAA CACAATGAGG CTCACTCACC CACCACATTA
CCTTTACATA TTTATCATAA CACAACGAGG CACACTTACA CACCACATTA
CCTTTATATA TTTACCACAA CACAATGAGG CCCACTCACA CACCACATCA
CCTCTACATA TTTACCACAA CACAATGAGG CTCACTCACC CACCACATTA
TCTCTATATA TTCACCACAA CACAACGAGG TACACCCACA CACCACATCA
ATAACATAAA GCCCTCATTC ACACGAGAAA ATACTCTCAT ATTTTTACAC
AAAACATAAA ACCCTCACTC ACACGAGAAA ACATATTAAT ACTTATGCAC
CCAACATAAA ACCCTCATTT ACACGAGAAA ACATCCTCAT ATTCATGCAC
ACAACATAAA ACCCTCATTC ACACGAGAAA ACACCCTCAT GTTCATACAC
ACAACATAAA ACCTTCTTTC ACACGCGAAA ATACCCTCAT GCTCATACAC
CTATCCCCCA TCCTCCTTCT ATCCCTCAAT CCTGATATCA TCACTGGATT
CTCTTCCCCC TCCTCCTCCT AACCCTCAAC CCTAACATCA TTACTGGCTT
CTATCCCCCA TCCTCCTCCT ATCCCTCAAC CCCGATATTA TCACCGGGTT
CTATCCCCCA TTCTCCTCCT ATCCCTCAAC CCCGACATCA TTACCGGGTT
CTATCCCCCA TCCTCCTCTT ATCCCTCAAC CCCAGCATCA TCGCTGGGTT
CACCTCCTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGACA
TACTCCCTGT AAACATAGTT TAATCAAAAC ATTAGATTGT GAATCTAACA
CACCTCCTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGATA
TTCCTCTTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGACA
CGCCTACTGT AAATATAGTT TAACCAAAAC ATTAGATTGT GAATCTAATA
ACAGAGGCTC ACGACCCCTT ATTTACCGAG AAAGCTTATA AGAACTGCTA
ATAGAGGCTC GAAACCTCTT GCTTACCGAG AAAGCCCACA AGAACTGCTA
ACAGAGGCTC ACAACCCCTT ATTTACCGAG AAAGCTCGTA AGAGCTGCTA
ACAGAGGCTT ACGACCCCTT ATTTACCGAG AAAGCTCACA AGAACTGCTA
ATAGGGCCCC ACAACCCCTT ATTTACCGAG AAAGCTCACA AGAACTGCTA
ATTCATATCC CCATGCCTGA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCACTATC CCATGTATGA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCATACCC CCGTGCTTGA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCATGCCC CCATGTCTAA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCNTCACT CCATGTGTGA CAACATGGCT TTCTCAGCTT TTAAAGGATA
ACAGCCATCC GTTGGTCTTA GGCCCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC ATTGGTCTTA GGACCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC ATTGGTCTTA GGACCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC ATTGGTCTTA GGCCCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC CTTGGTCTTA GGATCCAAAA ATTTTGGTGC AACTCCAAAT
AAAAGTAATA ACCATGTATA CTACCATAAC CACCTTAACC CTAACTCCCT
AAAAGTAATA GCAATGTACA CCACCATAGC CATTCTAACG CTAACCTCCC
AAAAGTAATA ACTATGTACG CTACCATAAC CACCTTAGCC CTAACTTCCT
AAAAGTAATA ACCATGCACA CTACTATAAC CACCCTAACC CTGACTTCCC
AAAAGTAACA GCCATGTTTA CCACCATAAC TGCCCTCACC TTAACTTCCC
TAATTCTCCC CATCCTCACC ACCCTCATTA ACCCTAACAA AAAAAACTCA
TAATTCCCCC CATTACAGCC ACCCTTATTA ACCCCAATAA AAAGAACTTA
TAATTCCCCC TATCCTTACC ACCTTCATCA ATCCTAACAA AAAAAGCTCA
TAATTCCCCC CATCCTTACC ACCCTCGTTA ACCCTAACAA AAAAAACTCA
TAATCCCCCC CATTACCGCT ACCCTCATTA ACCCCAACAA AAAAAACCCA
TATCCCCATT ATGTGAAATC CATTATCGCG TCCACCTTTA TCATTAGCCT
TACCCGCACT ACGTAAAAAT GACCATTGCC TCTACCTTTA TAATCAGCCT
TACCCCCATT ACGTAAAATC TATCGTCGCA TCCACCTTTA TCATCAGCCT
TACCCCCATT ATGTAAAATC CATTGTCGCA TCCACCTTTA TTATCAGTCT
TACCCCCACT ATGTAAAAAC GGCCATCGCA TCCGCCTTTA CTATCAGCCT
TTTCCCCACA ACAATATTCA TATGCCTAGA CCAAGAAGCT ATTATCTCAA
ATTTCCCACA ATAATATTCA TGTGCACAGA CCAAGAAACC ATTATTTCAA
CTTCCCCACA ACAATATTTC TATGCCTAGA CCAAGAAGCT ATTATCTCAA
CTTCCCCACA ACAATATTCA TGTGCCTAGA CCAAGAAGTT ATTATCTCGA
TATCCCAACA ACAATATTTA TCTGCCTAGG ACAAGAAACC ATCGTCACAA
ACTGGCACTG AGCAACAACC CAAACAACCC AGCTCTCCCT AAGCTT
ACTGACACTG AACTGCAACC CAAACGCTAG AACTCTCCCT AAGCTT
GCTGACACTG AGCAACAACC CAAACAATTC AACTCTCCCT AAGCTT
ACTGACACTG AGCCACAACC CAAACAACCC AGCTCTCCCT AAGCTT
ACTGATGCTG AACAACCACC CAGACACTAC AACTCTCACT AAGCTT
5 149
chimpanzeeSFTGAIILIIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAF
gibbonxxxxSFTGATVLIIAHGLTSSLLFCLANSNYERTHSRIIILSRGLQALLPLIAF
gorillaexxSFTGAVVLIIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAL
homosapienSFTGAVILIIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAF
orangutangSFTGATTLMIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAL
LLASLANLALPPTINLLGELSVLVTSFSSNTTLLLTGFNILITALYS
LAASLANLALPPTINLLGELFVLMASFSANTTITLTGLNVLITALYS
LLASLANLALPPTINLLGELSVLVTTFSSNTTLLLTGSNILITALYS
LLASLANLALPPTINLLGELSVLVTTFSSNITLLLTGLNILVTALYS
LLASLTNLALPPTINLLGELSVLIAIFSSNITILLTGLNILITTLYS
LYMFTTTQGSLTHHINNIKPSFTRENTLIFLHLSPILLLSLNPDIITGF
LYIFIITQGTLTHHIKNIKPSLTRENILILMHLFPLLLLTLNPNIITGF
LYIFTTTQGPLTHHITNIKPSFTRENILIFMHLSPILLLSLNPDIITGF
LYIFTTTQGSLTHHINNIKPSFTRENTLMFIHLSPILLLSLNPDIITGF
LYIFTTTQGTPTHHINNIKPSFTRENTLMLIHLSPILLLSLNPSIIAGF
TSC
TPC
TSC
SSC
AYC
31 1137
AdelphinicAGGCTACATCAGCGA
PhocenobacAGGCTACATCAGCGA
ActinomuriAGGCTACATCAGCGA
BistseventAGGCTACATCAGCGA
PlnagaaxxxAGGCTACATCAGCGA
PgallinaruAGGCTACATCAGCGA
AsuccinogeAGGCTACATCAGCGA
PaerogenesAGGCTACATCAGCGA
AporcinusxAGGCTACATCAGCGA
ArossiixxxAGGCTACATCAGCGA
AminorxxxxAGGCTACATCAGCGA
HparasuisxAGGCTACATCAGCGA
PmultocidaAGGCTACATCAGCGA
AcapsulatuAGGCTACATCAGCGA
HfelisxxxxAGGCTACATCAGCGA
PtrehalosiAGGCTACATCAGCGA
BistsevenxAGGCTACATCAGCGA
HhaemoglobAGGCTACATCAGCGA
BistfivexxAGGCTACATCAGCGA
HducreyixxAGGCTACATCAGCGA
PbettixxxxAGGCTACATCAGCGA
MhaemolytiAGGCTACATCAGCGA
AlignieresAGGCTACATCAGCGA
HinfleunzaAGGCTACATCAGCGA
AactinomycAGGCTACATCAGCGG
HparainfluAGGCTACATCAGCGA
LonekoalaxAGGCTACATCAGCGA
HsomnusxxxAGGCTACATCAGCGA
AsalpingitAGGCTACATCAGCGA
BistthrethAGGCTACATCAGCGA
PtestudiniAGGCTACATCAGCGA
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACTATTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCTATTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCGGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTACAGCCACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCTATTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
GCGGTACCTCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAATTACAGTGGCGGACGGG
gCGGTACAGCCACAGCGGCGGACGGG
TGAGTAGCTGGGATTTTCATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTCCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTACGGAGGGATAACGGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACGGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTTCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCAATGGAGGGATAGCAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTCCTATGGCGGGATAACAGGGAAC
TGTCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
GGTCTAATACCGCGTAGTCTGGACTAAAG
TGCCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
GATCTAATACCGCATAATCGAGATTAAAG
TGTCTAATACCGCGTAGTCTGGACTAAAG
TGTCTAATACCGCGTAGTCGAGACGAAAG
TGTCTAATACCGCGTAATCTAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAGTCTGGACTAGAG
TGCCTAATACCGCGTACTCTGGAGGAAAG
GGTCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCGAGATTAAAA
GGTCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
GATCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAGTCTGGACTAAAG
GGTCTAATACCGCGTAGTCTGGACTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTGTAATACCGCGTAGTCTGGACTAAAG
GATCTAATACCGCATAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATTAAAT
TGTCTAATACCGCGTAATCTACTTTAAAG
GGGCCATCCATGAAGTGAGCCCAAGTGGGATTAGATAG
GGGCCATCCATAAGATGAGCCCAAGTGAGATTAGGTAG
GCGACTTCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GCGACTTCCGTGAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAGGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCGCCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCGTCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATGGGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTAGTCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATGAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCGTCCATAGGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAGAATGAGCCCAAGTGTGATTAGGTAG
GGGCCATCCATAGGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATTTGATGAGCCCAAGTGGGATTAGGTAG
GGGGCATCCAAAGGATGAACCCAAGTGAGATTAGGTAG
GGGTCAACCATAAGATAAGCCCAAGTGAGATTAGCTAG
GGGTCACCCATGAGATGAGCCCAAGTGAGATTAGGTAG
TTGGTGAGGTAGGCTCACAGTCACGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCTGCGATCTCTAGCTGTTTGAG
TTGGTGAGGTAGGCTCACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTAGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTTAGGTAGGCTGACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTAAGGTAGGCTTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTTAGGTAGGCTGACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCCCCGATCTCTAGCTGTCTGAG
TTGGTTAGGTAGGCTGACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTATGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCGCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCTGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGGCGCGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCCGCGATCTCTAACTGTCTGAG
ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCCGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCCGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGTCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
CGCGTGGGAATTTGCCATGGGCGCAAGCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGCGCAAGCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGAAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGAAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGAGGGAACTCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCTTTCGGGTTGTAAAGTTCTTTCAGCATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTACGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGAGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTACGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGCGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTTTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGCGAGGA
GCGTGAATGAAGAAGTCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
AGGTGCGTTATAGACTTCTTGACGTTGCCACAGAAGAAGC
AGGGTATCTATAGTGAGATTGACGTTGTCGCAGAAGAAGC
AGGGAAGTTATACCGTAATTGACGTTGTCACAGAAGAAGC
AGGGTAGTTATACCTATATTGACGTTATCACAGAAGAAGC
AGGGACGTGATAGAATTATTGACGTTCCTACAGAAGAAGC
AGGGTAGTTATAGGCATTTTGACGTTGCCACAGAAGAAGC
AGGGATGTTACAGTTTTATTGACGTTGCCACAGAAGAAGC
AGGGTTTTTATAGCTATATTGACGTTATCACAGAAGAAGC
AGGGTTTTTATAGACAGATTGACGTTGTCACAGAAGAAGC
AGGATTTTTATAGGTGTATTGACGTTGTCACAGAAGAAGC
AGGGTTTTTATAGACATATTGACGTTGTCACAGAAGAAGC
AGGTATTTTATAGGCTTATTGACGTTGTCACAGAAGAAGC
AGGAGTTTAATAGTACAATTGACGTTATTACAGAAGAAGC
AGGGTTTTTATAGCTATATTGACGTTGTCACAGAAGAAGC
AGGTTAGTTATAGATACATTGACGTTACCACAGAAGAAGC
AGGGTTTTTATAGCAAGATTGACGTTGTCACAGAAGAAGC
AGGACGTTTATAGTTGGATTGACGTTATCACAGAAGAAGC
AGGTAGGTTATAGATTAATTGACGTTGTCACAGAAGAAGC
AGGTGTTTTATAGGCTCATTGACGTTGTCACAGAAGAAGC
AGGATTGTTATAGAAATATTGACGTTGTCACAGAAGAAGC
AGGAGTTTTATAGATCTATTGACGTTATCACAGAAGAAGC
AGGGTTTTTATAGACGTATTGACGTTATCACAGAAGAAGC
AGGACATTTATAGTTGGATTGACGTTACTACAGAAGAAGC
AGGTATGTTATAGACTCATTGACGTTAATACAGAAGAAGC
AGGTGTGTTATAGATCCATTGACGTTAATACAGAAGAAGC
AGGATTTTTATAGCTGGATTGACGTTACTACAGAAGAAGC
AGGATTTTTATAGCTAGATTGACGTTGTCACAGAAGAAGC
AGGGTTTTTAGAGTTATATTGACGATATCACAGAAGAAGC
AGGTATGTTATAGATTTTTTGACGTTGCCACAGAAGAAGC
AGGTCTTTAATAGGCGTATTGACGTTGCCACAGAAGAAGC
AGGGTGGTTATAGTTAAATTGACGTTGTCACAGAAGAAGC
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCACGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGGGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGGGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGCTAAGTCG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGCATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCATGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGTAGGCGGACTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTAG
TATCGAATACTGGGCGTAAAGGGCACGCAGGTGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGGCTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGAAATAAGTGG
GTGTGAAAGCCCGGCTACCTAGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTGAGACTGCAC
GTGTGAAAGCCCGGCCACCTGGGATTGCATTTCAGACTGCAC
GTGTGAAAGCCCGGCTACCTGGGATAGCATTTCAGACTGATC
ATGTGAAAGCCCGGCTACTTGGGATAGCATTTCATACTGCAC
ATGTGAAATCCCGGCTACTTGGGATTGCATTTCAGACTTTAC
ATGTGAAATCCCGGCTACCTGGGACTGCATTTCAGACTGTGC
GTGTGAAATCCCGGCTACCTGGGATTGCATTTCAGACTGCGC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCATACTGTGC
ATGTGAAATCCCGGCTACTTGGGACTGCATTTCAGACTGATC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCAGACTGATC
ATGTGAAATCCCGGCTACTTGGGATTGCATTTCAGACTGATC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCAGACTGATC
GTGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
GTGTGAAATCCCGGCTACCTGGGATTGCATTTCAGACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCAGACTGCAC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGCAC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCAGACTGCGC
GTGTGAAAGCCCGGCTACCTGGGATTGCATTTCAGACTACAC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGCAC
GTGTGAAAGCCCGGCTACCTAGGATTGCATTTCAGACTGTAC
GTGTGAAATCCCGGCTACCTGGGATTGCATTTCATACTGGTC
GTGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTTAGACTGTAC
GTGTGAAATCCCGGCTACCTGGGACAGCATTTCAGACTGTAC
ATGTGAAAGCCCGGCTACCTGGGACAGCATTTCATACTTGTC
GTGTGAAATCCCGGCTACCTAGGATTGCATTTCATACTGTAC
ATGTGAAATCCCGGCTACTTGGGACTGCATTTCATACTTTTC
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCATGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGAGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGAGACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGACGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAAATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAAATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCACCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGAGACT
GAGATGTGAGGAATACCGGAGGCGAAGGCGGCCCTTGGAAAGACT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTGAGGTGCAAGCGTGGGTAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATATGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTGATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCGAGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
AGTCCCGCTGTAACGATGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGGTGG
AGTCCCGCCGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGAGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGAGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGATGTTGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGATTGAGTGG
AGTCCCGCCGTAACGCTGTCGATTTGGGGATTGGCTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGGTGTCGATTTGGGGATTGGCTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGAGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGATTGGTTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGATGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGTTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGCTGG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCACCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGTTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CTCCGAAGCAACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGTTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CTCCGTAGCTACGCGTTAAATCGCCGCCGGGGAGTACGGCCGCAG
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTCA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTCA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTCA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAAC
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCGAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GAAGAGATAGGGGCCTTCGGAACCTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTTGAGACAGGTGCT
GAACAGATATTGGCCTTCGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTTGGAACTAGAGACAGGTGCT
GAACAGATATTGGCCTTCGGAGCTTGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAGCTAGAGACAGGTGCT
GAAGAGATACAGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATGGTGGCCTTCGGAGCTTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATAGAGGCCTTAGGAGCTTGAGACAGGTGCT
GAAGAGATAGTGGCCTTCGGAACTTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAAAGATATAGGCCTTTGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATGGTGGCCTTCGGAGCTTGAGACAGGTGCT
GAAAAGATAGAGGCCTTCGGAACGTGAGACAGGTGCT
GAAGAGATAGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAACCTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTTGTGACAGGTGCT
GAATAGATGGTGGCCTTCGGAGCTTGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAACCTGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAACCTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTAGGAGCTTGAGACAGGTGCT
GAACAGATGAGGGCCTTCGGAACTTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTTGAGACAGGTGCT
GAACAGATGTTGGCCTACGGAGCTAGAGACAGGTGCT
GGATAGATAATGGCTTTCAGAACTAGAGACAGGTGCT
GAATAGATAACGGCCGTAAGAGCTAGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGCGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCTAGCTTGCGAGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCGAAGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTAGAGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTTGGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTGTCCTTTGTTGCCATCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTTGAGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGGGGGAACTCAA
CGAGCGCACCCTTATTCTTTGTTACCAACTAGCGGGGACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCCTATCCTTTGTTGCCATCTAGTGGGAACTCAA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGACGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGACACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACCCCGGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGACGCTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGACACCGGAGGAAGGGGGATGACGTCAAGTCA
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
AGGGCAGCGGCGCGGGTAGCGAATCTCAGAAAGTACGTCTAAGT
AGGGCGGCGACGCGGGTAGCGAATCTCAGAAAGTACGTCTAAGT
AGGGTAGCGACGCGGGTAGCCAATCTCAGAAAGTGCATCTAAGT
AGGGGTGCGACGCTGTGAGCGAATCTCATAAAGTGCGTCTAAGT
AGGGCGGCGACGCGGGTAGCGAATCTCAGAAAGTGCATCGTAGT
AGGGGTGCGGCGCGGGTAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGAAACGGCGCGGGGAGTGAATCTCAGAAAGTGCGTCGTAGT
AGAGCGACGGCGCGGTGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCAACGCGGGAAGCGAATCTCAGAAAGTGCATCGTAGT
AGGGCGGCGACGCGGGTAGCGAATCTCAGAAAGTGCATCGTAGT
AGGGAAGCAATGCCATGAGCAAATCTCACAAAGTACGTCTAAGT
AGGGCGACGACGCGGTGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGCAGCGGGGCGCTTAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGCGACAACGCGGGGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGAAGCGGGGCGCTGAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGTAGCGGCGCTGGTAGCCAATCTCAGAAAGTGCATCTAAGT
AGGGTGGCGGCGCTGGTGGCGAATCTCTTAAAGTGCATCTAAGT
AGGGCGACGACGCTGGGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCGACGTTGCGAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCAACGCAGGGAGCGAATCTCACAAAGTACGTCTAAGT
AGGGAAGCGGCGCGGGGAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCGGCGCTGGGAGCGAATCTCAGAAAGTACGTCTAAGT
AGGGAAGCAGTGCCATGAGCAAATCTCACAAAGTACGTCTAAGT
AGGGAAGCGACGCGGTGAGCGAATCTCATAAAGTACGTCTAAGT
AGGGTAACCACGCTGGGAGTGAATCTCAGAAAGTACGTCTAAGT
AGGGAAGCGGGGCGCTGAGCGAATCTCACAAAGTACGTCTAAGT
AGGGAAGCGTCGCGGTGAGCGAATCTCACAAAGTGCATCTAAGT
AGGGAGGCGGCGCGGGTAGCGAAGCTCAGAAAGTACGTCTAAGT
AGGGAGGCGATGCTATGAGCGAAACTCAGAAAGTGCATCGTAGT
AGGGCAGCGGGGCGCTTAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGAAGCGGGGCGCTTAGCAAATCTCTTAAAGTGCGTCGTAGT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGCGTCTGCACTCGACGCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGGACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGCATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGGATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
Output files for phylogenetic exercises
DNA parsimony algorithm, version 3.573c
Name Sequences
---- ---------
seq1 AAG
seq2 ..A
seq3 GGA
seq4 .GA
One most parsimonious tree found:
+--seq4
+--3
+--2 +--seq3
! !
--1 +-----seq2
!
+--------seq1
remember: this is an unrooted tree!
requires a total of 3.000
steps in each site:
0 1 2 3 4 5 6 7 8 9
*-----------------------------------------
0! 1 1 1
>From To Any Steps? State at upper node
( . means same as in the node below it on tree)
1 AAR
1 2 maybe ..A
2 3 yes .G.
3 seq4 no ...
3 seq3 yes G..
2 seq2 no ...
1 seq1 maybe ..G
DNA parsimony algorithm, version 3.573c
One most parsimonious tree found:
+--------gorillaexx
+--2
! ! +-----homosapien
! +--3
--1 ! +--orangutang
! +--4
! +--gibbonxxxx
!
+-----------chimpanzee
remember: this is an unrooted tree!
requires a total of 330.000
Exercise 2b
Neighbor-Joining/UPGMA method version 3.573c
+-------gibbonxxxx
+--1
+--2 +-----orangutang
! !
! +---gorillaexx
!
--3-homosapien
!
+--chimpanzee
remember: this is an unrooted tree!
Between And Length
------- --- ------
3 2 0.00318
2 1 0.03598
1 gibbonxxxx 0.12602
1 orangutang 0.09198
2 gorillaexx 0.05777
3 homosapien 0.04015
3 chimpanzee 0.05195
Exercise 2c (dnaml program)
Nucleic acid sequence Maximum Likelihood method, version 3.573c
Empirical Base Frequencies:
A 0.30929
C 0.32750
G 0.10570
T(U) 0.25751
Transition/transversion ratio = 2.000000
(Transition/transversion parameter = 1.653039)
+-homosapien
+--2
! ! +-----orangutang
! +--3
! +-------gibbonxxxx
!
--1---gorillaexx
!
+--chimpanzee
remember: this is an unrooted tree!
Ln Likelihood = -2514.48557
Examined 17 trees
Between And Length Approx. Confidence Limits
------- --- ------ ------- ---------- ------
1 2 0.01720 ( 0.00594, 0.02847) **
2 homosapien 0.02875 ( 0.01514, 0.04262) **
2 3 0.05455 ( 0.03466, 0.07469) **
3 orangutang 0.09121 ( 0.06758, 0.11573) **
3 gibbonxxxx 0.13271 ( 0.10424, 0.16187) **
1 gorillaexx 0.06191 ( 0.04344, 0.08064) **
1 chimpanzee 0.05097 ( 0.03402, 0.06806) **
* = significantly positive, P < 0.05
** = significantly positive, P < 0.01
Exercise 2d
(Bootstrapping of parsimony)
Majority-rule and strict consensus tree program, version 3.573c
Species in order:
gorillaexx
homosapien
orangutang
gibbonxxxx
chimpanzee
Sets included in the consensus tree
Set (species in order) How many times out of 100.00
..**. 100.00
.***. 73.67
Sets NOT included in consensus tree:
Set (species in order) How many times out of 100.00
..*** 20.17
.*..* 6.17
CONSENSUS TREE:
the numbers at the forks indicate the number
of times the group consisting of the species
which are to the right of that fork occurred
among the trees, out of 100.00 trees
+----gibbonxxxx
+-100.0
+-73.7 +----orangutang
! !
+-100.0 +---------homosapien
! !
! +--------------chimpanzee
!
+-------------------gorillaexx
remember: this is an unrooted tree!
Protein parsimony algorithm, version 3.573c
One most parsimonious tree found:
+--homosapien
+-----3
! +--gorillaexx
+--2
! ! +--orangutang
--1 +-----4
! +--gibbonxxxx
!
+-----------chimpanzee
remember: this is an unrooted tree!
requires a total of 57.000
Neighbor-Joining/UPGMA method version 3.573c
Neighbor-joining method
Negative branch lengths allowed
+--------gibbonxxxx
+--1
! +-----orangutang
!
--3-gorillaexx
!
! +-chimpanzee
+--2
+-homosapien
remember: this is an unrooted tree!
Between And Length
------- --- ------
3 1 0.02121
1 gibbonxxxx 0.14023
1 orangutang 0.08913
3 gorillaexx 0.03381
3 2 0.00696
2 chimpanzee 0.02877
2 homosapien 0.03076
Results are given for neighbor joining, as well as sample results for a run of maximum parsimony and maximum
likelihood, corresponding to the order of sequences in the data set.
Note that latter results depend strongly on that order. In a data set of this
size an exhaustive search is impossible, but randomizing the sequence
will give different values, among which the best ones can be chosen. For
parsimony the value obtained (827 substitutions) is quite good. For
maximum likelihood the log likelihood value can be improved up to at least
values around -6622.3.
Exercise 4: Neighbor joining
31 Populations
Neighbor-Joining/UPGMA method version 3.573c
Neighbor-joining method
Negative branch lengths allowed
+Actinomuri
+-16
! ! +Bistfivexx
! +--4
+-28 +Mhaemolyti
! !
! ! +Ptrehalosi
! +-27
! +Pbettixxxx
!
! +-Bistsevent
! +-13
! ! ! +Hinfleunza
! ! +--8
! +-20 +Aactinomyc
! ! !
! ! ! +-Asuccinoge
! ! ! +--9
! ! +-15 +-Hsomnusxxx
! ! !
! ! +Paerogenes
! +-23
! ! ! +Pgallinaru
! ! ! +--6
! ! ! ! +Bistthreth
! ! ! +-11
! ! ! ! ! +-Asalpingit
! ! ! ! +--2
-29-24 +-21 +-Ptestudini
! ! !
! ! ! +Pmultocida
! ! +--5
! ! +Hfelisxxxx
! !
! ! +Hparasuisx
! +-17
! ! +Acapsulatu
! +-14
! +Hhaemoglob
!
! +-Adelphinic
! +-10
! ! ! +Phocenobac
! ! +--1
! +-19 +Hducreyixx
! ! !
! ! ! +Aminorxxxx
! ! +-12
! ! ! +Alignieres
! +-25 +--3
! ! ! +Hparainflu
! ! !
! ! ! +-Plnagaaxxx
! ! ! +--7
+-26 +-18 +Arossiixxx
! !
! +Aporcinusx
!
! +Bistsevenx
+-22
+Lonekoalax
remember: this is an unrooted tree!
Between And Length
------- --- ------
29 28 0.00041
28 16 0.00318
16 Actinomuri 0.01928
16 4 0.00272
4 Bistfivexx 0.01366
4 Mhaemolyti 0.01594
28 27 0.00069
27 Ptrehalosi 0.01605
27 Pbettixxxx 0.01465
29 24 0.00152
24 23 0.00042
23 20 0.00157
20 13 0.00285
13 Bistsevent 0.02478
13 8 0.00197
8 Hinfleunza 0.01359
8 Aactinomyc 0.03031
20 15 0.00198
15 9 0.00251
9 Asuccinoge 0.01980
9 Hsomnusxxx 0.03060
15 Paerogenes 0.01609
23 21 0.00139
21 11 0.00341
11 6 0.00214
6 Pgallinaru 0.01507
6 Bistthreth 0.02603
11 2 0.00443
2 Asalpingit 0.03543
2 Ptestudini 0.03827
21 5 0.00387
5 Pmultocida 0.01569
5 Hfelisxxxx 0.01131
24 17 0.00226
17 Hparasuisx 0.01199
17 14 0.00156
14 Acapsulatu 0.01477
14 Hhaemoglob 0.01133
29 26 0.00090
26 25 0.00089
25 19 0.00219
19 10 0.00288
10 Adelphinic 0.02672
10 1 0.00443
1 Phocenobac 0.02806
1 Hducreyixx 0.01204
19 12 0.00132
12 Aminorxxxx 0.01064
12 3 0.00461
3 Alignieres 0.01688
3 Hparainflu 0.01562
25 18 0.00207
18 7 0.00301
7 Plnagaaxxx 0.02251
7 Arossiixxx 0.01099
18 Aporcinusx 0.01284
26 22 0.00172
22 Bistsevenx 0.01834
22 Lonekoalax 0.01606
Exercise 4: Parsimony
DNA parsimony algorithm, version 3.573c
One most parsimonious tree found:
+--Hparainflu
+-25
+-------------------------------------------------------------------------22 +--Alignieres
! !
! +-----Aminorxxxx
!
! +-----Aporcinusx
! +-----------------------------------------------------------------8
! ! ! +--Arossiixxx
! ! +--9
! ! +--Plnagaaxxx
+----10 !
! ! ! +-----Pbettixxxx
! ! ! +----------------------------------------------20
! ! ! ! ! +--Lonekoalax
! ! ! ! +-26
! ! ! ! +--Bistsevenx
! ! ! !
! ! ! +-------16 +--------Hparasuisx
! ! ! ! ! +----------------------------------11
! +--------4 ! ! ! ! +-----Paerogenes
! ! ! ! ! +--7
! ! ! ! ! ! +--Hsomnusxxx
! ! ! +--------6 +-27
! ! ! ! +--Asuccinoge
! ! ! !
! ! ! ! +--------------------------------Hhaemoglob
! ! ! ! !
! ! ! +----------17 +-----------------------------Acapsulatu
! ! ! ! !
! ! ! ! ! +--------------------------Ptrehalosi
! ! ! +-13 !
! ! ! ! ! +--------------Pmultocida
! ! ! ! ! !
+--2 +--------3 ! ! +-------12 +-----------Hfelisxxxx
! ! ! +-15 ! ! !
! ! ! ! ! +-14 +--Ptestudini
! ! ! ! ! ! +----30
! ! ! ! ! ! ! +--Asalpingit
! ! ! ! ! +-28
! ! ! +--5 ! +--Bistthreth
! ! ! ! +----29
! ! ! ! +--Pgallinaru
! ! ! !
! ! ! ! +--Aactinomyc
! ! ! ! +-24
! ! ! +----------------23 +--Hinfleunza
--1 ! ! !
! ! ! +-----Bistsevent
! ! !
! ! ! +-----Bistfivexx
! ! +-------------------------------------------------------18
! ! ! +--Mhaemolyti
! ! +-21
! ! +--Actinomuri
! !
! ! +--Hducreyixx
! +----------------------------------------------------------------------------------19
! +--Phocenobac
!
+-----------------------------------------------------------------------------------------Adelphinic
remember: this is an unrooted tree!
requires a total of 827.000
Exercise 4: Maximum likelihood
fastDNAml, version 1.2.1, March 9, 1998
Based on Joseph Felsenstein's
Nucleic acid sequence Maximum Likelihood method, version 3.3
31 Species, 1137 Sites
...
Total weight of positions in analysis = 1137
There are 186 distinct data patterns (columns)
Empirical Base Frequencies:
A 0.23894
C 0.21466
G 0.32715
T(U) 0.21925
Transition/transversion ratio = 2.000000
(Transition/transversion parameter = 1.484598)
...
Examined 1887 trees
+ Hducreyixx
+--------18
! +--------- Phocenobac
!
! +--------- Hparainflu
! +--------24
! +--------21 + Alignieres
! ! !
! ! + Aminorxxxx
! !
! ! +--------- Lonekoalax
---1 ! +-25
! ! ! +--------- Bistsevenx
! ! !
! ! ! +--------- Hfelisxxxx
! ! ! +-13
! ! ! ! +--------- Pmultocida
! ! ! +--------11
! ! ! ! ! +-------------------- Aactinomyc
! ! ! ! ! +--------23
! ! ! ! +-22 + Hinfleunza
! ! ! +-14 !
+--9 +-15 ! ! +-------------------- Bistsevent
! ! ! ! ! !
! ! ! ! ! ! + Ptrehalosi
! ! ! ! +--2 +--------12
! ! ! ! ! ! + Acapsulatu
! ! ! ! ! !
! ! ! ! ! ! +--------- Hhaemoglob
! ! ! ! ! ! !
! ! ! ! ! +-16 +--------- Bistthreth
! ! ! ! ! ! +--------28
! ! ! ! ! ! ! ! +------------------------------ Ptestudini
! ! ! ! ! +--------27 +-29
! ! ! +--7 ! +-------------------- Asalpingit
! ! ! ! !
! ! ! ! +--------- Pgallinaru
! ! ! !
! ! ! ! +--------- Pbettixxxx
! +---------3 ! +-19
! ! ! ! ! +--------- Mhaemolyti
! ! ! +-17 +--------20
! ! ! ! ! +--------- Bistfivexx
! ! ! ! !
! ! +--5 +-------------------- Actinomuri
! ! !
! ! ! + Hparasuisx
! ! ! !
! ! +---------6 +-------------------- Hsomnusxxx
! ! ! +--------26
! ! +-10 +--------- Asuccinoge
! ! !
! ! +--------- Paerogenes
! !
! ! +--------- Arossiixxx
! ! +--8
! +--4 +-------------------- Plnagaaxxx
! !
! +--------- Aporcinusx
!
+-------------------- Adelphinic
Remember: this is an unrooted tree!
Ln Likelihood = -6644.56672
Between And Length Approx. Confidence Limits
------- --- ------ ------- ---------- ------
1 18 0.00837 ( 0.00272, 0.01407) **
18 Hducreyixx 0.01213 ( 0.00533, 0.01900) **
18 Phocenobac 0.02807 ( 0.01797, 0.03834) **
1 9 0.00684 ( 0.00150, 0.01222) **
9 21 0.00693 ( 0.00179, 0.01212) **
21 24 0.01251 ( 0.00563, 0.01947) **
24 Hparainflu 0.02188 ( 0.01292, 0.03097) **
24 Alignieres 0.01167 ( 0.00497, 0.01843) **
21 Aminorxxxx 0.00367 ( zero, 0.00769) **
9 3 0.00386 ( 0.00007, 0.00767) **
3 15 0.00266 ( zero, 0.00623) **
15 25 0.00636 ( 0.00123, 0.01153) **
25 Lonekoalax 0.01693 ( 0.00897, 0.02499) **
25 Bistsevenx 0.01945 ( 0.01093, 0.02809) **
15 7 0.00290 ( zero, 0.00640) **
7 2 0.00404 ( zero, 0.00816) **
2 14 0.00388 ( 0.00012, 0.00765) **
14 11 0.00602 ( 0.00053, 0.01155) **
11 13 0.00568 ( 0.00109, 0.01031) **
13 Hfelisxxxx 0.00946 ( 0.00337, 0.01562) **
13 Pmultocida 0.01799 ( 0.00992, 0.02618) **
11 22 0.00768 ( 0.00185, 0.01357) **
22 23 0.00741 ( 0.00174, 0.01312) **
23 Aactinomyc 0.03562 ( 0.02422, 0.04724) **
23 Hinfleunza 0.00998 ( 0.00354, 0.01649) **
22 Bistsevent 0.02472 ( 0.01521, 0.03440) **
14 12 0.00211 ( zero, 0.00562) *
12 Ptrehalosi 0.01503 ( 0.00773, 0.02242) **
12 Acapsulatu 0.01437 ( 0.00727, 0.02155) **
2 16 0.00362 ( zero, 0.00778) **
16 Hhaemoglob 0.01116 ( 0.00467, 0.01773) **
16 27 0.01046 ( 0.00415, 0.01683) **
27 28 0.00786 ( 0.00190, 0.01388) **
28 Bistthreth 0.02454 ( 0.01477, 0.03448) **
28 29 0.01170 ( 0.00472, 0.01875) **
29 Ptestudini 0.04200 ( 0.02943, 0.05485) **
29 Asalpingit 0.03249 ( 0.02130, 0.04389) **
27 Pgallinaru 0.01283 ( 0.00565, 0.02010) **
7 5 0.00289 ( zero, 0.00630) **
5 17 0.00249 ( zero, 0.00594) **
17 19 0.00323 ( zero, 0.00689) **
19 Pbettixxxx 0.01382 ( 0.00665, 0.02107) **
19 20 0.00930 ( 0.00335, 0.01532) **
20 Mhaemolyti 0.01693 ( 0.00912, 0.02485) **
20 Bistfivexx 0.01281 ( 0.00595, 0.01974) **
17 Actinomuri 0.02377 ( 0.01457, 0.03311) **
5 6 0.00782 ( 0.00242, 0.01328) **
6 Hparasuisx 0.00721 ( 0.00182, 0.01265) **
6 10 0.00830 ( 0.00222, 0.01444) **
10 26 0.00706 ( 0.00159, 0.01259) **
26 Hsomnusxxx 0.03306 ( 0.02193, 0.04440) **
26 Asuccinoge 0.01859 ( 0.01002, 0.02728) **
10 Paerogenes 0.01708 ( 0.00890, 0.02537) **
3 4 0.00517 ( 0.00069, 0.00967) **
4 8 0.00667 ( 0.00147, 0.01192) **
8 Arossiixxx 0.00578 ( 0.00103, 0.01058) **
8 Plnagaaxxx 0.02878 ( 0.01874, 0.03899) **
4 Aporcinusx 0.01098 ( 0.00468, 0.01734) **
1 Adelphinic 0.02925 ( 0.01893, 0.03976) **
* = significantly positive, P < 0.05
** = significantly positive, P < 0.01
Ib Skovgaard
(ims@kvl.dk) 2002-06-17