Phylogeny page for the 3-week course on Bioinformatics

Links for Phylogeny lecture and exercises


Acknowledgement

This page is a revised version of the page made for the 2001 course by Henrik Stryhn.

Data sets for Phylogeny exercises

Exercise 1

4 3
seq1      AAG
seq2      AAA
seq3      GGA
seq4      AGA
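
The Exercise 1 data set is in sequential PHYLIP format: a header line with the number of sequences and the number of sites, then one line per sequence with the name in the first 10 columns. As a quick sanity check before running any tree-building program, the following minimal Python sketch reads the data and counts the mismatching sites for every pair of sequences. The function names `parse_sequential_phylip` and `mismatch_count` are hypothetical helpers introduced here for illustration; they are not part of the course material or of PHYLIP.

```python
from itertools import combinations

# Exercise 1 data, copied from above (names occupy the first 10 columns).
PHYLIP_TEXT = """4 3
seq1      AAG
seq2      AAA
seq3      GGA
seq4      AGA
"""


def parse_sequential_phylip(text):
    """Return {name: sequence}; raise ValueError if the header does not match."""
    lines = [line for line in text.splitlines() if line.strip()]
    n_taxa, n_sites = (int(field) for field in lines[0].split())
    alignment = {}
    for line in lines[1:]:
        name = line[:10].strip()               # fixed-width name field
        sequence = line[10:].replace(" ", "")  # drop any spacing inside the sequence
        alignment[name] = sequence
    if len(alignment) != n_taxa:
        raise ValueError(f"header says {n_taxa} sequences, found {len(alignment)}")
    for name, sequence in alignment.items():
        if len(sequence) != n_sites:
            raise ValueError(f"{name}: header says {n_sites} sites, found {len(sequence)}")
    return alignment


def mismatch_count(a, b):
    """Number of aligned positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


alignment = parse_sequential_phylip(PHYLIP_TEXT)
distances = {
    (a, b): mismatch_count(alignment[a], alignment[b])
    for a, b in combinations(alignment, 2)
}
for (a, b), d in distances.items():
    print(a, b, d)
```

The raw mismatch count is only the simplest possible distance; it makes no correction for multiple substitutions at the same site.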

Exercise 2

  5 896
chimpanzeeAAGCTTCACC GGCGCAATTA TCCTCATAAT CGCCCACGGA CTTACATCCT
gibbonxxxxAAGCTTTACA GGTGCAACCG TCCTCATAAT CGCCCACGGA CTAACCTCTT
gorillaexxAAGCTTCACC GGCGCAGTTG TTCTTATAAT TGCCCACGGA CTTACATCAT
homosapienAAGCTTCACC GGCGCAGTCA TTCTCATAAT CGCCCACGGA CTTACATCCT
orangutangAAGCTTCACC GGCGCAACCA CCCTCATGAT TGCCCATGGA CTCACATCCT

CATTATTATT CTGCCTAGCA AACTCAAATT ATGAACGCAC CCACAGTCGC
CCCTGCTATT CTGCCTTGCA AACTCAAACT ACGAACGAAC TCACAGCCGC
CATTATTATT CTGCCTAGCA AACTCAAACT ACGAACGAAC CCACAGCCGC
CATTACTATT CTGCCTAGCA AACTCAAACT ACGAACGCAC TCACAGTCGC
CCCTACTGTT CTGCCTAGCA AACTCAAACT ACGAACGAAC CCACAGCCGC

ATCATAATTC TCTCCCAAGG ACTTCAAACT CTACTCCCAC TAATAGCCTT
ATCATAATCC TATCTCGAGG GCTCCAAGCC TTACTCCCAC TGATAGCCTT
ATCATAATTC TCTCTCAAGG ACTCCAAACC CTACTCCCAC TAATAGCCCT
ATCATAATCC TCTCTCAAGG ACTTCAAACT CTACTCCCAC TAATAGCTTT
ATCATAATCC TCTCTCAAGG CCTTCAAACT CTACTCCCCC TAATAGCCCT

TTGATGACTC CTAGCAAGCC TCGCTAACCT CGCCCTACCC CCTACCATTA
CTGATGACTC GCAGCAAGCC TCGCTAACCT CGCCCTACCC CCCACTATTA
TTGATGACTT CTGGCAAGCC TCGCCAACCT CGCCTTACCC CCCACCATTA
TTGATGACTT CTAGCAAGCC TCGCTAACCT CGCCTTACCC CCCACTATTA
CTGATGACTT CTAGCAAGCC TCACTAACCT TGCCCTACCA CCCACCATCA

ATCTCCTAGG GGAACTCTCC GTGCTAGTAA CCTCATTCTC CTGATCAAAT
ACCTCCTAGG TGAACTCTTC GTACTAATGG CCTCCTTCTC CTGGGCAAAC
ACCTACTAGG AGAGCTCTCC GTACTAGTAA CCACATTCTC CTGATCAAAT
ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACATTCTC CTGATCAAAT
ACCTTCTAGG AGAACTCTCC GTACTAATAG CCATATTCTC TTGATCTAAC

ACCACTCTCC TACTCACAGG ATTCAACATA CTAATCACAG CCCTGTACTC
ACTACTATTA CACTCACCGG GCTCAACGTA CTAATCACGG CCCTATACTC
ACCACCCTTT TACTTACAGG ATCTAACATA CTAATCACAG CCCTGTACTC
ATCACTCTCC TACTTACAGG ACTCAACATA CTAGTCACAG CCCTATACTC
ATCACCATCC TACTAACAGG ACTCAACATA CTAATCACAA CCCTATACTC

CCTCTACATG TTTACCACAA CACAATGAGG CTCACTCACC CACCACATTA
CCTTTACATA TTTATCATAA CACAACGAGG CACACTTACA CACCACATTA
CCTTTATATA TTTACCACAA CACAATGAGG CCCACTCACA CACCACATCA
CCTCTACATA TTTACCACAA CACAATGAGG CTCACTCACC CACCACATTA
TCTCTATATA TTCACCACAA CACAACGAGG TACACCCACA CACCACATCA

ATAACATAAA GCCCTCATTC ACACGAGAAA ATACTCTCAT ATTTTTACAC
AAAACATAAA ACCCTCACTC ACACGAGAAA ACATATTAAT ACTTATGCAC
CCAACATAAA ACCCTCATTT ACACGAGAAA ACATCCTCAT ATTCATGCAC
ACAACATAAA ACCCTCATTC ACACGAGAAA ACACCCTCAT GTTCATACAC
ACAACATAAA ACCTTCTTTC ACACGCGAAA ATACCCTCAT GCTCATACAC

CTATCCCCCA TCCTCCTTCT ATCCCTCAAT CCTGATATCA TCACTGGATT
CTCTTCCCCC TCCTCCTCCT AACCCTCAAC CCTAACATCA TTACTGGCTT
CTATCCCCCA TCCTCCTCCT ATCCCTCAAC CCCGATATTA TCACCGGGTT
CTATCCCCCA TTCTCCTCCT ATCCCTCAAC CCCGACATCA TTACCGGGTT
CTATCCCCCA TCCTCCTCTT ATCCCTCAAC CCCAGCATCA TCGCTGGGTT

CACCTCCTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGACA
TACTCCCTGT AAACATAGTT TAATCAAAAC ATTAGATTGT GAATCTAACA
CACCTCCTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGATA
TTCCTCTTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGACA
CGCCTACTGT AAATATAGTT TAACCAAAAC ATTAGATTGT GAATCTAATA

ACAGAGGCTC ACGACCCCTT ATTTACCGAG AAAGCTTATA AGAACTGCTA
ATAGAGGCTC GAAACCTCTT GCTTACCGAG AAAGCCCACA AGAACTGCTA
ACAGAGGCTC ACAACCCCTT ATTTACCGAG AAAGCTCGTA AGAGCTGCTA
ACAGAGGCTT ACGACCCCTT ATTTACCGAG AAAGCTCACA AGAACTGCTA
ATAGGGCCCC ACAACCCCTT ATTTACCGAG AAAGCTCACA AGAACTGCTA

ATTCATATCC CCATGCCTGA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCACTATC CCATGTATGA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCATACCC CCGTGCTTGA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCATGCCC CCATGTCTAA CAACATGGCT TTCTCAACTT TTAAAGGATA
ACTCNTCACT CCATGTGTGA CAACATGGCT TTCTCAGCTT TTAAAGGATA

ACAGCCATCC GTTGGTCTTA GGCCCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC ATTGGTCTTA GGACCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC ATTGGTCTTA GGACCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC ATTGGTCTTA GGCCCCAAAA ATTTTGGTGC AACTCCAAAT
ACAGCTATCC CTTGGTCTTA GGATCCAAAA ATTTTGGTGC AACTCCAAAT

AAAAGTAATA ACCATGTATA CTACCATAAC CACCTTAACC CTAACTCCCT
AAAAGTAATA GCAATGTACA CCACCATAGC CATTCTAACG CTAACCTCCC
AAAAGTAATA ACTATGTACG CTACCATAAC CACCTTAGCC CTAACTTCCT
AAAAGTAATA ACCATGCACA CTACTATAAC CACCCTAACC CTGACTTCCC
AAAAGTAACA GCCATGTTTA CCACCATAAC TGCCCTCACC TTAACTTCCC

TAATTCTCCC CATCCTCACC ACCCTCATTA ACCCTAACAA AAAAAACTCA
TAATTCCCCC CATTACAGCC ACCCTTATTA ACCCCAATAA AAAGAACTTA
TAATTCCCCC TATCCTTACC ACCTTCATCA ATCCTAACAA AAAAAGCTCA
TAATTCCCCC CATCCTTACC ACCCTCGTTA ACCCTAACAA AAAAAACTCA
TAATCCCCCC CATTACCGCT ACCCTCATTA ACCCCAACAA AAAAAACCCA

TATCCCCATT ATGTGAAATC CATTATCGCG TCCACCTTTA TCATTAGCCT
TACCCGCACT ACGTAAAAAT GACCATTGCC TCTACCTTTA TAATCAGCCT
TACCCCCATT ACGTAAAATC TATCGTCGCA TCCACCTTTA TCATCAGCCT
TACCCCCATT ATGTAAAATC CATTGTCGCA TCCACCTTTA TTATCAGTCT
TACCCCCACT ATGTAAAAAC GGCCATCGCA TCCGCCTTTA CTATCAGCCT

TTTCCCCACA ACAATATTCA TATGCCTAGA CCAAGAAGCT ATTATCTCAA
ATTTCCCACA ATAATATTCA TGTGCACAGA CCAAGAAACC ATTATTTCAA
CTTCCCCACA ACAATATTTC TATGCCTAGA CCAAGAAGCT ATTATCTCAA
CTTCCCCACA ACAATATTCA TGTGCCTAGA CCAAGAAGTT ATTATCTCGA
TATCCCAACA ACAATATTTA TCTGCCTAGG ACAAGAAACC ATCGTCACAA

ACTGGCACTG AGCAACAACC CAAACAACCC AGCTCTCCCT AAGCTT
ACTGACACTG AACTGCAACC CAAACGCTAG AACTCTCCCT AAGCTT
GCTGACACTG AGCAACAACC CAAACAATTC AACTCTCCCT AAGCTT
ACTGACACTG AGCCACAACC CAAACAACCC AGCTCTCCCT AAGCTT
ACTGATGCTG AACAACCACC CAGACACTAC AACTCTCACT AAGCTT

Exercise 3

  5  149
chimpanzeeSFTGAIILIIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAF
gibbonxxxxSFTGATVLIIAHGLTSSLLFCLANSNYERTHSRIIILSRGLQALLPLIAF
gorillaexxSFTGAVVLIIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAL
homosapienSFTGAVILIIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAF
orangutangSFTGATTLMIAHGLTSSLLFCLANSNYERTHSRIIILSQGLQTLLPLIAL

LLASLANLALPPTINLLGELSVLVTSFSSNTTLLLTGFNILITALYS
LAASLANLALPPTINLLGELFVLMASFSANTTITLTGLNVLITALYS
LLASLANLALPPTINLLGELSVLVTTFSSNTTLLLTGSNILITALYS
LLASLANLALPPTINLLGELSVLVTTFSSNITLLLTGLNILVTALYS
LLASLTNLALPPTINLLGELSVLIAIFSSNITILLTGLNILITTLYS

LYMFTTTQGSLTHHINNIKPSFTRENTLIFLHLSPILLLSLNPDIITGF
LYIFIITQGTLTHHIKNIKPSLTRENILILMHLFPLLLLTLNPNIITGF
LYIFTTTQGPLTHHITNIKPSFTRENILIFMHLSPILLLSLNPDIITGF
LYIFTTTQGSLTHHINNIKPSFTRENTLMFIHLSPILLLSLNPDIITGF
LYIFTTTQGTPTHHINNIKPSFTRENTLMLIHLSPILLLSLNPSIIAGF

TSC
TPC
TSC
SSC
AYC
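
The data sets for Exercises 2 to 4 use the interleaved PHYLIP layout: the first block carries the 10-column names, and every following block continues the sequences in the same order. The sketch below shows one way to reassemble such a file and check it against the header. To keep it short it uses a small excerpt built from the Exercise 3 data (the first 10 residues of the first block plus the final 3-residue block, so the header reads `5 13`); the excerpt and the function name `parse_interleaved_phylip` are illustrative assumptions, not part of the course material.

```python
# Excerpt of the Exercise 3 data: first 10 residues of block 1 plus the last block.
EXCERPT = """  5  13
chimpanzeeSFTGAIILII
gibbonxxxxSFTGATVLII
gorillaexxSFTGAVVLII
homosapienSFTGAVILII
orangutangSFTGATTLMI

TSC
TPC
TSC
SSC
AYC
"""


def parse_interleaved_phylip(text):
    """Return {name: sequence} from interleaved PHYLIP text, checking the header."""
    lines = [line.rstrip() for line in text.splitlines() if line.strip()]
    n_taxa, n_sites = (int(field) for field in lines[0].split())
    body = lines[1:]
    if len(body) % n_taxa != 0:
        raise ValueError("number of data lines is not a multiple of the taxon count")
    names, parts = [], []
    for index, line in enumerate(body):
        if index < n_taxa:
            # First block: name in columns 1-10, sequence after that.
            names.append(line[:10].strip())
            parts.append([line[10:].replace(" ", "")])
        else:
            # Later blocks: sequence only, same taxon order as the first block.
            parts[index % n_taxa].append(line.replace(" ", ""))
    alignment = {name: "".join(chunks) for name, chunks in zip(names, parts)}
    for name, sequence in alignment.items():
        if len(sequence) != n_sites:
            raise ValueError(f"{name}: header says {n_sites} sites, found {len(sequence)}")
    return alignment


alignment = parse_interleaved_phylip(EXCERPT)
for name, sequence in alignment.items():
    print(name, len(sequence), sequence)
```

Running the same check on a complete data set is a cheap way to catch a header whose site count does not match the sequences that follow it.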

Exercise 4

31 1137
AdelphinicAGGCTACATCAGCGA
PhocenobacAGGCTACATCAGCGA
ActinomuriAGGCTACATCAGCGA
BistseventAGGCTACATCAGCGA
PlnagaaxxxAGGCTACATCAGCGA
PgallinaruAGGCTACATCAGCGA
AsuccinogeAGGCTACATCAGCGA
PaerogenesAGGCTACATCAGCGA
AporcinusxAGGCTACATCAGCGA
ArossiixxxAGGCTACATCAGCGA
AminorxxxxAGGCTACATCAGCGA
HparasuisxAGGCTACATCAGCGA
PmultocidaAGGCTACATCAGCGA
AcapsulatuAGGCTACATCAGCGA
HfelisxxxxAGGCTACATCAGCGA
PtrehalosiAGGCTACATCAGCGA
BistsevenxAGGCTACATCAGCGA
HhaemoglobAGGCTACATCAGCGA
BistfivexxAGGCTACATCAGCGA
HducreyixxAGGCTACATCAGCGA
PbettixxxxAGGCTACATCAGCGA
MhaemolytiAGGCTACATCAGCGA
AlignieresAGGCTACATCAGCGA
HinfleunzaAGGCTACATCAGCGA
AactinomycAGGCTACATCAGCGG
HparainfluAGGCTACATCAGCGA
LonekoalaxAGGCTACATCAGCGA
HsomnusxxxAGGCTACATCAGCGA
AsalpingitAGGCTACATCAGCGA
BistthrethAGGCTACATCAGCGA
PtestudiniAGGCTACATCAGCGA

ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACTATTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCTATTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCGGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTACAGCCACAGTGGCGGACGGG
ACGGTGCAGCTACAGTGGCGGACGGG
ACGGTGCTATTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
GCGGTACCTCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAGCTACAGTGGCGGACGGG
ACGGTACAATTACAGTGGCGGACGGG
gCGGTACAGCCACAGCGGCGGACGGG

TGAGTAGCTGGGATTTTCATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTCCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTACGGAGGGATAACGGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACGGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTTCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTGCAATGGAGGGATAGCAGGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACATGGAAC
TGAGTAGCTGGGATTGCTATGGAGGGATAACAGGGAAC
TGAGTAGCTGGGATTCCTATGGCGGGATAACAGGGAAC

TGTCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
GGTCTAATACCGCGTAGTCTGGACTAAAG
TGCCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
GATCTAATACCGCATAATCGAGATTAAAG
TGTCTAATACCGCGTAGTCTGGACTAAAG
TGTCTAATACCGCGTAGTCGAGACGAAAG
TGTCTAATACCGCGTAATCTAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAGTCTGGACTAGAG
TGCCTAATACCGCGTACTCTGGAGGAAAG
GGTCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCGAGATTAAAA
GGTCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
GATCTAATACCGCGTAATCGAGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAGTCTGGACTAAAG
GGTCTAATACCGCGTAGTCTGGACTAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATGAAAG
TGTCTAATACCGCGTAATCTGGATTAAAG
TGTGTAATACCGCGTAGTCTGGACTAAAG
GATCTAATACCGCATAATCGAGATGAAAG
TGTCTAATACCGCGTAATCGAGATTAAAT
TGTCTAATACCGCGTAATCTACTTTAAAG

GGGCCATCCATGAAGTGAGCCCAAGTGGGATTAGATAG
GGGCCATCCATAAGATGAGCCCAAGTGAGATTAGGTAG
GCGACTTCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GCGACTTCCGTGAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAGGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCGCCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCGTCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATGGGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTAGTCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATGAGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCGTCCATAGGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAGAATGAGCCCAAGTGTGATTAGGTAG
GGGCCATCCATAGGATGAGCCCAAGTGGGATTAGGTAG
GGGCCATCCATAAGATGAGCCCAAGTGGGATTAGGTAG
GGGTCATCCATTTGATGAGCCCAAGTGGGATTAGGTAG
GGGGCATCCAAAGGATGAACCCAAGTGAGATTAGGTAG
GGGTCAACCATAAGATAAGCCCAAGTGAGATTAGCTAG
GGGTCACCCATGAGATGAGCCCAAGTGAGATTAGGTAG

TTGGTGAGGTAGGCTCACAGTCACGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCTGCGATCTCTAGCTGTTTGAG
TTGGTGAGGTAGGCTCACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTAGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTTAGGTAGGCTGACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTAAGGTAGGCTTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTTAGGTAGGCTGACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCCCCGATCTCTAGCTGTCTGAG
TTGGTTAGGTAGGCTGACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTATGCCTACAGCCTCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCGCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCTGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGCCGCGATCTCTAGCTGTCTGAG
TTGGTGGGGTAGGCCTACAGGCGCGATCTCTAGCTGTCTGAG
TTGGTGAGGTAGGCTCACAGCCGCGATCTCTAACTGTCTGAG

ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCCGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCCGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGAACTGAGACCGGTCCGCTTACGGGAGG
ATGGCCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGCCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG
ATGACCAGTCCCTGGGACTGAGACCGGCCCGCTTACGGGAGG

CGCGTGGGAATTTGCCATGGGCGCAAGCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGCGCAAGCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGAAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGAAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGATGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGGAACCCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGAGGGAACTCTGACGCAGCATGCC
CGCGTGGGAATTTGCCATGGGGGCAACCCTGACGCAGCATGCC

GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCTTTCGGGTTGTAAAGTTCTTTCAGCATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTACGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGAGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTACGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGCGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTTTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGCGAGGA
GCGTGAATGAAGAAGTCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTGTGAGGA
GCGTGAATGAAGAAGGCCTTCGGGTTGTAAAGTTCTTTCGGTATGAGGA

AGGTGCGTTATAGACTTCTTGACGTTGCCACAGAAGAAGC
AGGGTATCTATAGTGAGATTGACGTTGTCGCAGAAGAAGC
AGGGAAGTTATACCGTAATTGACGTTGTCACAGAAGAAGC
AGGGTAGTTATACCTATATTGACGTTATCACAGAAGAAGC
AGGGACGTGATAGAATTATTGACGTTCCTACAGAAGAAGC
AGGGTAGTTATAGGCATTTTGACGTTGCCACAGAAGAAGC
AGGGATGTTACAGTTTTATTGACGTTGCCACAGAAGAAGC
AGGGTTTTTATAGCTATATTGACGTTATCACAGAAGAAGC
AGGGTTTTTATAGACAGATTGACGTTGTCACAGAAGAAGC
AGGATTTTTATAGGTGTATTGACGTTGTCACAGAAGAAGC
AGGGTTTTTATAGACATATTGACGTTGTCACAGAAGAAGC
AGGTATTTTATAGGCTTATTGACGTTGTCACAGAAGAAGC
AGGAGTTTAATAGTACAATTGACGTTATTACAGAAGAAGC
AGGGTTTTTATAGCTATATTGACGTTGTCACAGAAGAAGC
AGGTTAGTTATAGATACATTGACGTTACCACAGAAGAAGC
AGGGTTTTTATAGCAAGATTGACGTTGTCACAGAAGAAGC
AGGACGTTTATAGTTGGATTGACGTTATCACAGAAGAAGC
AGGTAGGTTATAGATTAATTGACGTTGTCACAGAAGAAGC
AGGTGTTTTATAGGCTCATTGACGTTGTCACAGAAGAAGC
AGGATTGTTATAGAAATATTGACGTTGTCACAGAAGAAGC
AGGAGTTTTATAGATCTATTGACGTTATCACAGAAGAAGC
AGGGTTTTTATAGACGTATTGACGTTATCACAGAAGAAGC
AGGACATTTATAGTTGGATTGACGTTACTACAGAAGAAGC
AGGTATGTTATAGACTCATTGACGTTAATACAGAAGAAGC
AGGTGTGTTATAGATCCATTGACGTTAATACAGAAGAAGC
AGGATTTTTATAGCTGGATTGACGTTACTACAGAAGAAGC
AGGATTTTTATAGCTAGATTGACGTTGTCACAGAAGAAGC
AGGGTTTTTAGAGTTATATTGACGATATCACAGAAGAAGC
AGGTATGTTATAGATTTTTTGACGTTGCCACAGAAGAAGC
AGGTCTTTAATAGGCGTATTGACGTTGCCACAGAAGAAGC
AGGGTGGTTATAGTTAAATTGACGTTGTCACAGAAGAAGC

ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCACGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGGGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGGGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT
ACCGGCTACTCCGTGCCAGCAGCCCGGTATCGAGGGTGAGCGT

TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGCTAAGTCG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGCATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGATTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCATGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTGTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGTAGGCGGACTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTAG
TATCGAATACTGGGCGTAAAGGGCACGCAGGTGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGGCTTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGTATTAAGTGG
TATCGAATACTGGGCGTAAAGGGCACGCAGGCGGAAATAAGTGG

GTGTGAAAGCCCGGCTACCTAGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTGAGACTGCAC
GTGTGAAAGCCCGGCCACCTGGGATTGCATTTCAGACTGCAC
GTGTGAAAGCCCGGCTACCTGGGATAGCATTTCAGACTGATC
ATGTGAAAGCCCGGCTACTTGGGATAGCATTTCATACTGCAC
ATGTGAAATCCCGGCTACTTGGGATTGCATTTCAGACTTTAC
ATGTGAAATCCCGGCTACCTGGGACTGCATTTCAGACTGTGC
GTGTGAAATCCCGGCTACCTGGGATTGCATTTCAGACTGCGC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCATACTGTGC
ATGTGAAATCCCGGCTACTTGGGACTGCATTTCAGACTGATC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCAGACTGATC
ATGTGAAATCCCGGCTACTTGGGATTGCATTTCAGACTGATC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCAGACTGATC
GTGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
GTGTGAAATCCCGGCTACCTGGGATTGCATTTCAGACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTCAGACTGCAC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGCAC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCAGACTGCGC
GTGTGAAAGCCCGGCTACCTGGGATTGCATTTCAGACTACAC
ATGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGCAC
GTGTGAAAGCCCGGCTACCTAGGATTGCATTTCAGACTGTAC
GTGTGAAATCCCGGCTACCTGGGATTGCATTTCATACTGGTC
GTGTGAAAGCCCGGCTACCTGGGATTGCATTTCATACTGTGC
ATGTGAAAGCCCGGCTACTTGGGATTGCATTTTAGACTGTAC
GTGTGAAATCCCGGCTACCTGGGACAGCATTTCAGACTGTAC
ATGTGAAAGCCCGGCTACCTGGGACAGCATTTCATACTTGTC
GTGTGAAATCCCGGCTACCTAGGATTGCATTTCATACTGTAC
ATGTGAAATCCCGGCTACTTGGGACTGCATTTCATACTTTTC

TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCATGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGGGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT
TGAGTTAGAGAGGGTAGAATTCCACGTGTAGCGGTGAAATGCGT

GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGAGACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGACGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAAATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAAATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCACCCCTTGGATGTACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGATACT
GAGATGTGAGGAATACCGAAGGCGAAGGCAGCCCTTGGAGAGACT
GAGATGTGAGGAATACCGGAGGCGAAGGCGGCCCTTGGAAAGACT

GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTGAGGTGCAAGCGTGGGTAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATATGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTGATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCGAGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT
GACGCTCATGTGCAAGCGTGGGGAGCACAGGATTGATACCCTGT

AGTCCCGCTGTAACGATGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGGTGG
AGTCCCGCCGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGAGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGAGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGATGTTGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGATTGAGTGG
AGTCCCGCCGTAACGCTGTCGATTTGGGGATTGGCTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGGTGG
AGTCCCGCTGTAACGGTGTCGATTTGGGGATTGGCTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGAGTGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGGTTGGGTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGATTGGTTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGATGG
AGTCCCGCTGTAACGCTGTCGATTTGGGGATTGGTTGG
AGTCCCGCTGTAACGATGTCGATTTGGGGGTTGGCTGG

TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCACCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGTTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CGCCGTAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGAAGCTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CTCCGAAGCAACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
TGCCGTAGTTACGTGATAAATCGCCGCCGGGGAGTACGGCCGCAG
CTCCGTAGCTACGCGTTAAATCGCCGCCGGGGAGTACGGCCGCAG

GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT
GTTAAAACAATATTGCGGGGGCGCACACGGTGGAGCATGT

GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTCA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTCA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTCA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAAC
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATG
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCGAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCAGA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCATA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA
GGTTTATCGATAACGCGAGAACCTACCTCTCTTGACATCTAA

GAAGAGATAGGGGCCTTCGGAACCTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTTGAGACAGGTGCT
GAACAGATATTGGCCTTCGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTTGGAACTAGAGACAGGTGCT
GAACAGATATTGGCCTTCGGAGCTTGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAGCTAGAGACAGGTGCT
GAAGAGATACAGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATGGTGGCCTTCGGAGCTTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATAGAGGCCTTAGGAGCTTGAGACAGGTGCT
GAAGAGATAGTGGCCTTCGGAACTTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAAAGATATAGGCCTTTGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATGGTGGCCTTCGGAGCTTGAGACAGGTGCT
GAAAAGATAGAGGCCTTCGGAACGTGAGACAGGTGCT
GAAGAGATAGTGGCCTTCGGAACTAGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAACCTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTTGTGACAGGTGCT
GAATAGATGGTGGCCTTCGGAGCTTGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAACCTGAGACAGGTGCT
GAAGAGATAGAGGCCTTCGGAACCTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT
GAATAGATGGTGGCCTTAGGAGCTTGAGACAGGTGCT
GAACAGATGAGGGCCTTCGGAACTTGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTTGAGACAGGTGCT
GAACAGATGTTGGCCTACGGAGCTAGAGACAGGTGCT
GGATAGATAATGGCTTTCAGAACTAGAGACAGGTGCT
GAATAGATAACGGCCGTAAGAGCTAGAGACAGGTGCT
GAATAGATGGTGGCCTTCGGAACTAGAGACAGGTGCT

GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA
GCATGGCTCGTCAGCTCGTGTTGTGAAATGTTGGGTTGTCCCGCAA

CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGCGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCTAGCTTGCGAGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCGAAGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTAGAGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTTGGGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTGTCCTTTGTTGCCATCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTTGAGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCCGGGGGGAACTCAA
CGAGCGCACCCTTATTCTTTGTTACCAACTAGCGGGGACTCAA
CGAGCGCACCCTTATCCTTTGTTGCCAGCTGGTGGGAACTCAA
CGAGCGCACCCCTATCCTTTGTTGCCATCTAGTGGGAACTCAA

AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGACGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGACACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGATACCGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACCCCGGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGACGCTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCAGTGATACTGGAGGAAGGGGGATGACGTCAAGTCA
AGGAGACTCCGGTGACACCGGAGGAAGGGGGATGACGTCAAGTCA

TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGTGCATACG
TCTGGCCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACG

AGGGCAGCGGCGCGGGTAGCGAATCTCAGAAAGTACGTCTAAGT
AGGGCGGCGACGCGGGTAGCGAATCTCAGAAAGTACGTCTAAGT
AGGGTAGCGACGCGGGTAGCCAATCTCAGAAAGTGCATCTAAGT
AGGGGTGCGACGCTGTGAGCGAATCTCATAAAGTGCGTCTAAGT
AGGGCGGCGACGCGGGTAGCGAATCTCAGAAAGTGCATCGTAGT
AGGGGTGCGGCGCGGGTAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGAAACGGCGCGGGGAGTGAATCTCAGAAAGTGCGTCGTAGT
AGAGCGACGGCGCGGTGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCAACGCGGGAAGCGAATCTCAGAAAGTGCATCGTAGT
AGGGCGGCGACGCGGGTAGCGAATCTCAGAAAGTGCATCGTAGT
AGGGAAGCAATGCCATGAGCAAATCTCACAAAGTACGTCTAAGT
AGGGCGACGACGCGGTGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGCAGCGGGGCGCTTAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGCGACAACGCGGGGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGAAGCGGGGCGCTGAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGTAGCGGCGCTGGTAGCCAATCTCAGAAAGTGCATCTAAGT
AGGGTGGCGGCGCTGGTGGCGAATCTCTTAAAGTGCATCTAAGT
AGGGCGACGACGCTGGGAGTGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCGACGTTGCGAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCAACGCAGGGAGCGAATCTCACAAAGTACGTCTAAGT
AGGGAAGCGGCGCGGGGAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGCGGCGGCGCTGGGAGCGAATCTCAGAAAGTACGTCTAAGT
AGGGAAGCAGTGCCATGAGCAAATCTCACAAAGTACGTCTAAGT
AGGGAAGCGACGCGGTGAGCGAATCTCATAAAGTACGTCTAAGT
AGGGTAACCACGCTGGGAGTGAATCTCAGAAAGTACGTCTAAGT
AGGGAAGCGGGGCGCTGAGCGAATCTCACAAAGTACGTCTAAGT
AGGGAAGCGTCGCGGTGAGCGAATCTCACAAAGTGCATCTAAGT
AGGGAGGCGGCGCGGGTAGCGAAGCTCAGAAAGTACGTCTAAGT
AGGGAGGCGATGCTATGAGCGAAACTCAGAAAGTGCATCGTAGT
AGGGCAGCGGGGCGCTTAGCGAATCTCAGAAAGTGCATCTAAGT
AGGGAAGCGGGGCGCTTAGCAAATCTCTTAAAGTGCGTCGTAGT

CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGCGTCTGCACTCGACGCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGGACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT
CCGGATTGGAGTCTGCACTCGACTCCATGAAGTCGGAATCGCTAGTAAT

CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGCATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCGAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCGGATCGAATGTCGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT
CGCAAATCGAATGTTGCGGTAATACGTTCCCGGGCCT

Output files for phylogenetic exercises

Exercise 1

DNA parsimony algorithm, version 3.573c

Name         Sequences
----         ---------
seq1         AAG
seq2         ..A
seq3         GGA
seq4         .GA

One most parsimonious tree found:

        +--seq4      
     +--3  
  +--2  +--seq3      
  !  !  
--1  +-----seq2      
  !  
  +--------seq1      

  remember: this is an unrooted tree!

requires a total of      3.000
steps in each site:
         0   1   2   3   4   5   6   7   8   9
     *-----------------------------------------
    0!       1   1   1                        

 From    To     Any Steps?    State at upper node
                             ( . means same as in the node below it on tree)

          1                AAR
   1      2        maybe   ..A
   2      3         yes    .G.
   3   seq4         no     ...
   3   seq3         yes    G..
   2   seq2         no     ...
   1   seq1        maybe   ..G
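
The step counts in the Exercise 1 output can be checked by hand with Fitch's counting rule: at each internal node take the intersection of the children's state sets, and if it is empty take the union and count one step. The sketch below is only an illustration of that rule on the tree shown above, not the dnapars program itself; the names `seqs`, `tree` and `fitch` are made up for the example.

```python
# Fitch parsimony count on the Exercise 1 tree (an illustrative sketch).
seqs = {"seq1": "AAG", "seq2": "AAA", "seq3": "GGA", "seq4": "AGA"}

# Tree from the output above as nested pairs. It is rooted at node 1 only
# for bookkeeping; the number of steps does not depend on the root.
tree = ((("seq4", "seq3"), "seq2"), "seq1")


def fitch(node, site):
    """Return (state set, steps) for the subtree at `node` at one site."""
    if isinstance(node, str):  # leaf: the observed base, no steps
        return {seqs[node][site]}, 0
    left_set, left_steps = fitch(node[0], site)
    right_set, right_steps = fitch(node[1], site)
    steps = left_steps + right_steps
    common = left_set & right_set
    if common:  # the children agree on at least one state
        return common, steps
    return left_set | right_set, steps + 1  # disagreement costs one step


steps_per_site = [fitch(tree, i)[1] for i in range(3)]
total_steps = sum(steps_per_site)
print(steps_per_site, total_steps)
```

Each of the three sites needs one step, giving the total of 3 reported above. At the third site the state set at node 1 is {A, G}, which is what the ambiguity code R in "AAR" stands for.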

Exercise 2a

DNA parsimony algorithm, version 3.573c

One most parsimonious tree found:

     +--------gorillaexx
  +--2  
  !  !  +-----homosapien
  !  +--3  
--1     !  +--orangutang
  !     +--4  
  !        +--gibbonxxxx
  !  
  +-----------chimpanzee

  remember: this is an unrooted tree!
requires a total of    330.000

Exercise 2b

Neighbor-Joining/UPGMA method version 3.573c


        +-------gibbonxxxx
     +--1  
  +--2  +-----orangutang
  !  !  
  !  +---gorillaexx
  !  
--3-homosapien
  !  
  +--chimpanzee


remember: this is an unrooted tree!

Between        And            Length
-------        ---            ------
   3             2              0.00318
   2             1              0.03598
   1          gibbonxxxx        0.12602
   1          orangutang        0.09198
   2          gorillaexx        0.05777
   3          homosapien        0.04015
   3          chimpanzee        0.05195
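
The branch table can be used to read off the distance between any two species along the tree, by adding the lengths of the branches on the path that joins them. A minimal sketch, with the table above typed in by hand; the helper name `path_length` is an assumption of this example and is not part of PHYLIP.

```python
from collections import defaultdict

# Branch table from the Exercise 2b neighbor-joining output above.
branches = [
    ("3", "2", 0.00318),
    ("2", "1", 0.03598),
    ("1", "gibbonxxxx", 0.12602),
    ("1", "orangutang", 0.09198),
    ("2", "gorillaexx", 0.05777),
    ("3", "homosapien", 0.04015),
    ("3", "chimpanzee", 0.05195),
]

# Undirected adjacency list: node -> [(neighbour, branch length), ...]
adjacency = defaultdict(list)
for a, b, length in branches:
    adjacency[a].append((b, length))
    adjacency[b].append((a, length))


def path_length(start, goal):
    """Sum the branch lengths along the unique path between two nodes."""
    stack = [(start, None, 0.0)]
    while stack:
        node, parent, dist = stack.pop()
        if node == goal:
            return dist
        for neighbour, length in adjacency[node]:
            if neighbour != parent:  # a tree has no cycles, so never go back
                stack.append((neighbour, node, dist + length))
    raise ValueError(f"no path from {start} to {goal}")


print(round(path_length("homosapien", "chimpanzee"), 5))
print(round(path_length("homosapien", "gibbonxxxx"), 5))
```

For homosapien and chimpanzee the path is 0.04015 + 0.05195; for homosapien and gibbonxxxx it runs through nodes 3, 2 and 1, that is 0.04015 + 0.00318 + 0.03598 + 0.12602.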

Exercise 2c (dnaml program)

Nucleic acid sequence Maximum Likelihood method, version 3.573c

Empirical Base Frequencies:

   A       0.30929
   C       0.32750
   G       0.10570
  T(U)     0.25751

Transition/transversion ratio =   2.000000
(Transition/transversion parameter =   1.653039)

     +-homosapien
  +--2  
  !  !  +-----orangutang
  !  +--3  
  !     +-------gibbonxxxx
  !  
--1---gorillaexx
  !  
  +--chimpanzee

remember: this is an unrooted tree!
Ln Likelihood = -2514.48557
Examined   17 trees

 Between        And            Length      Approx. Confidence Limits
 -------        ---            ------      ------- ---------- ------

   1             2              0.01720     (  0.00594,     0.02847) **
   2          homosapien        0.02875     (  0.01514,     0.04262) **
   2             3              0.05455     (  0.03466,     0.07469) **
   3          orangutang        0.09121     (  0.06758,     0.11573) **
   3          gibbonxxxx        0.13271     (  0.10424,     0.16187) **
   1          gorillaexx        0.06191     (  0.04344,     0.08064) **
   1          chimpanzee        0.05097     (  0.03402,     0.06806) **

     *  = significantly positive, P < 0.05
     ** = significantly positive, P < 0.01
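
The Exercise 2c output reports both a transition/transversion ratio (2.0) and a transition/transversion parameter (1.653039). The sketch below shows one way the two may be related through the empirical base frequencies. The relation is an assumption of this example and is not stated on this page; it has only been checked against the values printed here, which it reproduces to about four decimals. The function name `ts_tv_parameter` is made up.

```python
def ts_tv_parameter(freq_a, freq_c, freq_g, freq_t, ts_tv_ratio):
    """Convert an expected transition/transversion ratio into a rate parameter.

    Assumed relation, checked only against the outputs shown on this page.
    """
    freq_r = freq_a + freq_g  # purines
    freq_y = freq_c + freq_t  # pyrimidines
    aa = ts_tv_ratio * freq_r * freq_y - freq_a * freq_g - freq_c * freq_t
    bb = freq_a * freq_g / freq_r + freq_c * freq_t / freq_y
    return aa / bb


# Empirical base frequencies from the Exercise 2c output above.
param_2c = ts_tv_parameter(0.30929, 0.32750, 0.10570, 0.25751, 2.0)
print(round(param_2c, 4))
```

Because the base frequencies enter the conversion, the same ratio of 2.0 gives a different parameter for a data set with different base composition.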

Exercise 2d

(Bootstrapping of parsimony)

Majority-rule and strict consensus tree program, version 3.573c

Species in order: 

  gorillaexx
  homosapien
  orangutang
  gibbonxxxx
  chimpanzee

Sets included in the consensus tree
Set (species in order)     How many times out of 100.00

..**.                      100.00
.***.                      73.67

Sets NOT included in consensus tree:
Set (species in order)     How many times out of 100.00

..***                      20.17
.*..*                       6.17

CONSENSUS TREE:
the numbers at the forks indicate the number
of times the group consisting of the species
which are to the right of that fork occurred
among the trees, out of 100.00 trees

                 +----gibbonxxxx          
            +-100.0
       +-73.7    +----orangutang          
       !    !
  +-100.0    +---------homosapien          
  !    !
  !    +--------------chimpanzee          
  !
  +-------------------gorillaexx          

  remember: this is an unrooted tree!
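
The set patterns in the consensus output are read against the "Species in order" list: a `*` marks a species that belongs to the group and a `.` one that does not. The sketch below decodes the four patterns above and keeps those found in more than half of the bootstrap trees. It assumes the simple more-than-half criterion for the majority rule; the names `species`, `sets` and `decode` are made up for the example.

```python
species = ["gorillaexx", "homosapien", "orangutang", "gibbonxxxx", "chimpanzee"]

# (pattern, times out of 100.00) pairs copied from the output above.
sets = [("..**.", 100.00), (".***.", 73.67), ("..***", 20.17), (".*..*", 6.17)]


def decode(pattern):
    """Return the species marked with '*' in a set pattern."""
    return [name for name, mark in zip(species, pattern) if mark == "*"]


# Majority rule: keep the groups found in more than half of the trees.
majority = [(decode(p), pct) for p, pct in sets if pct > 50.0]
for group, pct in majority:
    print(f"{pct:6.2f}  {' + '.join(group)}")
```

The two groups that pass are orangutang + gibbonxxxx and homosapien + orangutang + gibbonxxxx, the same two forks that carry the labels 100.0 and 73.7 in the consensus tree.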

Exercise 3 (parsimony and neighbour joining methods)

Protein parsimony algorithm, version 3.573c

One most parsimonious tree found:

           +--homosapien
     +-----3  
     !     +--gorillaexx
  +--2  
  !  !     +--orangutang
--1  +-----4  
  !        +--gibbonxxxx
  !  
  +-----------chimpanzee

  remember: this is an unrooted tree!
requires a total of     57.000


Neighbor-Joining/UPGMA method version 3.573c
 Neighbor-joining method
 Negative branch lengths allowed

     +--------gibbonxxxx
  +--1  
  !  +-----orangutang
  !  
--3-gorillaexx
  !  
  !  +-chimpanzee
  +--2  
     +-homosapien


remember: this is an unrooted tree!

Between        And            Length
-------        ---            ------
   3             1              0.02121
   1          gibbonxxxx        0.14023
   1          orangutang        0.08913
   3          gorillaexx        0.03381
   3             2              0.00696
   2          chimpanzee        0.02877
   2          homosapien        0.03076

Exercise 4

Results are given for neighbor joining, together with sample results from one run each of maximum parsimony and maximum likelihood, using the sequences in the order in which they appear in the data set. Note that the parsimony and likelihood results depend strongly on that order. For a data set of this size an exhaustive search is impossible, but randomizing the input order of the sequences gives different results, from which the best can be chosen. For parsimony, the value obtained (827 substitutions) is quite good. For maximum likelihood, the log likelihood can be improved to at least about -6622.3.
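
The idea of trying several random input orders and keeping the best result can be sketched as follows. This is only an outline under stated assumptions: `search` is a hypothetical function supplied by the caller that runs a heuristic tree search for a given input order and returns its score (steps for parsimony, log likelihood for maximum likelihood), and `toy_search` is a stand-in so that the example runs. In PHYLIP itself the same effect is obtained with the Jumble option (J) of the programs.

```python
import random


def best_over_random_orders(names, search, n_orders=10, seed=0, maximize=False):
    """Run `search` on shuffled input orders and keep the best (score, order).

    Use maximize=False for parsimony (fewer steps is better) and
    maximize=True for maximum likelihood (higher log likelihood is better).
    """
    rng = random.Random(seed)
    best_score, best_order = None, None
    for _ in range(n_orders):
        order = list(names)
        rng.shuffle(order)  # a new random input order
        score = search(order)
        improved = best_score is None or (
            score > best_score if maximize else score < best_score
        )
        if improved:
            best_score, best_order = score, order
    return best_score, best_order


def toy_search(order):
    # Stand-in for a real heuristic search: the score depends only on the order.
    return sum(position * len(name) for position, name in enumerate(order))


names = ["a", "bb", "ccc", "dddd"]
score, order = best_over_random_orders(names, toy_search, n_orders=20, seed=1)
print(score, order)
```

The best score over several orders is still only the best value found; it is not guaranteed to be the optimum.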

Exercise 4: Neighbor joining

  31 Populations

Neighbor-Joining/UPGMA method version 3.573c


 Neighbor-joining method

 Negative branch lengths allowed


        +Actinomuri
     +-16  
     !  !  +Bistfivexx
     !  +--4  
  +-28     +Mhaemolyti
  !  !  
  !  !  +Ptrehalosi
  !  +-27  
  !     +Pbettixxxx
  !  
  !           +-Bistsevent
  !        +-13  
  !        !  !  +Hinfleunza
  !        !  +--8  
  !     +-20     +Aactinomyc
  !     !  !  
  !     !  !     +-Asuccinoge
  !     !  !  +--9  
  !     !  +-15  +-Hsomnusxxx
  !     !     !  
  !     !     +Paerogenes
  !  +-23  
  !  !  !        +Pgallinaru
  !  !  !     +--6  
  !  !  !     !  +Bistthreth
  !  !  !  +-11  
  !  !  !  !  !  +-Asalpingit
  !  !  !  !  +--2  
-29-24  +-21     +-Ptestudini
  !  !     !  
  !  !     !  +Pmultocida
  !  !     +--5  
  !  !        +Hfelisxxxx
  !  !  
  !  !  +Hparasuisx
  !  +-17  
  !     !  +Acapsulatu
  !     +-14  
  !        +Hhaemoglob
  !  
  !           +-Adelphinic
  !        +-10  
  !        !  !  +Phocenobac
  !        !  +--1  
  !     +-19     +Hducreyixx
  !     !  !  
  !     !  !  +Aminorxxxx
  !     !  +-12  
  !     !     !  +Alignieres
  !  +-25     +--3  
  !  !  !        +Hparainflu
  !  !  !  
  !  !  !     +-Plnagaaxxx
  !  !  !  +--7  
  +-26  +-18  +Arossiixxx
     !     !  
     !     +Aporcinusx
     !  
     !  +Bistsevenx
     +-22  
        +Lonekoalax

remember: this is an unrooted tree!

Between        And            Length
-------        ---            ------
  29            28              0.00041
  28            16              0.00318
  16          Actinomuri        0.01928
  16             4              0.00272
   4          Bistfivexx        0.01366
   4          Mhaemolyti        0.01594
  28            27              0.00069
  27          Ptrehalosi        0.01605
  27          Pbettixxxx        0.01465
  29            24              0.00152
  24            23              0.00042
  23            20              0.00157
  20            13              0.00285
  13          Bistsevent        0.02478
  13             8              0.00197
   8          Hinfleunza        0.01359
   8          Aactinomyc        0.03031
  20            15              0.00198
  15             9              0.00251
   9          Asuccinoge        0.01980
   9          Hsomnusxxx        0.03060
  15          Paerogenes        0.01609
  23            21              0.00139
  21            11              0.00341
  11             6              0.00214
   6          Pgallinaru        0.01507
   6          Bistthreth        0.02603
  11             2              0.00443
   2          Asalpingit        0.03543
   2          Ptestudini        0.03827
  21             5              0.00387
   5          Pmultocida        0.01569
   5          Hfelisxxxx        0.01131
  24            17              0.00226
  17          Hparasuisx        0.01199
  17            14              0.00156
  14          Acapsulatu        0.01477
  14          Hhaemoglob        0.01133
  29            26              0.00090
  26            25              0.00089
  25            19              0.00219
  19            10              0.00288
  10          Adelphinic        0.02672
  10             1              0.00443
   1          Phocenobac        0.02806
   1          Hducreyixx        0.01204
  19            12              0.00132
  12          Aminorxxxx        0.01064
  12             3              0.00461
   3          Alignieres        0.01688
   3          Hparainflu        0.01562
  25            18              0.00207
  18             7              0.00301
   7          Plnagaaxxx        0.02251
   7          Arossiixxx        0.01099
  18          Aporcinusx        0.01284
  26            22              0.00172
  22          Bistsevenx        0.01834
  22          Lonekoalax        0.01606

Exercise 4: Parsimony

DNA parsimony algorithm, version 3.573c

One most parsimonious tree found:


                                                                                         +--Hparainflu
                                                                                      +-25  
           +-------------------------------------------------------------------------22  +--Alignieres
           !                                                                          !  
           !                                                                          +-----Aminorxxxx
           !  
           !                                                                          +-----Aporcinusx
           !        +-----------------------------------------------------------------8  
           !        !                                                                 !  +--Arossiixxx
           !        !                                                                 +--9  
           !        !                                                                    +--Plnagaaxxx
     +----10        !  
     !     !        !                                                                 +-----Pbettixxxx
     !     !        !                 +----------------------------------------------20  
     !     !        !                 !                                               !  +--Lonekoalax
     !     !        !                 !                                               +-26  
     !     !        !                 !                                                  +--Bistsevenx
     !     !        !                 !  
     !     !        !        +-------16                                            +--------Hparasuisx
     !     !        !        !        !        +----------------------------------11  
     !     +--------4        !        !        !                                   !  +-----Paerogenes
     !              !        !        !        !                                   +--7  
     !              !        !        !        !                                      !  +--Hsomnusxxx
     !              !        !        +--------6                                      +-27  
     !              !        !                 !                                         +--Asuccinoge
     !              !        !                 !  
     !              !        !                 !           +--------------------------------Hhaemoglob
     !              !        !                 !           !  
     !              !        !                 +----------17  +-----------------------------Acapsulatu
     !              !        !                             !  !  
     !              !        !                             !  !  +--------------------------Ptrehalosi
     !              !        !                             +-13  !  
     !              !        !                                !  !           +--------------Pmultocida
     !              !        !                                !  !           !  
  +--2              +--------3                                !  !  +-------12  +-----------Hfelisxxxx
  !  !                       !                                +-15  !        !  !  
  !  !                       !                                   !  !        +-14        +--Ptestudini
  !  !                       !                                   !  !           !  +----30  
  !  !                       !                                   !  !           !  !     +--Asalpingit
  !  !                       !                                   !  !           +-28  
  !  !                       !                                   +--5              !     +--Bistthreth
  !  !                       !                                      !              +----29  
  !  !                       !                                      !                    +--Pgallinaru
  !  !                       !                                      !  
  !  !                       !                                      !                    +--Aactinomyc
  !  !                       !                                      !                 +-24  
  !  !                       !                                      +----------------23  +--Hinfleunza
--1  !                       !                                                        !  
  !  !                       !                                                        +-----Bistsevent
  !  !                       !  
  !  !                       !                                                        +-----Bistfivexx
  !  !                       +-------------------------------------------------------18  
  !  !                                                                                !  +--Mhaemolyti
  !  !                                                                                +-21  
  !  !                                                                                   +--Actinomuri
  !  !  
  !  !                                                                                   +--Hducreyixx
  !  +----------------------------------------------------------------------------------19  
  !                                                                                      +--Phocenobac
  !  
  +-----------------------------------------------------------------------------------------Adelphinic

  remember: this is an unrooted tree!

requires a total of    827.000

Exercise 4: Maximum likelihood

fastDNAml, version 1.2.1, March 9, 1998

Based on Joseph Felsenstein's

   Nucleic acid sequence Maximum Likelihood method, version 3.3

31 Species, 1137 Sites

...

Total weight of positions in analysis = 1137
There are 186 distinct data patterns (columns)

Empirical Base Frequencies:

   A       0.23894
   C       0.21466
   G       0.32715
  T(U)     0.21925

Transition/transversion ratio =   2.000000

(Transition/transversion parameter =   1.484598)

...

Examined 1887 trees

             + Hducreyixx
   +--------18  
   !         +--------- Phocenobac
   !  
   !                      +--------- Hparainflu
   !            +--------24  
   !  +--------21         + Alignieres
   !  !         !  
   !  !         + Aminorxxxx
   !  !  
   !  !               +--------- Lonekoalax
---1  !            +-25  
   !  !            !  +--------- Bistsevenx
   !  !            !  
   !  !            !                     +--------- Hfelisxxxx
   !  !            !                  +-13  
   !  !            !                  !  +--------- Pmultocida
   !  !            !        +--------11  
   !  !            !        !         !            +-------------------- Aactinomyc
   !  !            !        !         !  +--------23  
   !  !            !        !         +-22         + Hinfleunza
   !  !            !     +-14            !  
   +--9         +-15     !  !            +-------------------- Bistsevent
   !  !         !  !     !  !  
   !  !         !  !     !  !         + Ptrehalosi
   !  !         !  !  +--2  +--------12  
   !  !         !  !  !  !            + Acapsulatu
   !  !         !  !  !  !  
   !  !         !  !  !  !  +--------- Hhaemoglob
   !  !         !  !  !  !  !  
   !  !         !  !  !  +-16                   +--------- Bistthreth
   !  !         !  !  !     !         +--------28  
   !  !         !  !  !     !         !         !  +------------------------------ Ptestudini
   !  !         !  !  !     +--------27         +-29  
   !  !         !  +--7               !            +-------------------- Asalpingit
   !  !         !     !               !  
   !  !         !     !               +--------- Pgallinaru
   !  !         !     !  
   !  !         !     !        +--------- Pbettixxxx
   !  +---------3     !     +-19  
   !            !     !     !  !         +--------- Mhaemolyti
   !            !     !  +-17  +--------20  
   !            !     !  !  !            +--------- Bistfivexx
   !            !     !  !  !  
   !            !     +--5  +-------------------- Actinomuri
   !            !        !  
   !            !        !         + Hparasuisx
   !            !        !         !  
   !            !        +---------6            +-------------------- Hsomnusxxx
   !            !                  !  +--------26  
   !            !                  +-10         +--------- Asuccinoge
   !            !                     !  
   !            !                     +--------- Paerogenes
   !            !  
   !            !     +--------- Arossiixxx
   !            !  +--8  
   !            +--4  +-------------------- Plnagaaxxx
   !               !  
   !               +--------- Aporcinusx
   !  
   +-------------------- Adelphinic

Remember: this is an unrooted tree!

Ln Likelihood =   -6644.56672

 Between        And             Length      Approx. Confidence Limits
 -------        ---             ------      ------- ---------- ------
   1            18              0.00837     (  0.00272,     0.01407) **
  18          Hducreyixx        0.01213     (  0.00533,     0.01900) **
  18          Phocenobac        0.02807     (  0.01797,     0.03834) **
   1             9              0.00684     (  0.00150,     0.01222) **
   9            21              0.00693     (  0.00179,     0.01212) **
  21            24              0.01251     (  0.00563,     0.01947) **
  24          Hparainflu        0.02188     (  0.01292,     0.03097) **
  24          Alignieres        0.01167     (  0.00497,     0.01843) **
  21          Aminorxxxx        0.00367     (     zero,     0.00769) **
   9             3              0.00386     (  0.00007,     0.00767) **
   3            15              0.00266     (     zero,     0.00623) **
  15            25              0.00636     (  0.00123,     0.01153) **
  25          Lonekoalax        0.01693     (  0.00897,     0.02499) **
  25          Bistsevenx        0.01945     (  0.01093,     0.02809) **
  15             7              0.00290     (     zero,     0.00640) **
   7             2              0.00404     (     zero,     0.00816) **
   2            14              0.00388     (  0.00012,     0.00765) **
  14            11              0.00602     (  0.00053,     0.01155) **
  11            13              0.00568     (  0.00109,     0.01031) **
  13          Hfelisxxxx        0.00946     (  0.00337,     0.01562) **
  13          Pmultocida        0.01799     (  0.00992,     0.02618) **
  11            22              0.00768     (  0.00185,     0.01357) **
  22            23              0.00741     (  0.00174,     0.01312) **
  23          Aactinomyc        0.03562     (  0.02422,     0.04724) **
  23          Hinfleunza        0.00998     (  0.00354,     0.01649) **
  22          Bistsevent        0.02472     (  0.01521,     0.03440) **
  14            12              0.00211     (     zero,     0.00562) *
  12          Ptrehalosi        0.01503     (  0.00773,     0.02242) **
  12          Acapsulatu        0.01437     (  0.00727,     0.02155) **
   2            16              0.00362     (     zero,     0.00778) **
  16          Hhaemoglob        0.01116     (  0.00467,     0.01773) **
  16            27              0.01046     (  0.00415,     0.01683) **
  27            28              0.00786     (  0.00190,     0.01388) **
  28          Bistthreth        0.02454     (  0.01477,     0.03448) **
  28            29              0.01170     (  0.00472,     0.01875) **
  29          Ptestudini        0.04200     (  0.02943,     0.05485) **
  29          Asalpingit        0.03249     (  0.02130,     0.04389) **
  27          Pgallinaru        0.01283     (  0.00565,     0.02010) **
   7             5              0.00289     (     zero,     0.00630) **
   5            17              0.00249     (     zero,     0.00594) **
  17            19              0.00323     (     zero,     0.00689) **
  19          Pbettixxxx        0.01382     (  0.00665,     0.02107) **
  19            20              0.00930     (  0.00335,     0.01532) **
  20          Mhaemolyti        0.01693     (  0.00912,     0.02485) **
  20          Bistfivexx        0.01281     (  0.00595,     0.01974) **
  17          Actinomuri        0.02377     (  0.01457,     0.03311) **
   5             6              0.00782     (  0.00242,     0.01328) **
   6          Hparasuisx        0.00721     (  0.00182,     0.01265) **
   6            10              0.00830     (  0.00222,     0.01444) **
  10            26              0.00706     (  0.00159,     0.01259) **
  26          Hsomnusxxx        0.03306     (  0.02193,     0.04440) **
  26          Asuccinoge        0.01859     (  0.01002,     0.02728) **
  10          Paerogenes        0.01708     (  0.00890,     0.02537) **
   3             4              0.00517     (  0.00069,     0.00967) **
   4             8              0.00667     (  0.00147,     0.01192) **
   8          Arossiixxx        0.00578     (  0.00103,     0.01058) **
   8          Plnagaaxxx        0.02878     (  0.01874,     0.03899) **
   4          Aporcinusx        0.01098     (  0.00468,     0.01734) **
   1          Adelphinic        0.02925     (  0.01893,     0.03976) **

     *  = significantly positive, P < 0.05
     ** = significantly positive, P < 0.01

Ib Skovgaard (ims@kvl.dk) 2002-06-17