Publication Date:
2008-06-21
Description:
Genetic sequence alignment is the basis of many evolutionary and comparative studies, and errors in alignments lead to errors in the interpretation of evolutionary information in genomes. Traditional multiple sequence alignment methods disregard the phylogenetic implications of gap patterns that they create and infer systematically biased alignments with excess deletions and substitutions, too few insertions, and implausible insertion-deletion-event histories. We present a method that prevents these systematic errors by recognizing insertions and deletions as distinct evolutionary events. We show theoretically and practically that this improves the quality of sequence alignments and downstream analyses over a wide range of realistic alignment problems. These results suggest that insertions and sequence turnover are more common than is currently thought and challenge the conventional picture of sequence evolution and mechanisms of functional and structural changes.〈br /〉〈span class="detail_caption"〉Notes: 〈/span〉Loytynoja, Ari -- Goldman, Nick -- GR078968/Wellcome Trust/United Kingdom -- New York, N.Y. -- Science. 2008 Jun 20;320(5883):1632-5. doi: 10.1126/science.1158395.〈br /〉〈span class="detail_caption"〉Author address: 〈/span〉European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK. ari@ebi.ac.uk〈br /〉〈span class="detail_caption"〉Record origin:〈/span〉 〈a href="http://www.ncbi.nlm.nih.gov/pubmed/18566285" target="_blank"〉PubMed〈/a〉
Keywords:
*Algorithms
;
Computer Simulation
;
*Evolution, Molecular
;
HIV/chemistry/classification/genetics
;
HIV Envelope Protein gp120/*chemistry/genetics
;
HIV-1/chemistry/classification/genetics
;
Membrane Glycoproteins/*chemistry/genetics
;
Mutagenesis, Insertional
;
*Phylogeny
;
Sequence Alignment/*methods
;
Sequence Deletion
;
Simian Immunodeficiency Virus/chemistry/classification/genetics
;
Viral Envelope Proteins/*chemistry/genetics
Print ISSN:
0036-8075
Electronic ISSN:
1095-9203
Topics:
Biology
,
Chemistry and Pharmacology
,
Computer Science
,
Medicine
,
Natural Sciences in General
,
Physics