Australian Systematic Botany Australian Systematic Botany Society
Taxonomy, biogeography and evolution of plants
L. A. S. JOHNSON REVIEW

Multiple sequence alignment for phylogenetic purposes

David A. Morrison
+ Author Affiliations
- Author Affiliations

Department of Parasitology (SWEPAR), National Veterinary Institute and Swedish University of Agricultural Sciences, 751 89 Uppsala, Sweden. Email: David.Morrison@bvf.slu.se

Australian Systematic Botany 19(6) 479-539 https://doi.org/10.1071/SB06020
Submitted: 3 July 2006  Accepted: 30 October 2006   Published: 14 December 2006

Abstract

I have addressed the biological rather than bioinformatics aspects of molecular sequence alignment by covering a series of topics that have been under-valued, particularly within the context of phylogenetic analysis. First, phylogenetic analysis is only one of the many objectives of sequence alignment, and the most appropriate multiple alignment may not be the same for all of these purposes. Phylogenetic alignment thus occupies a specific place within a broader context. Second, homology assessment plays an intricate role in phylogenetic analysis, with sequence alignment consisting of primary homology assessment and tree building being secondary homology assessment. The objective of phylogenetic alignment thus distinguishes it from other sorts of alignment. Third, I summarise what is known about the serious limitations of using phenetic similarity as a criterion for automated multiple alignment, and provide an overview of what is currently being done to improve these computerised procedures. This synthesises information that is apparently not widely known among phylogeneticists. Fourth, I then consider the recent development of automated procedures for combining alignment and tree building, thus integrating primary and secondary homology assessment. Finally, I outline various strategies for increasing the biological content of sequence alignment procedures, which consists of taking into account known evolutionary processes when making alignment decisions. These procedures can be objective and repeatable, and can involve computerised algorithms to automate much of the work. Perhaps the most important suggestion is that alignment should be seen as a process where new sequences are added to a pre-existing alignment that has been manually curated by the biologist.


References


Aagesen L, Petersen G, Seberg O (2005) Sequence length variation, indel costs, and congruence in sensitivity analysis. Cladistics 21, 15–30.

Aboitiz F (1987) Letter to the editor. Cell 51, 515–516.
CrossRef | PubMed |

Achaz G, Boyer F, Rocha EPC, Viari AC (2006) Repseek, a tool to retrieve approximate repeats from large DNA sequences. Bioinformatics in press ,
CrossRef | PubMed |

Al-Lazikani B, Sheinerman FB, Honig B (2001) Combining multiple structure and sequence alignments to improve sequence detection and alignment: application to the SH2 domains of janus kinases. Proceedings of the National Academy of Sciences USA 98, 14 796–14 801.
CrossRef |

Allison L, Wallace CS (1994) The posterior probability distribution of alignments and its application to parameter estimation of evolutionary trees and to optimization of multiple alignments. Journal of Molecular Evolution 39, 418–430.
CrossRef | PubMed |

Allison L , Wallace CS , Yee CN (1992) Minimum message length encoding, evolutionary trees and multiple alignment. In ‘Proceedings of the Hawaii international conference on system sciences (HICSS-25).’ pp. 663–674. (IEEE Press: Piscataway)

Althaus E, Caprara A, Lenhof H-P, Reinert K (2002) Multiple sequence alignment with arbitrary gap costs: computing an optimal solution using polyhedral combinatorics. Bioinformatics 18, S4–S16.
PubMed |


Anbarasu LA, Narayanasamy P, Sundararajan V (2000) Multiple molecular sequence alignment by island parallel genetic algorithm. Current Science 78, 858–863.

Andersen ES, Rosenblad MA, Larsen N, Westergaard JC, Burks J, Wower IK, Wower J, Gorodkin J, Samuelsson T, Zwieb C (2006) The tmRDB and SRPDB resources. Nucleic Acids Research 34, D163–D168.
CrossRef | PubMed |

Anwar T, Khan AU (2006) SSRscanner: a program for reporting distribution and location of simple sequence repeats. Bioinformation 1, 89–91.

Apostolico A, Giancarlo R (1998) Sequence alignment in molecular biology. Journal of Computational Biology 5, 173–196.
PubMed |


Armougom F, Moretti S, Poirot O, Audic S, Dumas P, Schaeli B, Keduas V, Notredame C (2006) Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee. Nucleic Acids Research 34, W604–W608.
PubMed |


Arvestad L (1997) Aligning coding DNA in the presence of frame-shift errors. Lecture Notes in Computer Science 1264, 180–190.

Badger JH, Eisen JA, Ward NL (2005) Genomic analysis of Hyphomonas neptunium contradicts 16S rRNA gene-based phylogenetic analysis: implications for the taxonomy of the orders ‘Rhodobacterales’ and Caulobacterales. International Journal of Systematic and Evolutionary Microbiology 55, 1021–1026.
CrossRef | PubMed |

Bafna V, Tang H, Zhang S (2006) Consensus folding of unaligned RNA sequences revisited. Journal of Computational Biology 13, 283–295.
CrossRef | PubMed |

Bahr A, Thompson JD, Thierry J-C, Poch O (2001) BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations. Nucleic Acids Research 29, 323–326.
CrossRef | PubMed |

Barta JR (1997) Investigating phylogenetic relationships within the Apicomplexa using sequence data: the search for homology. Methods 13, 81–88.
CrossRef | PubMed |

Barton GJ, Sternberg MJE (1987) A strategy for the rapid multiple alignment of protein sequences: confidence levels from tertiary structure comparisons. Journal of Molecular Biology 198, 327–337.
CrossRef | PubMed |

Batzoglou S (2005) The many faces of sequence alignment. Briefings in Bioinformatics 6, 6–22.
CrossRef | PubMed |

Bauer M, Klau GW, Reinert K (2005a) Fast and accurate structural RNA alignment by progressive lagrangian optimization. Lecture Notes in Computer Science 3695, 217–228.

Bauer M, Klau GW, Reinert K (2005b) Multiple structural RNA alignment with lagrangian relaxation. Lecture Notes in Computer Science 3692, 303–314.

Baumel A, Ainouche ML, Bayer RJ, Ainouche AK, Misset MT (2002) Molecular phylogeny of hybridizing species from the genus Spartina Schreb. (Poaceae). Molecular Phylogenetics and Evolution 22, 303–314.
CrossRef | PubMed |

Beebe NW, Cooper RD, Morrison DA, Ellis JT (2000) Subset partitioning of the ribosomal DNA small subunit and its effects on the phylogeny of the Anopheles punctulatus group. Insect Molecular Biology 9, 515–520.
CrossRef | PubMed |

Bell LH, Coggins JR, Milner-White EJ (1993) Mix’n’Match: an improved multiple sequence alignment procedure for distantly related proteins using secondary structure predictions, designed to be independent of the choice of gap penalty and scoring matrix. Protein Engineering 6, 683–690.
PubMed |


Belshaw R, Quicke DLJ (2002) Robustness of ancestral state estimates: evolution of life history strategy in ichneumonoid parasitoids. Systematic Biology 51, 450–477.
CrossRef | PubMed |

Benner SA, Cohen MA, Gonnet GH (1993) Empirical and structural models for insertions and deletions in the divergent evolution of proteins. Journal of Molecular Biology 229, 1065–1082.
CrossRef | PubMed |

Benson G (1997) Sequence alignment with tandem duplication. Journal of Computational Biology 4, 351–367.
PubMed |


Benson G (1999) Tandem Repeats Finder: a program to analyze DNA sequences. Nucleic Acids Research 27, 573–580.
CrossRef | PubMed |

Bininda-Emonds ORP (2005) TransAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences. BMC Bioinformatics 6, 156.
CrossRef | PubMed |

Bishop MJ, Thompson EA (1986) Maximum likelihood alignment of DNA sequences. Journal of Molecular Biology 190, 159–165.
CrossRef | PubMed |

Blackshields G, Wallace IM, Larkin M, Higgins DG (2006) Analysis and comparison of benchmarks for multiple sequence alignment. In Silico Biology 6, 0030.
PubMed |


Blaisdell BE (1986) A measure of the similarity of sets of sequences not requiring sequence alignment. Proceedings of the National Academy of Sciences USA 83, 5155–5159.
CrossRef |

Bledsoe AH, Sheldon FH (1990) Molecular homology and DNA hybridization. Journal of Molecular Evolution 30, 425–433.
CrossRef | PubMed |

Boeva V, Regnier M, Papatsenko D, Makeev V (2006) Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression. Bioinformatics 22, 676–684.
CrossRef | PubMed |

Bonizzoni P, Della Vedova G (2001) The complexity of multiple sequence alignment with SP-score that is a metric. Theoretical Computer Science 259, 63–79.
CrossRef |

Brawley SH (1999) Submission and retrieval of an aligned set of nucleic acid sequences. Journal of Phycology 35, 433–437.
CrossRef |

Brenner SE, Chothia C, Hubbard TJ (1998) Assessing sequence comparison methods with reliable structurally-identified distant evolutionary relationships. Proceedings of the National Academy of Sciences USA 95, 6073–6078.
CrossRef |

Briffeuil P, Baudoux G, Lambert C, De Bolle X, Vinals C, Feytmans E, Depiereux E (1998) Comparative analysis of seven multiple protein sequence alignment servers: clues to enhance reliability of predictions. Bioinformatics 14, 357–366.
CrossRef | PubMed |

Britten RJ, Rowen L, Williams J, Cameron RA (2003) Majority of divergence between closely related DNA samples is due to indels. Proceedings of the National Academy of Sciences USA 100, 4661–4665.
CrossRef |

Brower AVZ, Schawaroch V (1996) Three steps of homology assessment. Cladistics 12, 265–272.

Brown JW (1999) The ribonuclease P database. Nucleic Acids Research 27, 314.
CrossRef | PubMed |

Bucka-Lassen K, Caprani O, Hein J (1999) Combining many multiple alignments in one improved alignment. Bioinformatics 15, 122–130.
CrossRef | PubMed |

Butler AB, Saidel WM (2000) Defining sameness: historical, biological, and generative homology. BioEssays 22, 846–853.
CrossRef | PubMed |

Campagna D, Romualdi C, Vitulo N, Del Favero M, Lexa M, Cannata N, Valle G (2005) RAP: a new computer program for de novo identification of repeated sequences in whole genomes. Bioinformatics 21, 582–588.
CrossRef | PubMed |

Cannone JJ, Subramanian S, Schnare MN, Collett JR, D’Souza LM , et al. (2002) The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics 3, 2.
CrossRef | PubMed |

Carfi A, Pares S, Duée E, Galleni M, Duez C, Frère JM, Dideberg O (1995) The 3-D structure of a zinc metallo-β-lactamase from Bacillus cereus reveals a new type of protein fold. EMBO Journal 14, 4914–4921.
PubMed |


Cartmill M (1994) A critique of homology as a morphological concept. American Journal of Physical Anthropology 94, 115–123.
CrossRef | PubMed |

Cartwright RA (2005) DNA assembly with gaps (DAWG): simulating sequence evolution. Bioinformatics 21, iii31–iii38.
CrossRef | PubMed |

Castelo AT, Martins W, Gao GR (2002) TROLL—tandem repeat occurrence locator. Bioinformatics 18, 634–636.
CrossRef | PubMed |

Catherinot V, Labesse G (2004) ViTO: tool for refinement of protein sequence–structure alignments. Bioinformatics 20, 3694–3696.
CrossRef | PubMed |

Cerchio S, Tucker P (1998) Influence of alignment on the mtDNA phylogeny of Cetacea: questionable support for a Mysticeti / Physeteroidea clade. Systematic Biology 47, 336–344.
CrossRef | PubMed |

Chain P, Kurtz S, Ohlebusch E, Slezak T (2003) An applications-focused review of comparative genomics tools: capabilities, limitations, and future challenges. Briefings in Bioinformatics 4, 105–123.
CrossRef | PubMed |

Chakrabarti S, Bhardwaj N, Anand PA, Sowdhamini R (2004) Improvement of alignment accuracy utilizing sequentially conserved motifs. BMC Bioinformatics 5, 167.
CrossRef | PubMed |

Chakrabarti S, Lanczycki CJ, Panchenko AR, Przytycka TM, Thiessen PA, Bryant SH (2006) Refining multiple sequence alignments with conserved core regions. Nucleic Acids Research 34, 2598–2606.
CrossRef | PubMed |

Chan SC, Wong AKC, Chiu DKY (1992) A survey of multiple sequence comparison methods. Bulletin of Mathematical Biology 54, 563–598.
PubMed |


Chang MSS, Benner SA (2004) Empirical analysis of protein insertions and deletions determining parameters for the correct placement of gaps in protein sequence alignments. Journal of Molecular Biology 341, 617–631.
CrossRef | PubMed |

Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD (2003) Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Research 31, 3497–3500.
CrossRef | PubMed |

Chiaromonte F , Yap VB , Miller W (2002) Scoring pairwise genomic sequence alignments. In ‘Proceedings of the 7th Pacific Symposium on Biocomputing 2002, Lihue, Hawaii’. pp. 115–126.

Chindelevitch L, Li Z, Blais E, Blanchette M (2006) On the inference of parsimonious indel evolutionary scenarios. Journal of Bioinformatics and Computational Biology 4, 721–744.
CrossRef | PubMed |

Clamp M, Cuff J, Searle SM, Barton GJ (2004) The Jalview java alignment editor. Bioinformatics 20, 426–427.
CrossRef | PubMed |

Cognato AI, Vogler AP (2001) Exploring data interaction and nucleotide alignment in a multiple gene analysis of Ips (Coleoptera: Scolytinae). Systematic Biology 50, 758–780.
CrossRef | PubMed |

Cole JR, Chai B, Farris RJ, Wang Q, Kulam SA, McGarrell DM, Garrity GM, Tiedje JM (2005) The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis. Nucleic Acids Research 33, D294–D296.
CrossRef | PubMed |

Cooper A, Lalueza-Fox C, Anderson S, Rambaut A, Austin J, Ward R (2001) Complete mitochondrial genome sequences of two extinct moas clarify ratite evolution. Nature 409, 704–707.
CrossRef | PubMed |

Corpet F (1988) Multiple sequence alignment with hierarchical clustering. Nucleic Acids Research 16, 10 881–10 890.

Corpet F, Michot B (1994) RNAlign program: alignment of RNA sequences using both primary and secondary structures. Computer Applications in the Biosciences 10, 389–399.
PubMed |


Cozzetto D, Tramontano A (2005) Relationship between multiple sequence alignments and quality of protein comparative models. Proteins: Structure, Function, and Bioinformatics 58, 151–157.
CrossRef |

Croan DG, Morrison DA, Ellis JT (1997) Evolution of the genus Leishmania revealed by comparison of DNA and RNA polymerase gene sequences. Molecular and Biochemical Parasitology 89, 149–159.
CrossRef | PubMed |

Dalli D, Wilm A, Mainz I, Steger G (2006) STRAL: progressive alignment of non-coding RNA using base pairing probability vectors in quadratic time. Bioinformatics 22, 1593–1599.
CrossRef | PubMed |

Darling ACE, Mau B, Blattner FR, Perna NT (2004) Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Research 14, 1394–1403.
CrossRef | PubMed |

De Laet JE (2005) Parsimony and the problem of inapplicables in sequence data. In ‘Parsimony, phylogeny, and genomics.’ (Ed. VA Albert) pp. 81–116. (Oxford University Press: Oxford)

Deléage G, Clerc FF, Roux B, Gautheron DC (1988) ANTHEPROT: a package for protein sequence analysis using a microcomputer. Computer Applications in the Biosciences 4, 351–356.
PubMed |


De Rijk P, De Wachter R (1993) DCSE, an interactive tool for sequence alignment and secondary structure research. Bioinformatics 9, 735–740.

DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Andersen GL (2006a) Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Applied and Environmental Microbiology 72, 5069–5072.
CrossRef | PubMed |

DeSantis TZ, Hugenholtz P, Keller K, Brodie EL, Larsen N, Piceno YM, Phan R, Andersen GL (2006b) NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes. Nucleic Acids Research 34, W394–W399.
CrossRef | PubMed |

Dewey CN, Pachter L (2006) Evolution at the nucleotide level: the problem of multiple whole-genome alignment. Human Molecular Genetics 15, R51–R56.
CrossRef | PubMed |

Do CB, Mahabhashyam MSP, Brudno M, Batzoglou S (2005) ProbCons: probabilistic consistency-based multiple sequence alignment. Genome Research 15, 330–340.
CrossRef | PubMed |

Domingues FS, Lackner P, Andreeva A, Sippl MJ (2000) Structure-based evaluation of sequence comparison and fold recognition alignment accuracy. Journal of Molecular Biology 297, 1003–1013.
CrossRef | PubMed |

Donoghue MJ , Sanderson MJ (1994) Complexity and homology in plants. In ‘Homology: the hierarchical basis of comparative biology’. (Ed. BK Hall) pp. 393–421. (Academic Press: San Diego)

Doolittle RF (1981) Similar amino acid sequences: chance or common ancestry? Science 214, 149–159.
CrossRef | PubMed |

Duret L , Abdeddaïm S (2000) Multiple alignments for structural, functional, or phylogenetic analyses of homologous sequences. In ‘Bioinformatics: sequence, structure, and databanks.’ (Ed. D Higgins, W Taylor) pp. 51–76. (Oxford University Press: Oxford)

Ebedes J, Datta A (2004) Multiple sequence alignment in parallel on a workstation cluster. Bioinformatics 20, 1193–1195.
CrossRef | PubMed |

Eddy SR (1998) Profile hidden markov models. Bioinformatics 14, 755–763.
CrossRef | PubMed |

Eddy SR (2002a) A memory efficient dynamic programming algorithm for optimal structural alignment of a sequence to an RNA secondary structure. BMC Bioinformatics 3, 18.
CrossRef | PubMed |

Eddy SR (2002b) Computational genomics of noncoding RNA genes. Cell 109, 137–140.
CrossRef | PubMed |

Edgar RC (2004a) Local homology recognition and distance measures in linear time using compressed amino acid alphabets. Nucleic Acids Research 32, 380–385.
CrossRef | PubMed |

Edgar RC (2004b) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32, 1792–1797.
CrossRef | PubMed |

Edgar RC (2004c) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5, 113.
CrossRef | PubMed |

Edgar RC, Sjölander K (2004) A comparison of scoring functions for protein sequence profile alignment. Bioinformatics 20, 1301–1308.
CrossRef | PubMed |

Edgar RC, Batzoglou S (2006) Multiple sequence alignment. Current Opinion in Structural Biology 16, 368–373.
CrossRef | PubMed |

Elias I (2003) Settling the intractability of multiple alignment. Lecture Notes in Computer Science 2906, 352–363.

Ellis J, Morrison D (1995) Effects of sequence alignment on the phylogeny of Sarcocystis deduced from 18S rDNA sequences. Parasitology Research 81, 696–699.
CrossRef | PubMed |

Errami M, Geourjon C, Deléage G (2003) Conservation of amino acids into multiple alignments involved in pairwise interactions in three-dimensional protein structures. Journal of Bioinformatics and Computational Biology 1, 505–520.
CrossRef | PubMed |

Feng D-F, Doolittle RF (1987) Progressive sequence alignment as a prerequisite to correct phylogenetic trees. Journal of Molecular Evolution 25, 351–360.
PubMed |


Finn RD, Mistry J, Schuster-Böckler B, Griffiths-Jones S, Hollich V , et al. (2006) Pfam: clans, web tools and services. Nucleic Acids Research 34, D247–D251.
CrossRef | PubMed |

Fitch WM (2000) Homology: a personal view on some of the problems. Trends in Genetics 16, 227–231.
CrossRef | PubMed |

Fitch WM, Smith TF (1983) Optimal sequence alignments. Proceedings of the National Academy of Sciences USA 80, 1382–1386.
CrossRef |

Fleißner R (2004) ‘Sequence alignment and phylogenetic inference.’ (Logos Verlag: Berlin)

Fleissner R, Metzler D, von Haeseler A (2005) Simultaneous statistical multiple alignment and phylogeny reconstruction. Systematic Biology 54, 548–561.
CrossRef | PubMed |

Frith MC, Hansen U, Spouge JL, Weng Z (2004) Finding functional sequence elements by multiple local alignment. Nucleic Acids Research 32, 189–200.
CrossRef | PubMed |

Gagnon S, Bourbeau D, Levesque RC (1996) Secondary structures and features of the 18S, 5.8S and 26S ribosomal RNAs from the Apicomplexan parasite Toxoplasma gondii. Gene 173, 129–135.
CrossRef | PubMed |

Gardner PP, Giegerich R (2004) A comprehensive comparison of comparative RNA structure prediction approaches. BMC Bioinformatics 5, 140.
CrossRef | PubMed |

Gardner PP, Wilm A, Washietl S (2005) A benchmark of multiple sequence alignment programs upon structural RNAs. Nucleic Acids Research 33, 2433–2439.
CrossRef | PubMed |

Geiger DL (2002) Stretch coding and block coding: two new strategies to represent questionably aligned DNA sequences. Journal of Molecular Evolution 54, 191–199.
CrossRef | PubMed |

Gille C, Frömmel C (2001) STRAP: editor for structural alignments of proteins. Bioinformatics 17, 377–378.
CrossRef | PubMed |

Gillespie JJ (2004) Characterizing regions of ambiguous alignment caused by the expansion and contraction of hairpin-stem loops in ribosomal RNA molecules. Molecular Phylogenetics and Evolution 33, 936–943.
CrossRef | PubMed |

Gillespie JJ, Yoder MJ, Wharton RA (2005a) Predicted secondary structure for 28S and 18S rRNA from Ichneumonoidea (Insecta: Hymenoptera: Apocrita): impact on sequence alignment and phylogeny estimation. Journal of Molecular Evolution 61, 114–137.
CrossRef | PubMed |

Gillespie JJ, McKenna CH, Yoder MJ, Gutell RR, Johnston JS, Kathirithamby J, Cognato AI (2005b) Assessing the odd secondary structural properties of nuclear small subunit ribosomal RNA sequences (18S) of the twisted-wing parasites (Insecta: Strepsiptera). Insect Molecular Biology 14, 625–643.
CrossRef | PubMed |

Giribet G (2001) Exploring the behavior of POY, a program for direct optimization of molecular data. Cladistics 17, S60–S70.
CrossRef | PubMed |

Giribet G (2002) Relationship among metazoan phyla as inferred from 18S rRNA sequence data: a methodological approach. In ‘Molecular systematics and evolution: theory and practice’. (Eds R DeSalle, G Giribet, W Wheeler) pp. 85–101. (Birkhäuser Verlag: Basel)

Giribet G (2005) Generating implied alignments under direct optimization using POY. Cladistics 21, 396–402.
CrossRef |

Giribet G, Wheeler WC (1999) On gaps. Molecular Phylogenetics and Evolution 13, 132–143.
CrossRef | PubMed |

Giribet G , Wheeler WC , Muona J (2002) DNA multiple sequence alignments. In ‘Molecular systematics and evolution: theory and practice’. (Eds R DeSalle, G Giribet, W Wheeler) pp. 107–114. (Birkhäuser Verlag: Basel)

Gonnet GH, Korostensky C, Benner S (2000) Evaluation measures of multiple sequence alignments. Journal of Computational Biology 7, 261–276.
CrossRef | PubMed |

Gotoh O (1982) An improved algorithm for matching biological sequences. Journal of Molecular Biology 162, 705–708.
CrossRef | PubMed |

Gotoh O (1990) Consistency of optimal sequence alignments. Bulletin of Mathematical Biology 52, 509–525.
PubMed |


Gotoh O (1995) A weighting scheme and algorithm for aligning many phylogenetically related sequences. Computer Applications in the Biosciences 11, 543–551.
PubMed |


Gotoh O (1996) Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments. Journal of Molecular Biology 264, 823–838.
CrossRef | PubMed |

Gotoh O (1999) Multiple sequence alignment: algorithms and applications. Advances in Biophysics 36, 159–206.
CrossRef | PubMed |

Gough J (2005) Convergent evolution of domain architectures is rare. Bioinformatics 21, 1464–1471.
CrossRef | PubMed |

Graham SW, Reeves PA, Burns ACE, Olmstead RG (2000) Microstructural changes in noncoding chloroplast DNA: interpretation, evolution, and utility of indels and inversions in basal angiosperm phylogenetic inference. International Journal of Plant Sciences 161, S83–S96.
CrossRef |

Grasso C, Lee C (2004) Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems. Bioinformatics 20, 1546–1556.
CrossRef | PubMed |

Greenberg HJ, Hart WE, Lancia G (2004) Opportunities for combinatorial optimization in computational biology. INFORMS Journal on Computing 16, 211–231.
CrossRef |

Griffiths-Jones S (2005) RALEE—RNA alignment editor in emacs. Bioinformatics 21, 257–259.
CrossRef | PubMed |

Griffiths-Jones S, Moxon S, Marshall M, Khanna A, Eddy SR, Bateman A (2005) Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Research 33, D121–D124.
CrossRef | PubMed |

Gu X, Li W-H (1995) The size distribution of insertions and deletions in human and rodent pseudogenes suggests the logarithmic gap penalty for sequence alignment. Journal of Molecular Evolution 40, 464–473.
CrossRef | PubMed |

Gueneau de Novoa P, Williams KP (2004) The tmRNA website: reductive evolution of tmRNA in plastids and other endosymbionts. Nucleic Acids Research 32, D104–D108.
CrossRef | PubMed |

Gupta SK, Kececioglu JD, Schäffer AA (1995) Improving the practical space and time efficiency of the shortest-paths approach to sum-of-pairs multiple sequence alignment. Journal of Computational Biology 2, 459–472.
PubMed |


Gutell RR, Lee JC, Cannone JJ (2002) The accuracy of ribosomal RNA comparative structure models. Current Opinion in Structural Biology 12, 301–310.
CrossRef | PubMed |

Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series 41, 95–98.

Hancock JM, Vogler AP (2000) How slippage-derived sequences are incorporated into rRNA variable-region secondary structure: implications for phylogeny reconstruction. Molecular Phylogenetics and Evolution 14, 366–374.
CrossRef | PubMed |

Haszprunar G (1998) Parsimony analysis as a specific kind of homology estimation and the implications for character weighting. Molecular Phylogenetics and Evolution 9, 333–339.
CrossRef | PubMed |

Heger A, Holm L (2000) Rapid automatic detection and alignment of repeats in protein sequences. Proteins: Structure, Function, and Genetics 41, 224–237.
CrossRef |

Hein J (1990) Unified approach to alignment and phylogenies. Methods in Enzymology 183, 626–645.
PubMed |


Hein J (1994) An algorithm combining DNA and protein alignment. Journal of Theoretical Biology 167, 169–174.
CrossRef | PubMed |

Hein J, Støvlbæk J (1996) Combined DNA and protein alignment. Methods in Enzymology 266, 402–418.
PubMed |


Helm M, Brulé H, Friede D, Giegé R, Pütz J, Florentz C (2000) Search for characteristic structural features of mammalian mitochondrial tRNAs. RNA 6, 1356–1379.
CrossRef | PubMed |

Henneke CM (1989) A multiple sequence alignment algorithm for homologous proteins using secondary structure information and optionally keying alignments to functionally important sites. Computer Applications in the Biosciences 5, 141–150.
PubMed |


Hennig W (1966) ‘Phylogenetic systematics.’ [Transl. DD Davis, R Zangerl from W Hennig (1950) ‘Grundzüge einer theorie der phylogenetischen systematik.’ (Deutscher Zentralverlag: Berlin)] (University of Illinois Press: Urbana)

Henikoff S (1991) Playing with blocks: some pitfalls of forcing multiple alignments. The New Biologist 3, 1148–1154.
PubMed |


Heringa J (1999) Two strategies for sequence comparison: profile-preprocessed and secondary structure-induced multiple alignment. Computers and Chemistry 23, 341–364.
CrossRef | PubMed |

Hickson RE, Simon C, Cooper A, Spicer GS, Sullivan J, Penny D (1996) Conserved sequence motifs, alignment, and secondary structure for the third domain of animal 12S rRNA. Molecular Biology and Evolution 13, 150–169.
PubMed |


Hickson RE, Simon C, Perrey SW (2000) The performance of several multiple-sequence alignment programs in relation to secondary-structure features for an rRNA sequence. Molecular Biology and Evolution 17, 530–539.
PubMed |


Higgins DG, Thompson JD, Gibson TJ (1996) Using CLUSTAL for multiple sequence alignments. Methods in Enzymology 266, 383–402.
PubMed |


Higgins DG, Blackshields G, Wallace IM (2005) Mind the gaps: progress in progressive alignment. Proceedings of the National Academy of Sciences USA 102, 10 411–10 412.
CrossRef |

Higgs PG (2000) RNA secondary structure: physical and computational aspects. Quarterly Reviews of Biophysics 33, 199–253.
CrossRef | PubMed |

Hillis DM (1994) Homology in molecular biology. In ‘Homology: the hierarchical basis of comparative biology’. (Ed. BK Hall) pp. 339–368. (Academic Press: San Diego)

Hirosawa M, Totoki Y, Hoshida M, Ishikawa M (1995) Comprehensive study of iterative algorithms of multiple sequence alignment. Computer Applications in the Biosciences 11, 13–18.
PubMed |


Hofacker IL, Bernhart SHF, Stadler PF (2004) Alignment of RNA base pairing probability matrices. Bioinformatics 20, 2222–2227.
CrossRef | PubMed |

Hogeweg P, Hesper B (1984) The alignment of sets of sequences and the construction of phyletic trees: an integrated method. Journal of Molecular Evolution 20, 175–186.
CrossRef | PubMed |

Holm L, Sander C (1996) Mapping the protein universe. Science 273, 595–603.
CrossRef | PubMed |

Holmes I (2005) Accelerated probabilistic inference of RNA structure evolution. BMC Bioinformatics 6, 73.
CrossRef | PubMed |

Holmes I, Durbin R (1998) Dynamic programming alignment accuracy. Journal of Computational Biology 5, 493–504.
PubMed |


Hoot SB, Douglas AW (1998) Phylogeny of the Proteaceae based on atpB and atpB–rbcL intergenic spacer region sequences. Australian Systematic Botany 11, 301–320.
CrossRef |

Hua Y, Jiang T, Wu B (1999) Aligning DNA sequences to minimize the change in protein. Journal of Combinatorial Optimization 3, 227–245.
CrossRef |

Huang X, Miller W (1991) A time-efficient, linear-space local similarity algorithm. Advances in Applied Mathematics 12, 337–357.
CrossRef |

Hudak J , McClure MA (1999) A comparative analysis of computational motif-detection methods. In ‘Proceedings of the 4th Pacific Symposium on Biocomputing 1999, Hawaii’. pp. 138–149.

Janies DA, Wheeler WC (2001) Efficiency of parallel direct optimization. Cladistics 17, S71–S82.
CrossRef | PubMed |

Jennings AJ, Edge CM, Sternberg MJE (2001) An approach to improving multiple alignments of protein sequences using predicted secondary structure. Protein Engineering 14, 227–231.
CrossRef | PubMed |

Jeon Y-S, Chung H, Park S, Hur I, Lee J-H, Chun J (2005) jPHYDIT: a JAVA-based integrated environment for molecular phylogeny of ribosomal RNA sequences. Bioinformatics 21, 3171–3173.
CrossRef | PubMed |

Jiang T , Lawler EL , Wang L (1994) Aligning sequences via an evolutionary tree: complexity and approximation. In ‘Proceedings of the 26th annual ACM symposium on theory of computing’. pp. 760–769. (ACM Press: New York)

Johnson MS, Sali A, Blundell TL (1990) Phylogenetic relationships from three-dimensional protein structures. Methods in Enzymology 183, 670–690.
PubMed |


Johnson R (1982) Parsimony principles in phylogenetic systematics: a critical re-appraisal. Evolutionary Theory 6, 79–90.

Just W (2001) Computational complexity of multiple sequence alignment with SP-score. Journal of Computational Biology 8, 615–623.
CrossRef | PubMed |

Just W, Della Vedova G (2004) Multiple sequence alignment as a facility location problem. INFORMS Journal on Computing 16, 430–440.
CrossRef |

Karaca M, Bilgen M, Onus AN, Ince AG, Elmasulu SY (2005) Exact Tandem Repeats Analyzer (E-TRA): a new program for DNA sequence mining. Journal of Genetics 84, 49–54.
PubMed |


Karp RM (2002) Mathematical challenges from genomics and molecular biology. Notices of the AMS 49, 544–553.

Karplus K, Hu B (2001) Evaluation of protein multiple alignments by SAM-T99 using the BAliBASE multiple alignment test set. Bioinformatics 17, 713–720.
CrossRef | PubMed |

Katoh K, Misawa K, Kuma K, Miyata T (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Research 30, 3059–3066.
CrossRef | PubMed |

Katoh K, Kuma K, Toh H, Miyata T (2005a) MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Research 33, 511–518.
CrossRef | PubMed |

Katoh K, Kuma K, Miyata T, Toh H (2005b) Improvement in the accuracy of multiple sequence alignment program MAFFT. Genome Informatics 16, 22–33.
PubMed |


Kawakita A, Sota T, Ascher JS, Ito M, Tanaka H, Kato M (2003) Evolution and phylogenetic utility of alignment gaps within intron sequences of three nuclear genes in bumble genes (Bombus). Molecular Biology and Evolution 20, 87–92.
CrossRef | PubMed |

Kececioglu J , Starrett D (2004) Aligning alignments exactly. In ‘Proceedings of the 8th ACM conference on research in computational molecular biology (RECOMB’04)’. pp. 85–96. (ACM Press: New York)

Kececioglu J, Kim E (2006) Simple and fast inverse alignment. Lecture Notes in Computer Science 3909, 441–455.

Keightley PD, Johnson T (2004) MCALIGN: stochastic alignment of noncoding DNA sequences based on an evolutionary model of sequence evolution. Genome Research 14, 442–450.
CrossRef | PubMed |

Kelchner SA (2000) The evolution of non-coding chloroplast DNA and its application in plant systematics. Annals of the Missouri Botanical Garden 87, 482–498.
CrossRef |

Kelchner SA (2002) Group II introns as phylogenetic tools: structure, function, and evolutionary constraints. American Journal of Botany 89, 1651–1669.

Kelchner SA, Wendel JF (1996) Hairpins create minute inversions in non-coding regions of chloroplast DNA. Current Genetics 30, 259–262.
CrossRef | PubMed |

Kelchner SA, Clark LG (1997) Molecular evolution and phylogenetic utility of the chloroplast rpl16 intron in Chusquea and the Bambusoideae (Poaceae). Molecular Phylogenetics and Evolution 8, 385–397.
CrossRef | PubMed |

Kjer KM (1995) Use of rRNA secondary structure in phylogenetic studies to identify homologous positions: an example of alignment and data presentation from the frogs. Molecular Phylogenetics and Evolution 4, 314–330.
CrossRef | PubMed |

Kjer KM (1997) An alignment template for amphibian 12S rRNA, domain III: conserved primary and secondary structural motifs. Journal of Herpetology 31, 599–604.
CrossRef |

Kjer KM (2004) Aligned 18S and insect phylogeny. Systematic Biology 53, 506–514.
CrossRef | PubMed |

Kjer KM, Baldridge GD, Fallon AM (1994) Mosquito large subunit ribosomal RNA: simultaneous alignment of primary and secondary structure. Biochimica et Biophysica Acta 1217, 147–155.
PubMed |


Kjer KM, Gillespie JJ, Ober KA (2006) Opinions on multiple sequence alignment, and an empirical comparison of repeatability and accuracy between POY and structural alignment. Systematic Biology in press ,

Kleinjung J, Douglas N, Heringa J (2002) Parallelized multiple alignment. Bioinformatics 18, 1270–1271.
CrossRef | PubMed |

Knudsen B, Miyamoto M (2003) Sequence alignments and pair hidden markov models using evolutionary history. Journal of Molecular Biology 333, 453–460.
CrossRef | PubMed |

Kolodny R, Koehl P, Levitt M (2005) Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures. Journal of Molecular Biology 346, 1173–1188.
CrossRef | PubMed |

Kreitman M (1983) Nucleotide polymorphism at the alcohol dehydrogenase locus of Drosophila melanogaster. Nature 304, 412–417.
CrossRef | PubMed |

Kroken S, Taylor JW (2001) Outcrossing and recombination in the lichenized fungus Letharia. Fungal Genetics and Biology 34, 83–92.
CrossRef | PubMed |

Kurtz S, Schleiermacher C (1999) REPuter: fast computation of maximal repeats in complete genomes. Bioinformatics 15, 426–427.
CrossRef | PubMed |

Lambert C, Van Campenhout J-M, DeBolle X, Depiereux E (2003) Review of common sequence alignment methods: clues to enhance reliability. Current Genomics 4, 131–146.
CrossRef |

Lancia G, Ravi R (1999) GESTALT: genomic steiner aligments. Lecture Notes in Computer Science 1645, 101–114.

Lassmann T, Sonnhammer ELL (2002) Quality assessment of multiple alignment programs. FEBS Letters 529, 126–130.
CrossRef | PubMed |

Lassmann T, Sonnhammer ELL (2005) Automatic assessment of alignment quality. Nucleic Acids Research 33, 7120–7128.
CrossRef | PubMed |

Laurenne NM, Broad GR, Quicke DLJ (2006) Direct optimization and multiple alignment of 28S D2–D3 rDNA sequences: problems with indels on the way to a molecular phylogeny of the cryptine ichneumon wasps (Insecta: Hymenoptera). Cladistics 22, 442–473.
CrossRef |

Lawrence CJ, Malmberg RL, Muszynski MG, Dawe RK (2002) Maximum likelihood methods reveal conservation of function among closely related kinesin families. Journal of Molecular Evolution 54, 42–53.
CrossRef | PubMed |

Lawrence CJ, Zmasek CM, Dawe RK, Malmberg RL (2004) LumberJack: a heuristic tool for sequence alignment exploration and phylogenetic inference. Bioinformatics 20, 1977–1979.
CrossRef | PubMed |

Lebrun E, Santini JM, Brugna M, Ducluzeau A-L, Ouchane S, Schoepp-Cothenet B, Baymann F, Nitschke W (2006) The rieske protein: a case study on the pitfalls of multiple sequence alignments and phylogenetic reconstruction. Molecular Biology and Evolution 23, 1180–1191.
CrossRef | PubMed |

Lecompte O, Thompson JD, Plewniak F, Thierry J-C, Poch O (2001) Multiple alignment of complete sequences (MACS) in the post-genomic era. Gene 270, 17–30.
CrossRef | PubMed |

Lee MSY (2001) Unalignable sequences and molecular evolution. Trends in Ecology and Evolution 16, 681–685.
CrossRef |

Lenhof H-P, Reinert K, Vingron M (1998) A polyhedral approach to RNA sequence structure alignment. Journal of Computational Biology 5, 517–530.
PubMed |


Li K-B (2003) ClustalW-MPI: ClustalW analysis using distributed and parallel computing. Bioinformatics 19, 1585–1586.
CrossRef | PubMed |

Lombard V, Camon EB, Parkinson HE, Hingamp P, Stoesser G, Redaschi N (2002) EMBL-Align: a new public nucleotide and amino acid multiple sequence alignment database. Bioinformatics 18, 763–764.
CrossRef | PubMed |

Löytynoja A, Milinkovitch MC (2001) SOAP, cleaning multiple alignments from unstable blocks. Bioinformatics 17, 573–574.
CrossRef | PubMed |

Löytynoja A, Milinkovitch MC (2003) A hidden markov model for progressive multiple alignment. Bioinformatics 19, 1505–1513.
CrossRef | PubMed |

Löytynoja A, Goldman N (2005) An algorithm for progressive multiple alignment of sequences with insertions. Proceedings of the National Academy of Sciences USA 102, 10 557–10 562.
CrossRef |

Lu CL, Huang YP (2005) A memory-efficient algorithm for multiple sequence alignment with constraints. Bioinformatics 21, 23–30.

Ludwig W, Strunk O, Westram R, Richter L, Meier H , et al. (2004) ARB: a software environment for sequence data. Nucleic Acids Research 32, 1363–1371.
CrossRef | PubMed |

Lunter G , Drummond AJ , Miklós I , Hein J (2005) Statistical alignment: recent progress, new applications, and challenges. In ‘Statistical methods in molecular evolution’. (Ed. R Nielsen) pp. 375–405. (Springer: New York)

Manohar A , Batzoglou S (2005) TreeRefiner: a tool for refining a multiple alignment on a phylogenetic tree. In ‘Proceedings of the 2005 IEEE computational systems bioinformatics conference (CSB’05)’. pp. 111–119. (IEEE Press: Piscataway)

Marchler-Bauer A, Panchenko AR, Ariel N, Bryant SH (2002) Comparison of sequence and structure alignments for protein domains. Proteins: Structure, Function, and Genetics 48, 439–446.
CrossRef |

Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY , et al. (2005) CDD: a conserved domain database for protein classification. Nucleic Acids Research 33, D192–D196.
CrossRef | PubMed |

Margulies EH, Chen CW, Green ED (2006) Differences between pair-wise and multi-sequence alignment methods affect vertebrate genome comparisons. Trends in Genetics 22, 187–193.
CrossRef | PubMed |

Marsden B, Abagyan R (2004) SAD—a normalized structural alignment database: improving sequence–structure alignments. Bioinformatics 20, 2333–2344.
CrossRef | PubMed |

Marti-Renom MA, Madhusudhan MS, Sali A (2004) Alignment of protein sequences by their profiles. Protein Science 13, 1071–1087.
CrossRef | PubMed |

May ACW (2004) Percent sequence identity: the need to be explicit. Structure 12, 737–738.
CrossRef | PubMed |

McClure MA, Vasi TK, Fitch WM (1994) Comparative analysis of multiple protein-sequence alignment methods. Molecular Biology and Evolution 11, 571–592.
PubMed |


Mecham J, Clement M, Snell Q, Freestone T, Seppi K, Crandall K (2006) Jumpstarting phylogenetic analysis. International Journal of Bioinformatics Research and Applications 2, 19–35.

Miklós I, Lunter GA, Holmes I (2004) A “long indel” model for evolutionary sequence alignment. Molecular Biology and Evolution 21, 529–540.
CrossRef | PubMed |

Milinkovitch MC, LeDuc RG, Adachi J, Farnir F, Georges M, Hasegawa M (1996) Effects of character weighting and species sampling on phylogeny reconstruction: a case study based on DNA sequence data in cetaceans. Genetics 144, 1817–1833.
PubMed |


Miller W (2001) Comparison of genomic DNA sequences: solved and unsolved problems. Bioinformatics 17, 391–397.
CrossRef | PubMed |

Mindell DP (1991) Aligning DNA sequences: homology and phylogenetic weighting. In ‘Phylogenetic analysis of DNA sequences’. (Eds MM Miyamoto, J Cracraft) pp. 73–89. (Oxford University Press: New York)

Morell V (1996) TreeBASE: the roots of phylogeny. Science 273, 569.
CrossRef |

Morgenstern B (1999) DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment. Bioinformatics 15, 211–218.
CrossRef | PubMed |

Morgenstern B, Prohaska SJ, Pohler D, Stadler PF (2006) Multiple sequence alignment with user-defined anchor points. Algorithms for Molecular Biology 1, 6.
CrossRef | PubMed |

Morris P, Cobabe E (1991) Cuvier meets Watson and Crick: the utility of molecules as classical homologies. Biological Journal of the Linnean Society 44, 307–324.

Morrison DA (2006) Phylogenetic analyses of parasites in the new millennium. Advances in Parasitology 63, 1–124.

Morrison DA, Ellis JT (1997) Effects of nucleotide sequence alignment on phylogeny estimation: a case study of 18S rDNAs of Apicomplexa. Molecular Biology and Evolution 14, 428–441.
PubMed |


Mugridge NB, Morrison DA, Jäkel T, Heckeroth AR, Tenter AM, Johnson AM (2000) Effects of sequence alignment and structural domains of ribosomal DNA on phylogeny reconstruction for the protozoan family Sarcocystidae. Molecular Biology and Evolution 17, 1842–1853.
PubMed |


Myers G, Selznick S, Zhang Z, Miller W (1996) Progressive multiple alignment with constraints. Journal of Computational Biology 3, 563–572.
PubMed |


Nguyen HD, Yoshihara I, Yamamori K, Yasunaga M (2002) Aligning multiple protein sequences by parallel hybrid genetic algorithm. Genome Informatics 13, 123–132.
PubMed |


Nicholas HB, Ropelewski AJ, Deerfield DW (2002) Strategies for multiple sequence alignment. BioTechniques 32, 572–591.
PubMed |


Notredame C (2002) Recent progress in multiple sequence alignment: a survey. Pharmacogenomics 3, 131–144.
CrossRef | PubMed |

Notredame C, O’Brien EA, Higgins DG (1997) RAGA: RNA sequence alignment by genetic algorithm. Nucleic Acids Research 25, 4570–4580.
CrossRef | PubMed |

Notredame C, Holm L, Higgins DG (1998) COFFEE: an objective function for multiple sequence alignments. Bioinformatics 14, 407–422.
CrossRef | PubMed |

Notredame C, Higgins DG, Heringa J (2000) T-coffee: a novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302, 205–217.
CrossRef | PubMed |

Nozaki Y, Bellgard M (2005) Statistical evaluation and comparison of a pairwise alignment algorithm that a priori assigns the number of gaps rather than employing gap penalties. Bioinformatics 21, 1421–1428.
CrossRef | PubMed |

O’Brien EA, Notredame C, Higgins DG (1998) Optimization of ribosomal RNA profile alignments. Bioinformatics 14, 332–341.
CrossRef | PubMed |

O’Donnell K, Kistler HC, Tacke BK, Casper HH (2000) Gene genealogies reveal global phylogeographic structure and reproductive isolation among lineages of Fusarium graminearum, the fungus causing wheat scab. Proceedings of the National Academy of Sciences USA 97, 7905–7910.
CrossRef |

Ogden TH, Rosenberg MS (2006) Multiple sequence alignment accuracy and phylogenetic inference. Systematic Biology 55, 314–328.
CrossRef | PubMed |

Ohlson T, Wallner B, Elofsson A (2004) Profile–profile methods provide improved fold recognition: a study of different profile–profile alignment methods. Proteins: Structure, Function, and Bioinformatics 57, 188–197.
CrossRef |

Oliver T, Schmidt B, Nathan D, Clemens R, Maskell D (2005) Using reconfigurable hardware to accelerate multiple sequence alignment with ClustalW. Bioinformatics 21, 3431–3432.
CrossRef | PubMed |

Ophir R, Graur D (1997) Patterns and rates of indel evolution in processed pseudogenes from humans and murids. Gene 205, 191–202.
CrossRef | PubMed |

O’Sullivan O, Suhre K, Abergel C, Higgins DG, Notredame C (2004) 3DCoffee: combining protein sequences and structures within multiple sequence alignments. Journal of Molecular Biology 340, 385–395.
CrossRef | PubMed |

Page RDM (2000) Comparative analysis of secondary structure of insect mitochondrial small subunit ribosomal RNA using maximum weighted matching. Nucleic Acids Research 28, 3839–3845.
CrossRef | PubMed |

Parida L, Floratos A, Rigoutsos I (1999) An approximation algorithm for alignment of multiple sequences using motif discovery. Journal of Combinatorial Optimization 3, 247–275.
CrossRef |

Parmentier G, Trystram D, Zola J (2004) Cache-based parallelization of multiple sequence alignment problem. Lecture Notes in Computer Science 3149, 1005–1012.

Pascarella S, Argos P (1992) Analysis of insertions / deletions in protein structures. Journal of Molecular Biology 224, 461–471.
CrossRef | PubMed |

Patterson C (1988) Homology in classical and molecular biology. Molecular Biology and Evolution 5, 603–625.
PubMed |


Pearson WR, Sierk ML (2005) The limits of protein sequence comparison? Current Opinion in Structural Biology 15, 254–260.
CrossRef | PubMed |

Pedersen CNS, Lyngsø R, Hein J (1998) Comparison of coding DNA. Lecture Notes in Computer Science 1448, 153–173.

Pei J, Grishin NV (2006) MUMMALS: multiple sequence alignment improved by using hidden markov models with local structural information. Nucleic Acids Research 34, 4364–4374.
CrossRef | PubMed |

Pei J, Sadreyev R, Grishin NV (2003) PCMA: fast and accurate multiple sequence alignment based on profile consistency. Bioinformatics 19, 427–428.
CrossRef | PubMed |

Petersen G, Seberg O, Aagesen L, Frederiksen S (2004) An empirical test of the treatment of indels during optimization alignment based on the phylogeny of the genus Secale (Poaceae). Molecular Phylogenetics and Evolution 30, 733–742.
CrossRef | PubMed |

Pettersson EU, Ljunggren EL, Morrison DA, Mattsson JG (2005) Functional analysis and localisation of a class delta glutathione S-transferase from Sarcoptes scabiei. International Journal for Parasitology 35, 39–48.
CrossRef | PubMed |

Phillips A (2006) Homology assessment and molecular sequence alignment. Journal of Biomedical Informatics 39, 18–33.
CrossRef | PubMed |

Phillips A, Janies D, Wheeler W (2000) Multiple sequence alignment in phylogenetic analysis. Molecular Phylogenetics and Evolution 16, 317–330.
CrossRef | PubMed |

Pible O, Imbert G, Pellequer J-L (2005) INTERALIGN: interactive alignment editor for distantly related protein sequences. Bioinformatics 21, 3166–3167.
CrossRef | PubMed |

de Pinna MCC (1991) Concepts and tests of homology in the cladistic paradigm. Cladistics 7, 367–394.
CrossRef |

Poch O, Delarue M (1996) Converting sequence block alignments into structural insights. Methods in Enzymology 266, 662–680.
PubMed |


Pollard DA, Bergman CM, Stoye J, Celniker SE, Eisen MB (2004) Benchmarking tools for the alignment of functional noncoding DNA. BMC Bioinformatics 5, 6.
CrossRef | PubMed |

Ponting CP , Birney E (2000) Identification of domains from protein sequences. In ‘Protein structure prediction: methods and protocols’. (Ed. DM Webster) pp. 53–69. (Humana Press: Totowa)

Qian B, Goldstein RA (2001) Distribution of indel lengths. Proteins: Structure, Function, and Genetics 45, 102–104.
CrossRef |

Raghava GPS, Searle SMJ, Audley PC, Barber JD, Barton GJ (2003) OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy. BMC Bioinformatics 4, 47.
CrossRef | PubMed |

Rainaldi G, Volpicella M, Licciulli F, Liuni S, Gallerani R, Ceci LR (2003) PLMItRNA, a database on the heterogeneous genetic origin of mitochondrial tRNA genes and tRNAs in photosynthetic eukaryotes. Nucleic Acids Research 31, 436–438.
CrossRef | PubMed |

Raphael B, Zhi D, Tang H, Pevzner P (2004) A novel method for multiple alignment of sequences with repeated and shuffled elements. Genome Research 14, 2336–2346.
CrossRef | PubMed |

Redelings BD, Suchard MA (2005) Joint bayesian estimation of alignment and phylogeny. Systematic Biology 54, 401–418.
CrossRef | PubMed |

Reeck GR, de Haën C, Teller DC, Doolittle RF, Fitch WM , et al. (1987) “Homology” in proteins and nucleic acids: a terminology muddle and a way out of it. Cell 50, 667.
CrossRef | PubMed |

Reese JT, Pearson WR (2002) Empirical determination of effective gap penalties for sequence comparison. Bioinformatics 18, 1500–1507.
CrossRef | PubMed |

Reinert K, Stoye J, Will T (2000) An iterative method for faster sum-of-pairs multiple sequence alignment. Bioinformatics 16, 808–814.
CrossRef | PubMed |

Riaz T, Wang Y, Li K-B (2004) Multiple sequence alignment using tabu search. Conferences in Research and Practice in Information Technology 29, 223–232.

Riaz T, Wang Y, Li K-B (2005) Tabu search algorithm for post-processing multiple sequence alignment. Journal of Bioinformatics and Computational Biology 3, 145–156.
CrossRef | PubMed |

Rice KA, Donoghue MJ, Olmstead RG (1997) Analyzing large data sets: rbcL 500 revisited. Systematic Biology 46, 554–563.
CrossRef | PubMed |

Rieppel O (1994) Homology, topology, and typology: the history of modern debates. In ‘Homology: the hierarchical basis of comparative biology’. (Ed. BK Hall) pp. 63–100. (Academic Press: San Diego)

Rieppel O, Kearney M (2002) Similarity. Biological Journal of the Linnean Society 75, 59–82.
CrossRef |

Rinsma-Melchert I (1993) The expected number of matches in optimal global sequence alignments. New Zealand Journal of Botany 31, 219–230.

Rodriguez R , Vriend G (1997) Professional gambling. In ‘Biomolecular structure and dynamics: recent experimental and theoretical advances’. (Eds G Vergoten, T Theophanides) pp. 79–120. (Kluwer Academic Publishers: Dordrecht)

Rosenberg MS (2005a) Evolutionary distance estimation and fidelity of pair wise sequence alignment. BMC Bioinformatics 6, 102.
CrossRef | PubMed |

Rosenberg MS (2005b) MySSP: non-stationary evolutionary sequence simulation, including indels. Evolutionary Bioinformatics Online 1, 81–83.

Roshan U, Livesay DR (2006) Probalign: multiple sequence alignment using partition function posterior probabilities. Bioinformatics 22, 2715–2721.
CrossRef | PubMed |

Rost B, Valencia A (1996) Pitfalls of protein sequence analysis. Current Opinion in Biotechnology 7, 457–461.
CrossRef | PubMed |

Sadreyev RI, Grishin NV (2004) Estimates of statistical significance for comparison of individual positions in multiple sequence alignments. BMC Bioinformatics 5, 106.
CrossRef | PubMed |

Sammeth M, Heringa J (2006) Global multiple-sequence alignment with repeats. Proteins: Structure, Function, and Bioinformatics 64, 263–274.
CrossRef |

Sammeth M, Weniger T, Harmsen D, Stoye J (2005) Alignment of tandem repeats with excision, duplication, substitution and indels (EDSI). Lecture Notes in Computer Science 3692, 276–290.

Sanchis A, Michelana JM, Latorre A, Quicke DLJ, Gärdenfors U, Belshaw R (2001) The phylogenetic analysis of variable-length sequence data: elongation factor-1α introns in European populations of the parasitoid wasp genus Pauesia (Hymenoptera: Braconidae: Aphidiinae). Molecular Biology and Evolution 18, 1117–1131.
PubMed |


Sankoff D , Cedergren RJ (1983) Simultaneous comparison of three or more sequences related by a tree. In ‘Time warps, string edits, and macromolecules: the theory and practice of sequence comparison’. (Eds D Sankoff, JB Kruskal) pp. 253–264. (Addison-Wesley: Reading)

Sankoff D, Morel C, Cedergren RJ (1973) Evolution of 5S RNA and the non-randomness of base replacement. Nature 245, 232–234.
CrossRef |

Sauder JM, Arthur JW, Dunbrack RL (2000) Large-scale comparison of protein sequence alignment algorithms with structure alignments. Proteins: Structure, Function, and Genetics 40, 6–22.
CrossRef |

Schmollinger M, Nieselt K, Kaufmann M, Morgenstern B (2004) DIALIGN P: fast pair-wise and multiple sequence alignment using parallel processors. BMC Bioinformatics 5, 128.
CrossRef | PubMed |

Schuler GD, Altschul SF, Lipman DJ (1991) A workbench for multiple alignment construction and analysis. Proteins 9, 180–190.
CrossRef | PubMed |

Schultes EA, Hraber PT, LaBean TH (1999) Estimating the contributions of selection and self-organization in RNA secondary structure. Journal of Molecular Evolution 49, 76–83.
CrossRef | PubMed |

Schultz J, Maisel S, Gerlach D, Müller T, Wolf M (2005) A common core of secondary structure of the internal transcribed spacer 2 (ITS2) throughout the Eukaryota. RNA 11, 361–364.
CrossRef | PubMed |

Schwikowski B, Vingron M (1997a) The deferred path heuristic for the generalized tree alignment problem. Journal of Computational Biology 4, 415–431.
PubMed |


Schwikowski B, Vingron M (1997b) A clustering approach to generalized tree alignment with application to Alu repeats. Lecture Notes in Computer Science 1278, 115–124.

Schwikowski B, Vingron M (2003) Sequence graphs: boosting iterated dynamic programming using locally suboptimal solutions. Discrete Applied Mathematics 127, 95–117.
CrossRef |

Shakhnovich BE (2005) Improving the precision of the structure–function relationship by considering phylogenetic context. PLoS Computational Biology 1, e9.
CrossRef | PubMed |

Shull VL, Vogler AP, Baker MD, Maddison DR, Hammond PM (2001) Sequence alignment of 18S ribosomal RNA and the basal relationships of adephagan beetles: evidence for monophyly of aquatic families and the placement of Trachypachidae. Systematic Biology 50, 945–969.
CrossRef | PubMed |

Siddharthan R (2006) Sigma: multiple alignment of weakly-conserved non-coding DNA sequence. BMC Bioinformatics 7, 143.
CrossRef | PubMed |

Siebert S, Backofen R (2005) MARNA: multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisons. Bioinformatics 21, 3352–3359.
CrossRef | PubMed |

Simmons MP (2004) Independence of alignment and tree search. Molecular Phylogenetics and Evolution 31, 874–879.
CrossRef | PubMed |

Simmons MP, Ochoterena H (2000) Gaps as characters in sequence-based phylogenetic analysis. Systematic Biology 49, 369–381.
CrossRef | PubMed |

Simmons MP, Freudenstein JV (2003) The effects of increasing genetic distance on alignment of, and tree construction from, rDNA internal transcribed spacer sequences. Molecular Phylogenetics and Evolution 26, 444–451.
CrossRef | PubMed |

Simmons MP, Carr TG, O’Neill K (2004) Relative character-state space, amount of potential phylogenetic information, and heterogeneity of nucleotide and amino acid characters. Molecular Phylogenetics and Evolution 32, 913–926.
CrossRef | PubMed |

Simossis VA, Heringa J (2004) Integrating protein secondary structure prediction and multiple sequence alignment. Current Protein and Peptide Science 5, 249–266.
CrossRef | PubMed |

Simossis VA, Heringa J (2005) PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information. Nucleic Acids Research 33, W289–W294.
CrossRef | PubMed |

Simossis VA, Kleinjung J, Heringa J (2005) Homology-extended sequence alignment. Nucleic Acids Research 33, 816–824.
CrossRef | PubMed |

Slowinski JB (1998) The number of multiple alignments. Molecular Phylogenetics and Evolution 10, 264–266.
CrossRef | PubMed |

Sluys R (1996) The notion of homology in current comparative biology. Journal of Zoological Systematics and Evolutionary Research 34, 145–152.

Smith NGC, Hurst LD (1998) Sensitivity of patterns of molecular evolution to alterations in methodology: a critique of Hughes and Yeager. Journal of Molecular Evolution 47, 493–500.
CrossRef | PubMed |

del Sol Mesa A, Pazos F, Valencia A (2003) Automatic methods for predicting functionally important residues. Journal of Molecular Biology 326, 1289–1302.
CrossRef | PubMed |

Sprinzl M, Vassilenko KS (2005) Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Research 33, D139–D140.
CrossRef | PubMed |

Stebbings LA, Mizuguchi K (2004) HOMSTRAD: recent developments of the homologous protein structure alignment database. Nucleic Acids Research 32, D203–D207.
CrossRef | PubMed |

Stocsits RR, Hofaker IL, Fried C, Stadler PF (2005) Multiple sequence alignments of partially coding nucleic acid sequences. BMC Bioinformatics 6, 160.
CrossRef | PubMed |

Stoye J, Evers D, Meyer F (1998) Rose: generating sequence families. Bioinformatics 14, 157–163.
CrossRef | PubMed |

Subramanian AR, Weyer-Menkhoff J, Kaufmann M, Morgenstern B (2005) DIALIGN-T: an improved algorithm for segment-based multiple sequence alignment. BMC Bioinformatics 6, 66.
CrossRef | PubMed |

Sze S-H, Lu Y, Yang Q (2006) A polynomial time solvable formulation of multiple sequence alignment. Journal of Computational Biology 13, 309–319.
CrossRef | PubMed |

Szklarczyk R, Heringa J (2004) Tracking repeats using significance and transitivity. Bioinformatics 20, i311–i317.
CrossRef | PubMed |

Szymanski M, Barciszewska MZ, Erdmann VA, Barciszewski J (2002) 5S ribosomal RNA database. Nucleic Acids Research 30, 176–178.
CrossRef | PubMed |

Taylor WR (1986) Identification of protein sequence homology by consensus template alignment. Journal of Molecular Biology 188, 233–258.
CrossRef | PubMed |

Taylor WR (1987) Multiple sequence alignment by a pairwise algorithm. Computer Applications in the Biosciences 3, 81–87.
PubMed |


Taylor WR (1996) Multiple protein sequence alignment: algorithms and gap insertion. Methods in Enzymology 266, 343–367.
PubMed |


Teeling H, Gloeckner FO (2006) RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits. BMC Bioinformatics 7, 66.
CrossRef | PubMed |

Telford MJ, Wise MJ, Gowri-Shankar V (2005) Consideration of RNA secondary structure significantly improves likelihood-based estimates of phylogeny: examples from the Bilateria. Molecular Biology and Evolution 22, 1129–1136.
CrossRef | PubMed |

Terry MD, Whiting MF (2005) Comparison of two alignment techniques within a single complex data set: POY versus Clustal. Cladistics 21, 272–281.
CrossRef |

Thébault P, Monestié A, Higgins DG (1999) MIAH: automatic alignment of eukaryotic SSU rRNAs. Bioinformatics 15, 341–342.
CrossRef | PubMed |

Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22, 4673–4680.
PubMed |


Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL-X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research 25, 4876–4882.
CrossRef | PubMed |

Thompson JD, Plewniak F, Poch O (1999a) BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs. Bioinformatics 15, 87–88.
CrossRef | PubMed |

Thompson JD, Plewniak F, Poch O (1999b) A comprehensive comparison of multiple sequence alignment programs. Nucleic Acids Research 27, 2682–2690.
CrossRef | PubMed |

Thompson JD, Plewniak F, Thierry J-C, Poch O (2000) DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches. Nucleic Acids Research 28, 2919–2926.
CrossRef | PubMed |

Thompson JD, Plewniak F, Ripp R, Thierry J-C, Poch O (2001) Towards a reliable objective function for multiple sequence alignments. Journal of Molecular Biology 314, 937–951.
CrossRef | PubMed |

Thompson JD, Thierry JC, Poch O (2003) RASCAL: rapid scanning and correction of multiple sequence alignments. Bioinformatics 19, 1155–1161.
CrossRef | PubMed |

Thompson JD, Koehl P, Ripp R, Poch O (2005) BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark. Proteins: Structure, Function, and Bioinformatics 61, 127–136.
CrossRef |

Thomsen R , Fogel GB , Krink T (2002) A Clustal alignment improver using evolutionary algorithms. In ‘Proccedings of the fourth congress on evolutionary computation (CEC-2002)’. (Eds DB Fogel, X Yao, G Greenwood, H Iba, P Marrow, M Shackleton) pp. 121–126. (IEEE Press: Piscataway)

Thomsen R , Fogel GB , Krink T (2003) Improvement of Clustal-derived sequence alignments with evolutionary algorithms. In ‘Proccedings of the fifth congress on evolutionary computation (CEC-2003)’. (Eds DR Sarker, R Reynolds, H Abbass, KC Tan, B McKay, D Essam, T Gedeon) pp. 1499–1507. (IEEE Press: Piscataway)

Thorne JL, Kishino H (1992) Freeing phylogenies from artifacts of alignment. Molecular Biology and Evolution 9, 1148–1162.
PubMed |


Thorne JL, Churchill GA (1995) Estimation and reliability of molecular sequence alignments. Biometrics 51, 100–113.
CrossRef | PubMed |

Thorne JL, Kishino H, Felsenstein J (1991) An evolutionary model for maximum likelihood alignment of DNA sequences. Journal of Molecular Evolution 33, 114–124.
CrossRef | PubMed |

Thorne JL, Kishino H, Felsenstein J (1992) Inching toward reality: an improved likelihood model for sequence evolution. Journal of Molecular Evolution 34, 3–16.
CrossRef | PubMed |

Titus TA, Frost DR (1996) Molecular homology assessment and phylogeny in the lizard family Opluridae (Squamata: Iguania). Molecular Phylogenetics and Evolution 6, 49–62.
CrossRef | PubMed |

Touzet H, Perriquet O (2004) CARNAC: folding families of related RNAs. Nucleic Acids Research 32, W142–W145.
PubMed |


Trystram D, Zola J (2005) Parallel multiple sequence alignment with decentralized cache support. Lecture Notes in Computer Science 3648, 1217–1226.

Tsai YT, Huang YP, Yu CT, Lu CL (2004) MuSiC: a tool for multiple sequence alignment with constraints. Bioinformatics 20, 2309–2311.
CrossRef | PubMed |

Tyson H (1992) Relationships between amino acid sequences determined through optimum alignments, clustering, and specific distance patterns: application to a group of scorpion toxins. Genome 35, 360–371.
PubMed |


van Valen L (1982) Homology and causes. Journal of Morphology 173, 305–312.
CrossRef | PubMed |

Van Walle I, Lasters I, Wyns L (2004) Align-m—a new algorithm for multiple alignment of highly divergent sequences. Bioinformatics 20, 1428–1435.
CrossRef | PubMed |

Van Walle I, Lasters I, Wyns L (2005) SABmark—a benchmark for sequence alignment that covers the entire known fold space. Bioinformatics 21, 1267–1268.
CrossRef | PubMed |

Varani G , Pardi A (1994) Structure of RNA. In ‘RNA–protein interactions’. (Eds K Nagai, IW Mattaj) pp. 1–24. (IRL Press: Oxford)

Vingron M (1999) Sequence alignment and phylogeny construction. In ‘Mathematical support for molecular biology’. (Eds M Farach-Colton, FS Roberts, M Vingron, M Waterman) pp. 53–64. (American Mathematical Society: Providence)

Vingron M, Waterman MS (1994) Sequence alignments and penalty choice: review of concepts, case studies and implications. Journal of Molecular Biology 235, 1–12.
CrossRef | PubMed |

Vingron M, von Haeseler A (1997) Towards integration of multiple alignment and phylogenetic tree construction. Journal of Computational Biology 4, 23–34.
PubMed |


Vogt G, Etzold T, Argos P (1995) An assessment of amino acid exchange matrices in aligning protein sequences: the twilight zone revisited. Journal of Molecular Biology 249, 816–831.
CrossRef | PubMed |

Vogt L (2002) Testing and weighting characters. Organisms, Diversity and Evolution 2, 319–333.
CrossRef |

Wagner GP (1989) The biological homology concept. Annual Review of Ecology and Systematics 20, 51–69.
CrossRef |

Wallace IM, Blackshields G, Higgins DG (2005a) Multiple sequence alignments. Current Opinion in Structural Biology 15, 261–266.
CrossRef | PubMed |

Wallace IM, O’Sullivan O, Higgins DG (2005b) Evaluation of iterative alignment algorithms for multiple alignment. Bioinformatics 21, 1408–1414.
CrossRef | PubMed |

Wallace IM, O’Sullivan O, Higgins DG, Notredame C (2006) M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Research 34, 1692–1699.
CrossRef | PubMed |

Wang G, Dunbrack RL (2004) Scoring profile-to-profile sequence alignments. Protein Science 13, 1612–1626.
CrossRef | PubMed |

Wang L, Jiang T (1994) On the complexity of multiple sequence alignment. Journal of Computational Biology 1, 337–348.
PubMed |


Wang Y, Li K-B (2004) An adaptive and iterative algorithm for refining multiple sequence alignment. Computational Biology and Chemistry 28, 141–148.
CrossRef | PubMed |

Wareham HT (1995) A simplified proof of the NP- and MAX SNP-hardness of multiple sequence tree alignment. Journal of Computational Biology 2, 509–514.
PubMed |


Waterman MS (1995) ‘Introduction to computational biology: maps, sequences and genomes.’ (Chapman & Hall: London)

Wegner K, Jansen S, Wuchty S, Gauges R, Kummer U (2004) CombAlign: a protein sequence comparison algorithm considering recombinations. In Silico Biology 4, 0021.

Wegnez M (1987) Letter to the editor. Cell 51, 516.
CrossRef |

Wernersson R, Pedersen AG (2003) RevTrans: multiple alignment of coding DNA from aligned amino acid sequences. Nucleic Acids Research 31, 3537–3539.
CrossRef | PubMed |

Westbrook J, Feng Z, Chen L, Yang H, Berman HM (2003) The Protein Data Bank and structural genomics. Nucleic Acids Research 31, 489–491.
CrossRef | PubMed |

Wexler Y, Yakhini Z, Kashi Y, Geiger D (2005) Finding approximate tandem repeats in genomic sequences. Journal of Computational Biology 12, 928–942.
CrossRef | PubMed |

Wheeler WC (1993) The triangle inequality and character analysis. Molecular Biology and Evolution 10, 707–712.

Wheeler WC (1994) Sources of ambiguity in nucleic acid sequence alignment. In ‘Molecular ecology and evolution: approaches and applications’. (Eds B Schierwater, B Streit, GP Wagner, R DeSalle) pp. 323–352. (Birkhäuser Verlag: Basel)

Wheeler WC (1995) Sequence alignment, parameter sensitivity, and phylogenetic analysis of molecular data. Systematic Biology 44, 321–331.
CrossRef |

Wheeler W (1996) Optimization alignment: the end of multiple sequence alignment in phylogenetics? Cladistics 12, 1–9.
CrossRef |

Wheeler W (1998) Alignment characters, dynamic programming and heuristic solutions. In ‘Molecular approaches to ecology and evolution’. (Eds R DeSalle, B Schierwater) pp. 243–251. (Birkhäuser Verlag: Basel)

Wheeler WC (1999) Fixed character states and the optimization of molecular sequence data. Cladistics 15, 379–385.
CrossRef |

Wheeler W (2001 a) Homology and DNA sequence data. In ‘The character concept in evolutionary biology’. (Ed. GP Wagner) pp. 303–317. (Academic Press: San Diego)

Wheeler W (2001b) Homology and the optimization of DNA sequence data. Cladistics 17, S3–S11.
CrossRef | PubMed |

Wheeler WC (2002) Optimization alignment: down, up, error, and improvements. In ‘Techniques in molecular systematics and evolution’. (Eds R DeSalle, G Giribet, W Wheeler) pp. 55–69. (Birkhäuser Verlag: Basel)

Wheeler WC (2003a) Iterative pass optimization of sequence data. Cladistics 19, 254–260.
CrossRef | PubMed |

Wheeler WC (2003b) Implied alignment: a synapomorphy-based multiple-sequence alignment method and its use in cladogram search. Cladistics 19, 261–268.
CrossRef | PubMed |

Wheeler WC (2003c) Search-based optimization. Cladistics 19, 348–355.
CrossRef | PubMed |

Wheeler WC (2005) Alignment, dynamic homology, and optimization. In ‘Parsimony, phylogeny, and genomics’. (Ed. VA Albert) pp. 71–80. (Oxford University Press: Oxford)

Wheeler WC (2006) Dynamic homology and the likelihood criterion. Cladistics 22, 157–170.
CrossRef |

Wheeler WC, Gladstein DS (1994) MALIGN: a multiple sequence alignment program. Journal of Heredity 85, 417–418.

Whelan S, de Bakker PIW, Quevillon E, Rodriguez N, Goldman N (2006) PANDIT: an evolution-centric database of protein and associated nucleotide domains with inferred trees. Nucleic Acids Research 34, D327–D331.
CrossRef | PubMed |

Whiting AS, Sites JW, Pellegrino KCM, Rodrigues MT (2006) Comparing alignment methods for inferring the history of the new world lizard genus Mabuya (Squamata: Scincidae). Molecular Phylogenetics and Evolution 38, 719–730.
CrossRef | PubMed |

Williams DM (1993) A note on molecular homology: multiple patterns from single datasets. Cladistics 9, 233–245.
CrossRef |

Winnepenninckx B, Backeljau T (1996) 18S rRNA alignments derived from different secondary structure models can produce alternative phylogenies. Journal of Zoological Systematics and Evolutionary Research 34, 135–143.

Winter WP, Walsh KA, Neurath H (1968) Homology as applied to proteins. Science 162, 1433.
CrossRef | PubMed |

Wrabl JO, Grishin NV (2004) Gaps in structurally similar proteins: towards improvement of multiple sequence alignment. Proteins: Structure, Function, and Bioinformatics 54, 71–87.
CrossRef |

Wuyts J, Perrière G, Van de Peer Y (2004) The European ribosomal RNA database. Nucleic Acids Research 32, D101–D103.
CrossRef | PubMed |

Xiao L, Sulaiman IM, Ryan UM, Zhou L, Atwill ER, Tischler ML, Zhang X, Fayer R, Lal AA (2002) Host adaptation and host-parasite co-evolution in Cryptosporidium: implications for taxonomy and public health. International Journal for Parasitology 32, 1773–1785.
CrossRef | PubMed |

Yamada S, Gotoh O, Yamana H (2004) Extension of Prrn: implementation of a doubly nested randomized iterative refinement strategy under a piecewise linear gap cost. Genome Informatics 15, P082.

Yu H , Deng M (2005) ClustalY: speed up the guide tree building for ClustalW. In ‘Proceedings of the eighth international conference on high-performance computing in Asia-Pacific region (HPCASIA’05)’. pp. 608–610. (IEEE Press: Piscataway)

Yuan J, Amend A, Borkowski J, DeMarco R, Bailey W, Liu Y, Xie G, Blevins R (1999) MULTICLUSTAL: a systematic method for surveying ClustalW alignment parameters. Bioinformatics 15, 862–863.
CrossRef | PubMed |

Zhang X , Kahveci T (2006) A new approach for alignment of multiple proteins. In ‘Proceedings of the 11th Pacific Symposium on Biocomputing 2006, Hawaii’. pp. 339–350.

Zhou H, Zhou Y (2005) SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structure. Bioinformatics 21, 3615–3621.
CrossRef | PubMed |

Zhu J, Liu JS, Lawrence CE (1998) Bayesian adaptive sequence alignment algorithms. Bioinformatics 14, 25–39.
CrossRef | PubMed |

Zwieb C (1997) The uRNA database. Nucleic Acids Research 25, 102–103.
CrossRef | PubMed |







Full Text PDF (971 KB) Export Citation Cited By (60)

View Altmetrics