Previous Article | Next Article 
Journal of Bacteriology, April 2002, p. 2072-2080, Vol. 184, No. 8
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.8.2072-2080.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Inferring Genome Trees by Using a Filter To Eliminate Phylogenetically Discordant Sequences and a Distance Matrix Based on Mean Normalized BLASTP Scores
G. D. Paul Clarke,1 Robert G. Beiko,2 Mark A. Ragan,3,4* and Robert L. Charlebois1,2,4
NeuroGadgets Inc., Ottawa, Ontario K1G 4B5,,1
Department of Biology, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada, and,2
The Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia,3
Program in Evolutionary Biology, Canadian Institute for Advanced Research,4
Received 31 October 2001/
Accepted 14 January 2002
Darwin's paradigm holds that the diversity of present-day organisms has arisen via a process of genetic descent with modification, as on a bifurcating tree. Evidence is accumulating that genes are sometimes transferred not along lineages but rather across lineages. To the extent that this is so, Darwin's paradigm can apply only imperfectly to genomes, potentially complicating or perhaps undermining attempts to reconstruct historical relationships among genomes (i.e., a genome tree). Whether most genes in a genome have arisen via treelike (vertical) descent or by lateral transfer across lineages can be tested if enough complete genome sequences are used. We define a phylogenetically discordant sequence (PDS) as an open reading frame (ORF) that exhibits patterns of similarity relationships statistically distinguishable from those of most other ORFs in the same genome. PDSs represent between 6.0 and 16.8% (mean, 10.8%) of the analyzable ORFs in the genomes of 28 bacteria, eight archaea, and one eukaryote (Saccharomyces cerevisiae). In this study we developed and assessed a distance-based approach, based on mean pairwise sequence similarity, for generating genome trees. Exclusion of PDSs improved bootstrap support for basal nodes but altered few topological features, indicating that there is little systematic bias among PDSs. Many but not all features of the genome tree from which PDSs were excluded are consistent with the 16S rRNA tree.
* Corresponding author. Mailing address: The Institute for Molecular Bioscience, The University of Queensland, Brisbane, Qld 4072, Australia. Phone: 61-7-3365-1160. Fax: 61-7-3365-4388. E-mail: m.ragan{at}imb.uq.edu.au.
Journal of Bacteriology, April 2002, p. 2072-2080, Vol. 184, No. 8
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.8.2072-2080.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
This article has been cited by other articles:
-
Wellner, A., Gophna, U.
(2008). Neutrality of Foreign Complex Subunits in an Experimental Model of Lateral Gene Transfer. Mol Biol Evol
25: 1835-1840
[Abstract]
[Full Text]
-
Papadopoulos, J. S., Agarwala, R.
(2007). COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics
23: 1073-1079
[Abstract]
[Full Text]
-
Fukami-Kobayashi, K., Minezaki, Y., Tateno, Y., Nishikawa, K.
(2007). A Tree of Life Based on Protein Domain Organizations. Mol Biol Evol
24: 1181-1189
[Abstract]
[Full Text]
-
Doolittle, W. F., Bapteste, E.
(2007). Inaugural Article: Pattern pluralism and the Tree of Life hypothesis. Proc. Natl. Acad. Sci. USA
104: 2043-2049
[Abstract]
[Full Text]
-
Lebrun, E., Santini, J. M., Brugna, M., Ducluzeau, A.-L., Ouchane, S., Schoepp-Cothenet, B., Baymann, F., Nitschke, W.
(2006). The Rieske Protein: A Case Study on the Pitfalls of Multiple Sequence Alignments and Phylogenetic Reconstruction. Mol Biol Evol
23: 1180-1191
[Abstract]
[Full Text]
-
Kunin, V., Goldovsky, L., Darzentas, N., Ouzounis, C. A.
(2005). The net of life: Reconstructing the microbial phylogenetic network. Genome Res
15: 954-959
[Abstract]
[Full Text]
-
Gu, X., Huang, W., Xu, D., Zhang, H.
(2005). GeneContent: software for whole-genome phylogenetic analysis. Bioinformatics
21: 1713-1714
[Abstract]
[Full Text]
-
Gophna, U., Doolittle, W. F., Charlebois, R. L.
(2005). Weighted Genome Trees: Refinements and Applications. J. Bacteriol.
187: 1305-1316
[Abstract]
[Full Text]
-
Kunin, V., Ahren, D., Goldovsky, L., Janssen, P., Ouzounis, C. A.
(2005). Measuring genome conservation across taxa: divided strains and united kingdoms. Nucleic Acids Res
33: 616-621
[Abstract]
[Full Text]
-
Yang, S., Doolittle, R. F., Bourne, P. E.
(2005). Phylogeny determined by protein domain content. Proc. Natl. Acad. Sci. USA
102: 373-378
[Abstract]
[Full Text]
-
Charlebois, R. L., Doolittle, W. F.
(2004). Computing prokaryotic gene ubiquity: Rescuing the core from extinction. Genome Res
14: 2469-2477
[Abstract]
[Full Text]
-
Novichkov, P. S., Omelchenko, M. V., Gelfand, M. S., Mironov, A. A., Wolf, Y. I., Koonin, E. V.
(2004). Genome-Wide Molecular Clock and Horizontal Gene Transfer in Bacterial Evolution. J. Bacteriol.
186: 6575-6585
[Abstract]
[Full Text]
-
Gu, X., Zhang, H.
(2004). Genome Phylogenetic Analysis Based on Extended Gene Contents. Mol Biol Evol
21: 1401-1408
[Abstract]
[Full Text]
-
Teeling, H., Lombardot, T., Bauer, M., Ludwig, W., Glockner, F. O.
(2004). Evaluation of the phylogenetic position of the planctomycete 'Rhodopirellula baltica' SH 1 by means of concatenated ribosomal protein sequences, DNA-directed RNA polymerase subunit sequences and whole genome trees. Int. J. Syst. Evol. Microbiol.
54: 791-801
[Abstract]
[Full Text]
-
Lynn, D. J., Singer, G. A. C., Hickey, D. A.
(2002). Synonymous codon usage is subject to selection in thermophilic bacteria. Nucleic Acids Res
30: 4272-4277
[Abstract]
[Full Text]
Copyright © 2002 by the American Society for Microbiology. All rights reserved.