Computing Ka and Ks with a consideration of unequal transitional substitutions.

Advanced Search

Zhang Zhang, Jun Li, Jun Yu

Author Information

Zhang Zhang: Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China. zhangzhang@genomics.org.cn

PMID: 16740169 DOI: 10.1186/1471-2148-6-44

BACKGROUND: Approximate methods for estimating nonsynonymous and synonymous substitution rates (Ka and Ks) among protein-coding sequences have adopted different mutation (substitution) models. In the past two decades, several methods have been proposed but they have not considered unequal transitional substitutions (between the two purines, A and G, or the two pyrimidines, T and C) that become apparent when sequences data to be compared are vast and significantly diverged.
RESULTS: We propose a new method (MYN), a modified version of the Yang-Nielsen algorithm (YN), for evolutionary analysis of protein-coding sequences in general. MYN adopts the Tamura-Nei Model that considers the difference among rates of transitional and transversional substitutions as well as factors in codon frequency bias. We evaluate the performance of MYN by comparing to other methods, especially to YN, and to show that MYN has minimal deviations when parameters vary within normal ranges defined by empirical data.
CONCLUSION: Our comparative results deriving from consistency analysis, computer simulations and authentic datasets, indicate that ignoring unequal transitional rates may lead to serious biases and that MYN performs well in most of the tested cases. These results also suggest that acquisitions of reliable synonymous and nonsynonymous substitution rates primarily depend on less biased estimates of transition/transversion rate ratio.

Mol Biol Evol. 1986 Sep;3(5):418-26 [PMID: 3444411]
J Mol Evol. 1985;22(2):160-74 [PMID: 3934395]
J Mol Evol. 1993 Jan;36(1):96-9 [PMID: 8433381]
Mol Biol Evol. 1993 Mar;10(2):271-81 [PMID: 8487630]
Mol Biol Evol. 1993 May;10(3):512-26 [PMID: 8336541]
Mol Biol Evol. 1994 Sep;11(5):715-24 [PMID: 7968485]
Mol Biol Evol. 1994 Sep;11(5):725-36 [PMID: 7968486]
J Mol Evol. 1995 Feb;40(2):190-226 [PMID: 7699723]
J Mol Evol. 1995 Dec;41(6):1152-9 [PMID: 8587111]
Mol Biol Evol. 1996 Jan;13(1):105-14 [PMID: 8583885]
Nature. 1997 Jan 9;385(6612):151-4 [PMID: 8990116]
Comput Appl Biosci. 1997 Oct;13(5):555-6 [PMID: 9367129]
J Mol Evol. 1998 Apr;46(4):409-18 [PMID: 9541535]
Genome Res. 1998 Dec;8(12):1233-44 [PMID: 9872979]
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D447-53 [PMID: 15608235]
PLoS Biol. 2005 Feb;3(2):e38 [PMID: 15685292]
Mol Biol Evol. 2000 Jan;17(1):32-43 [PMID: 10666704]
Mol Biol Evol. 2000 Aug;17(8):1251-8 [PMID: 10908645]
Genome Res. 2002 Jan;12(1):198-202 [PMID: 11779845]
Mol Biol Evol. 2004 Dec;21(12):2290-8 [PMID: 15329386]
J Mol Evol. 1980 Dec;16(2):111-20 [PMID: 7463489]
Mol Biol Evol. 1985 Mar;2(2):150-74 [PMID: 3916709]

Algorithms

Amino Acid Substitution

Biological Evolution

Computer Simulation

Genetics, Medical

Humans

Models, Genetic

Oryza

Point Mutation

Sequence Alignment

Sequence Analysis, DNA

Comparative Study Journal Article Research Support, Non-U.S. Gov't

OpenLB
Open Library of Bioscience