統計学輪講 第20回

日時 2019年11月19日(火)
14時55分 ~ 15時45分
場所 経済学研究科棟 3階 第3教室
講演者 Tung Dang (農学生命科学研究科M1)
演題 Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model
概要

The pattern of molecular evolution varies among gene sites and genes in a genome. By taking into account the complex heterogeneity of evolutionary processes among sites in a genome, Bayesian infinite mixture models of genomic evolution enable robust phylogenetic inference. With large modern data sets, however, the computational burden of Markov chain Monte Carlo sampling techniques becomes prohibitive. Here, we have developed a variational Bayesian procedure to speed up the widely used PhyloBayes MPI program, which deals with the heterogeneity of amino acid profiles. Rather than sampling from the posterior distribution, the procedure approximates the (unknown) posterior distribution using a manageable distribution called the variational distribution. The parameters in the variational distribution are estimated by minimizing Kullback–Leibler divergence. To examine performance, we analyzed three empirical data sets consisting of mitochondrial, plastid-encoded, and nuclear proteins. Our variational method accurately approximated the Bayesian inference of phylogenetic tree, mixture proportions, and the amino acid propensity of each component of the mixture while using orders of magnitude less computational time.

参考文献
Tung Dang and Hirohisa Kishino. “Stochastic variational inference for Bayesian phylogenetics: A case of CAT model.” Molecular biology and evolution 36.4 (2019): 825–833.