上海大学学报(自然科学版) ›› 2021, Vol. 27 ›› Issue (5): 856-865.doi: 10.12066/j.issn.1007-2861.2189

• 研究论文 • 上一篇    下一篇

混合非参数回归的贝叶斯推断

李道扬, 何幼桦()   

  1. 上海大学 理学院, 上海 200444
  • 收稿日期:2019-09-20 出版日期:2021-10-31 发布日期:2021-10-22
  • 通讯作者: 何幼桦 E-mail:heyouhua@t.shu.edu.cn
  • 作者简介:何幼桦(1960—), 男, 副教授, 博士, 研究方向为概率统计. E-mail: heyouhua@t.shu.edu.cn
  • 基金资助:
    国家自然科学基金资助项目(11971296)

Bayesian inference for mixture of nonparametric regression models

LI Daoyang, HE Youhua()   

  1. College of Sciences, Shanghai University, Shanghai 200444, China
  • Received:2019-09-20 Online:2021-10-31 Published:2021-10-22
  • Contact: HE Youhua E-mail:heyouhua@t.shu.edu.cn

摘要:

针对混合非参数回归问题, 给出了一种基于贝叶斯框架的推断方法. 在该方法中对每一个非参数混合成分用一个随机过程的有限维分布族作为先验, 同时分别构造混合比例、随机误差的方差和非参数混合成分的贝叶斯估计, 并通过马尔科夫链蒙特卡洛(Markov chain Monte Carlo, MCMC) 法抽样来进行后验推断. 数值模拟分别从样本量、回归曲线的相对位置和多分类情况 3 个角度进行. 模拟结果表明, 相较于全局期望最大化(global expectation maximalization)算法, 混合非参数回归的贝叶斯推断方法能够有效利用先验信息来提高模型的拟合和预测能力. 最后将混合非参数回归的贝叶斯推断方法应用于蚜虫与受感染烟草植物的实验, 同时解决了数据的聚类与回归拟合问题, 其有效性和适用性得证.

关键词: 混合回归, 非参数回归, 贝叶斯估计, 有限维分布, MCMC 抽样

Abstract:

For mixing nonparametric regression models, an inference method is proposed based on the Bayesian framework. In this method, a finite dimensional distribution family of the stochastic process is used as a prior distribution for each nonparametric component, and Bayesian estimators of mixture proportions, each random error's variance, and nonparametric components are constructed respectively. A Markov chain Monte Carlo (MCMC) method is used for posterior inference. The numerical simulations are performed from the perspectives of sample size, relative position of the regression curve, and multiclassification. The results show that, compared with the generalised expectation maximisation (GEM) algorithm, the Bayesian inference method of mixing nonparametric regression can effectively use the prior information to improve the ability of fitting and prediction. Finally, the Bayesian inference method is applied to the experimental data from aphids and infected tobacco plants and solved clustering and regression problems. This also demonstrates the effectiveness and applicability of the method.

Key words: mixture models, nonparametric regression, Bayesian estimation, finite dimensional distribution, Markov chain Monte Carlo (MCMC) sampling

中图分类号: