研究论文

混合非参数回归的贝叶斯推断

展开
  • 上海大学 理学院, 上海 200444
何幼桦(1960—), 男, 副教授, 博士, 研究方向为概率统计. E-mail: heyouhua@t.shu.edu.cn

收稿日期: 2019-09-20

  网络出版日期: 2021-10-22

基金资助

国家自然科学基金资助项目(11971296)

Bayesian inference for mixture of nonparametric regression models

Expand
  • College of Sciences, Shanghai University, Shanghai 200444, China

Received date: 2019-09-20

  Online published: 2021-10-22

摘要

针对混合非参数回归问题, 给出了一种基于贝叶斯框架的推断方法. 在该方法中对每一个非参数混合成分用一个随机过程的有限维分布族作为先验, 同时分别构造混合比例、随机误差的方差和非参数混合成分的贝叶斯估计, 并通过马尔科夫链蒙特卡洛(Markov chain Monte Carlo, MCMC) 法抽样来进行后验推断. 数值模拟分别从样本量、回归曲线的相对位置和多分类情况 3 个角度进行. 模拟结果表明, 相较于全局期望最大化(global expectation maximalization)算法, 混合非参数回归的贝叶斯推断方法能够有效利用先验信息来提高模型的拟合和预测能力. 最后将混合非参数回归的贝叶斯推断方法应用于蚜虫与受感染烟草植物的实验, 同时解决了数据的聚类与回归拟合问题, 其有效性和适用性得证.

本文引用格式

李道扬, 何幼桦 . 混合非参数回归的贝叶斯推断[J]. 上海大学学报(自然科学版), 2021 , 27(5) : 856 -865 . DOI: 10.12066/j.issn.1007-2861.2189

Abstract

For mixing nonparametric regression models, an inference method is proposed based on the Bayesian framework. In this method, a finite dimensional distribution family of the stochastic process is used as a prior distribution for each nonparametric component, and Bayesian estimators of mixture proportions, each random error's variance, and nonparametric components are constructed respectively. A Markov chain Monte Carlo (MCMC) method is used for posterior inference. The numerical simulations are performed from the perspectives of sample size, relative position of the regression curve, and multiclassification. The results show that, compared with the generalised expectation maximisation (GEM) algorithm, the Bayesian inference method of mixing nonparametric regression can effectively use the prior information to improve the ability of fitting and prediction. Finally, the Bayesian inference method is applied to the experimental data from aphids and infected tobacco plants and solved clustering and regression problems. This also demonstrates the effectiveness and applicability of the method.

参考文献

[1] Goldfeld S M, Quandt R E. A Markov model for switching regression[J]. Journal of Econometrics, 1973, 1(1): 3-15.
[2] Wedel M, Desarbo W S. A mixture likelihood approach for generalized linear models[J]. Journal of Classification, 1995, 12(1): 21-55.
[3] Gaffney S, Smyth P. Trajectory clustering with mixtures of regression models[C]// Knowledge Discovery and Data Mining. 1999: 63-72.
[4] Song W, Yao W, Xing Y. Robust mixture regression model fitting by Laplace distri-bution[J]. Computational Statistics and Data Analysis, 2014, 71: 128-137.
[5] Huang M, Li R, Wang S. Nonparametric mixture of regression models[J]. Journal of the American Statistical Association, 2013, 108(503): 929-941.
[6] Wu Q, Yao W. Mixtures of quantile regressions[J]. Computational Statistics and Data Analysis, 2016, 93: 162-176.
[7] 胡烨. 半参数混合泊松回归模型的估计[D]. 南京: 南京师范大学, 2017.
[7] Hu Y. The estimation of semiparametric Poisson mixture regression model[D]. Nanjing: Nanjing Normal University, 2017.
[8] Fraley C, Raftery A E. How many clusters? Which clustering method? Answers via model-based cluster analysis[J]. The Computer Journal, 1998, 41(8): 578-588.
[9] Xiang S, Yao W. Semiparametric mixtures of nonparametric regressions[J]. Annals of the Institute of Statistical Mathematics, 2018, 70(1): 1-24.
[10] Boiteau G, Singh M, Singh R P, et al. Rate of spread of PVY n by alate Myzus persicae (Sulzer) from infected to healthy plants under laboratory conditions[J]. Potato Research, 1998, 41(4): 335-344.
[11] Grün B, Leisch F. Finite mixtures of generalized linear regression models[C]// Recent Advances in Linear Models and Related Areas. 2008: 205-230.
文章导航

/