中国科学技术大学学报 ›› 2020, Vol. 50 ›› Issue (10): 1303-1314.DOI: 10.3969/j.issn.0253-2778.2020.10.003

• 科研论文 • 上一篇    下一篇

当前状态数据中比例风险模型的一种贝叶斯变量选择方法

  

  1. 中国科学技术大学统计与金融系,安徽合肥 230026
  • 收稿日期:2020-09-12 修回日期:2020-10-20 出版日期:2020-10-31 发布日期:2020-12-07

Bayesian variable selection for proportional hazards model with current status data

  1. Department of Statistics and Finance, University of Science and Technology of China, Hefei 230026,  China
  • Received:2020-09-12 Revised:2020-10-20 Online:2020-10-31 Published:2020-12-07
  • About author:Cui Di: PhD cadidate.Research field:Statistical model. E-mail:cuidi@mail.ustc.edu.cn

    Zhang Weiping:Corresponding author, PhD/professor. Research field:Statistical learning theory.E-mail:zwp@ustc.edu.cn

摘要: 针对当前状态数据中的比例风险模型提出了一种基于期望-最大化的贝叶斯变量选择方法. 该模型能够同时进行参数估计和变量选择,有效地增强了模型的可解释性和预测能力.为了识别风险因素, 首先对表示协变量是否存在的指示变量赋予合适的先验分布,使用单调样条来近似基准累积风险函数;然后 通过使用基于泊松隐变量的两阶段数据扩充技术提出了一种有效的期望-最大化对模型拟合算法;最后通过模拟研究和一个实例分析证明了所提方法的有效性.

关键词: 比例风险模型, 贝叶斯变量选择, 当前状态数据, EM算法, 样条

Abstract: A Bayesian proportional hazards (PH) model is proposed for analyzing current status data based on Expectation-Maximization Variable Selection (EMVS) method. This model can estimate parameters and select variables simultaneously, which efficiently improves  model interpretability and predictive ability. To identify risk factors,  appropriate priors are assigned on the indicator variables that denote the existence of covariates. The baseline cumulative hazard function is approximated via monotone splines. A novel Expectation-Maximization (EM) algorithm is developed for model fitting by using a two-stage data augmentation procedure involving latent Poisson variables. Finally, the performance of proposed method is investigated by simulations and a real data analysis.

Key words: proportional hazards model, Bayesian variable selection, current status data, EM algorithm, spline

中图分类号: