中国科学技术大学学报 ›› 2020, Vol. 50 ›› Issue (10): 1291-1302.DOI: 10.3969/j.issn.0253-2778.2020.10.002

• 科研论文 • 上一篇    下一篇

基于边际正则藤copulas对具有既定皮尔逊相关系数的多元离散随机变量的抽样算法

  

  1. 中国科学技术大学管理学院统计金融系,安徽合肥 230026
  • 收稿日期:2020-08-17 修回日期:2020-10-10 出版日期:2020-10-31 发布日期:2020-12-07

Sampling multivariate count variables with prespecified Pearson correlation using marginal regular vine copulas

  1. Department of Statistics and Finance, School of Management, University of Science and Technology of China, Hefei  230026, China
  • Received:2020-08-17 Revised:2020-10-10 Online:2020-10-31 Published:2020-12-07
  • About author:Yuan Zhenfei: PhD. Research field: Probability and statistics. E-mail: zfyuan@mail.ustc.edu.cn
    Hu Taizhong: Corresponding author, PhD/professor. Research field: Probability and statistics. E-mail: thu@mail.ustc.edu.cn

摘要: 基于多元离散随机变量的抽样问题在实践中的应用价值,Erhardt和Czado 提出了基于C藤Copulas的多元离散随机变量的抽样算法,其优化参数为C藤的边参数,目标函数为给定的皮尔逊偏相关系数与样本偏相关系数的距离. 本文引入了边际正则藤Copulas的概念,进而直接以随机变量对的样本相关系数与给定的皮尔逊相关系数σij之间的距离为目标函数进行优化. 三组模拟实验结果分别与文献[1]提出的基于C藤的抽样算法,文献[3]中使用的Naive基准抽样算法相比,基于边际正则藤Copula的抽样算法具有相对较高的精确性.本文中所使用的抽样算法通过Python语言实现并打包命名为countvar上传至PyPi.

关键词: C藤Copula, 边际正则藤Copula, 多远离散随机变量, Naive抽样算法, 正则藤, 抽样

Abstract:

The problem of sampling multivariate count variables has practical significance. Ref.[1]proposed an algorithm for sampling multivariate count random variables based on C-vine copulas, by which the parameters 

ρi,j|D

 of edge 

ei,j|D

 of the C-vine structure are estimated by optimizing the difference between the sample partial correlation 

σ︿i,j|D

 and the partial correlation 

σi,j|D

 calculated from the prespecified correlation matrix by the Pearson recurrence formula, where 

D

 is a conditioning node set. We introduce the concept of marginal regular vine copula, which leads to directly optimizing the difference between the sample correlation 

σ︿ij

 and the targeted correlation 

σij

 for pairs of variables. Three simulation studies illustrate that the new sampling method generates more accurate results than the C-vine sampling method in Ref.[1]and the Naive sampling method in Ref.[3]. The sampling algorithm routines are implemented in Python as package countvar in PyPi.

Key words: C-vine copula, marginal regular vine copula, multivariate count random variable, naive sampling method, regular vine, sampling

中图分类号: