Journal of University of Science and Technology of China ›› 2020, Vol. 50 ›› Issue (10): 1291-1302.DOI: 10.3969/j.issn.0253-2778.2020.10.002

• Research Article • Previous Articles     Next Articles

Sampling multivariate count variables with prespecified Pearson correlation using marginal regular vine copulas

  

  1. Department of Statistics and Finance, School of Management, University of Science and Technology of China, Hefei  230026, China
  • Received:2020-08-17 Revised:2020-10-10 Online:2020-10-31 Published:2020-12-07
  • About author:Yuan Zhenfei: PhD. Research field: Probability and statistics. E-mail: zfyuan@mail.ustc.edu.cn
    Hu Taizhong: Corresponding author, PhD/professor. Research field: Probability and statistics. E-mail: thu@mail.ustc.edu.cn

Abstract:

The problem of sampling multivariate count variables has practical significance. Ref.[1]proposed an algorithm for sampling multivariate count random variables based on C-vine copulas, by which the parameters 

ρi,j|D

 of edge 

ei,j|D

 of the C-vine structure are estimated by optimizing the difference between the sample partial correlation 

σ︿i,j|D

 and the partial correlation 

σi,j|D

 calculated from the prespecified correlation matrix by the Pearson recurrence formula, where 

D

 is a conditioning node set. We introduce the concept of marginal regular vine copula, which leads to directly optimizing the difference between the sample correlation 

σ︿ij

 and the targeted correlation 

σij

 for pairs of variables. Three simulation studies illustrate that the new sampling method generates more accurate results than the C-vine sampling method in Ref.[1]and the Naive sampling method in Ref.[3]. The sampling algorithm routines are implemented in Python as package countvar in PyPi.

Key words: C-vine copula, marginal regular vine copula, multivariate count random variable, naive sampling method, regular vine, sampling

CLC Number: