中国科学技术大学学报 ›› 2015, Vol. 45 ›› Issue (7): 575-581.DOI: 10.3969/j.issn.0253-2778.2015.07.007

• 科研论文 • 上一篇    

基于声道长度对齐的年龄语音转换

  

  1. 1.中国科学技术大学自动化系,安徽合肥 230027; 2.中国科学院合肥智能机械研究所,安徽合肥 230031; 3.语音及语言信息处理国家工程实验室,安徽合肥 230027
  • 出版日期:2015-07-30 发布日期:2023-05-15
  • 通讯作者: 汪增福,男,博士/教授.
  • 作者简介:李金中,男,1990年生,硕士生. 研究方向:语音信号处理.
  • 基金资助:
    国家自然科学基金(61472393),安徽省自主创新专项基金(13Z02008)资助.

Vocal tract length aligning based mandarin age voice conversion#br#

  1. 1.Dept. of Automation, University of Science & Technology of China, Hefei 230027, China;
    2.Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei 230031, China;
    3.National Engineering Laboratory of Speech and Language Information Processing, Hefei 230027, China
  • Online:2015-07-30 Published:2023-05-15

摘要: 提出一种基于声道长度对齐的年龄语音转换方法.该方法包含频谱转换和基频转换两个方面,前者在频域依据声道因子和弯折函数对已进行基音标注过的每一帧语音的频谱进行弯折转换;后者对基频特征的转换采用线性变换方法.实验结果表明,通过对同一人不同年龄段的语音进行转换合成,由年龄较大语音向年龄较小语音转换时,转换合成得到的语音频谱平均距离得到明显减小,转换效果较好,而从年龄较小语音向年龄较大语音转换时,频谱平均距离减少较小,同时女性年龄语音转换的效果和自然度都好于男性.

关键词: 年龄语音转换, 声道长度对齐, 基音标注, 声道因子, 弯折函数 , 线性变换

Abstract: Vocal tract length aligning was proposed for mandarin age voice conversion which transforms age speech into some required target age speech. In the method, the speech spectrum which has been pitch marked was warped in the frequency domain based on the warping factor and warping function while pitch was converted by linear transformation. The experimental results show that the effect of transforming old age speech into a young one is better than otherwise and that the average spectra distance of the former is markedly reduced.Meanwhile, age voice conversion is better for female voice than for male voice in effectiveness and naturalness.

Key words: age voice conversion, vocal tract length aligning, pitch marker, warping factor, warping function, linear transformation

中图分类号: