加剧 发表于 2025-4-1 05:06:44

http://reply.papertrans.cn/63/6221/622081/622081_61.png

Little 发表于 2025-4-1 09:31:58

http://reply.papertrans.cn/63/6221/622081/622081_62.png

Acetaminophen 发表于 2025-4-1 13:14:19

,Pre-training Techniques for Improving Text-to-Speech Synthesis by Automatic Speech Recognition Baseterial. In this paper, we propose a pre-training technique framework to improve the performance of low-resource speech synthesis. The idea is to extend the training material of TTS model by using ASR based data augmentation method. Specifically, we first build a frame-wise phoneme classification net

ureter 发表于 2025-4-1 16:01:38

Interplay Between Prosody and Syntax-Semantics: Evidence from the Prosodic Features of Mandarin Tagntences. The statement parts in the tag questions exhibited focal characteristics similar to Mandarin general questions, while the tag “. showed the characteristics of post-focal compression. These findings were at odds with our hypothesis. Results of the current study suggested that the focal posit

支架 发表于 2025-4-1 21:03:21

,Improving Fine-Grained Emotion Control and Transfer with Gated Emotion Representations in Speech Syrom a jointly trained emotion strength predictor, our proposed method also allows to manually assign and control the fine-grained emotion strengths during inference. In experiment part, the proposed method is evaluated in both non-transferred emotional speech synthesis and cross-speaker transferred
页: 1 2 3 4 5 6 [7]
查看完整版本: Titlebook: Man-Machine Speech Communication; 17th National Confer Ling Zhenhua,Gao Jianqing,Jia Jia Conference proceedings 2023 The Editor(s) (if appl