单纯 发表于 2025-3-25 05:48:58
The COST-277 European Action: An OverviewThis paper summarizes the rationale for proposing the COST-277 “nonlinear speech processing” action, and the work done during these last four years. In addition, future perspectives are described.主讲人 发表于 2025-3-25 10:41:12
Neuro-fuzzy Logic in Signal Processing for Communications: From Bits to Protocolser in order to face complicate problems of non-Gaussian noise, to practical and robust implementations of these systems and up to higher layers in the communication chain, which are engaged in the protocol design. The ability for modeling uncertainty with a reasonable trade-off between complexity an紧张过度 发表于 2025-3-25 15:11:33
Connected Operators for Signal and Image Processing large number of new multimedia services. Traditionally, digital images were represented as rectangular arrays of pixels and digital video was seen as a continuous flow of digital images. New multimedia applications and services imply a representation that is closer to the real world or, at least, t感情脆弱 发表于 2025-3-25 16:46:16
Exploiting High-Level Information Provided by ALISP in Speaker Recognitionstral features. Recently, various works have demonstrated that high-level features convey more speaker information and can be added to the low-level features in order to increase the robustness of the system. This paper describes a text-independent speaker recognition system exploiting high-level inRepetitions 发表于 2025-3-25 20:04:42
MLP Internal Representation as Discriminative Features for Improved Speaker RecognitionASR) the projection provided by the pre-squashed outputs from a one hidden layer multi-layer perceptron (MLP) trained to recognise speech sub-units (phonemes) has previously been shown to significantly increase ASR performance. An analogous approach cannot be applied directly to speaker recognitionBUCK 发表于 2025-3-26 01:23:27
http://reply.papertrans.cn/67/6674/667315/667315_26.pngmodifier 发表于 2025-3-26 05:26:14
Parameter Optimization in a Text-Dependent Cryptographic-Speech-Key Generation Taskelection of the number of dimensions with the best performance for each of the phonemes. First, the Mel frequency cepstral coefficients, (first and second derivatives) of the speech signal are calculated. Then, an Automatic Speech Recogniser, which models are previously trained, is used to detect th评论者 发表于 2025-3-26 08:34:16
http://reply.papertrans.cn/67/6674/667315/667315_28.png性别 发表于 2025-3-26 16:41:28
http://reply.papertrans.cn/67/6674/667315/667315_29.png狗舍 发表于 2025-3-26 17:51:22
F0 and Intensity Distributions of Marsec Speakers: Types of Speaker Prosody. In this research, an attempt is made in two analyses to characterize some prosodic aspects of individual differences within the speaker community. For this, the statistical distributions of F0 and intensity parameters were examined. It was found in the first analysis (34 male speakers, nine speech