找回密码
 To register

QQ登录

只需一步,快速开始

扫一扫,访问微社区

Titlebook: Speech Separation by Humans and Machines; Pierre Divenyi Book 2005 Springer-Verlag US 2005 Information.Neuroscience.cognition.computer sci

[复制链接]
查看: 21023|回复: 59
发表于 2025-3-21 16:50:45 | 显示全部楼层 |阅读模式
书目名称Speech Separation by Humans and Machines
编辑Pierre Divenyi
视频video
概述Provides comprehensive and authoritative discussion of how humans separate speech and the state of the art in approaching these abilities with machines
图书封面Titlebook: Speech Separation by Humans and Machines;  Pierre Divenyi Book 2005 Springer-Verlag US 2005 Information.Neuroscience.cognition.computer sci
描述There is a serious problem in the recognition of sounds. It derives from the fact that they do not usually occur in isolation but in an environment in which a number of sound sources (voices, traffic, footsteps, music on the radio, and so on) are active at the same time. When these sounds arrive at the ear of the listener, the complex pressure waves coming from the separate sources add together to produce a single, more complex pressure wave that is the sum of the individual waves. The problem is how to form separate mental descriptions of the component sounds, despite the fact that the “mixture wave” does not directly reveal the waves that have been summed to form it. The name auditory scene analysis (ASA) refers to the process whereby the auditory systems of humans and other animals are able to solve this mixture problem. The process is believed to be quite general, not specific to speech sounds or any other type of sounds, and to exist in many species other than humans. It seems to involve assigning spectral energy to distinct “auditory objects” and “streams” that serve as the mental representations of distinct sound sources in the environment and the patterns that they make as
出版日期Book 2005
关键词Information; Neuroscience; cognition; computer science; development; multimedia; quality; science; speech pr
版次1
doihttps://doi.org/10.1007/b99695
isbn_softcover978-1-4419-5460-2
isbn_ebook978-0-387-22794-8
copyrightSpringer-Verlag US 2005
The information of publication is updating

书目名称Speech Separation by Humans and Machines影响因子(影响力)




书目名称Speech Separation by Humans and Machines影响因子(影响力)学科排名




书目名称Speech Separation by Humans and Machines网络公开度




书目名称Speech Separation by Humans and Machines网络公开度学科排名




书目名称Speech Separation by Humans and Machines被引频次




书目名称Speech Separation by Humans and Machines被引频次学科排名




书目名称Speech Separation by Humans and Machines年度引用




书目名称Speech Separation by Humans and Machines年度引用学科排名




书目名称Speech Separation by Humans and Machines读者反馈




书目名称Speech Separation by Humans and Machines读者反馈学科排名




单选投票, 共有 1 人参与投票
 

1票 100.00%

Perfect with Aesthetics

 

0票 0.00%

Better Implies Difficulty

 

0票 0.00%

Good and Satisfactory

 

0票 0.00%

Adverse Performance

 

0票 0.00%

Disdainful Garbage

您所在的用户组没有投票权限
发表于 2025-3-22 00:17:56 | 显示全部楼层
th machinesThere is a serious problem in the recognition of sounds. It derives from the fact that they do not usually occur in isolation but in an environment in which a number of sound sources (voices, traffic, footsteps, music on the radio, and so on) are active at the same time. When these sounds
发表于 2025-3-22 03:31:01 | 显示全部楼层
发表于 2025-3-22 08:25:36 | 显示全部楼层
On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis,fic objectives such as enhancing ASR and speech intelligibility. The resulting evaluation metric has the properties of simplicity and generality, and is easy to apply when the premixing target is available. The goal of the ideal binary mask has led to effective for speech separation algorithms that attempt to explicitly estimate such masks.
发表于 2025-3-22 09:06:27 | 显示全部楼层
发表于 2025-3-22 13:14:30 | 显示全部楼层
Blind Source Separation Using Graphical Models,el the temporal structure of the speech signals. A maximum likelihood approach is used to separate a voice from jazz music given only one mixed channel. In case of two microphones, the problem of separating two voices recorded by two microphones has been tackled. The mixing coefficients, time delays
发表于 2025-3-22 20:48:16 | 显示全部楼层
发表于 2025-3-22 22:22:28 | 显示全部楼层
Automatic Speech Processing by Inference in Generative Models,he basic paradigm explored was to design a simple model for the data we observe in which the key quantities that we would eventually like to compute appear as hidden (latent) variables. By executing probabilistic inference in such models, we automatically estimating the hidden quantities and thus pe
发表于 2025-3-23 01:50:57 | 显示全部楼层
发表于 2025-3-23 06:02:44 | 显示全部楼层
 关于派博传思  派博传思旗下网站  友情链接
派博传思介绍 公司地理位置 论文服务流程 影响因子官网 SITEMAP 大讲堂 北京大学 Oxford Uni. Harvard Uni.
发展历史沿革 期刊点评 投稿经验总结 SCIENCEGARD IMPACTFACTOR 派博系数 清华大学 Yale Uni. Stanford Uni.
|Archiver|手机版|小黑屋| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2025-5-23 01:06
Copyright © 2001-2015 派博传思   京公网安备110108008328 版权所有 All rights reserved
快速回复 返回顶部 返回列表