...
首页> 外文期刊>Ear and hearing. >Interactions Between Unsupervised Learning and the Degree of Spectral Mismatch on Short-Term Perceptual Adaptation to Spectrally Shifted Speech
【24h】

Interactions Between Unsupervised Learning and the Degree of Spectral Mismatch on Short-Term Perceptual Adaptation to Spectrally Shifted Speech

机译:无监督学习与频谱转移语音的短期知觉适应的频谱不匹配程度之间的相互作用

获取原文
获取原文并翻译 | 示例
           

摘要

Objectives: Cochlear implant listeners are able to at least partially adapt to the spectral mismatch associated with the implant device and speech processor via daily exposure and/or explicit training. The overall goal of this study was to investigate interactions between short-term unsupervised learning (i.e., passive adaptation) and the degree of spectral mismatch in normal-hearing listeners' adaptation to spectrally shifted vowels.Design: Normal-hearing subjects were tested while listening to acoustic cochlear implant simulations. Unsupervised learning was measured by testing vowel recognition repeatedly over a 5 day period; no feedback or explicit training was provided. In experiment 1, subjects listened to 8-channel, sine-wave vocoded speech. The spectral envelope was compressed to simulate a 16 mm cochlear implant electrode array. The analysis bands were fixed and the compressed spectral envelope was linearly shifted toward the base by 3.6, 6, or 8.3 mm to simulate different insertion depths of the electrode array, resulting in a slight, moderate, or severe spectral shift. In experiment 2, half the subjects were exclusively exposed to a severe shift with 8 or 16 channels (exclusive groups), and half the subjects were exposed to 8-channel severely shifted speech, 16-channel severely shifted speech, and 8-channel moderately shifted speech, alternately presented within each test session (mixed group). The region of stimulation in the cochlea was fixed (16 mm in extent and 15 mm from the apex) and the analysis bands were manipulated to create the spectral shift conditions. To determine whether increased spectral resolution would improve adaptation, subjects were exposed to 8- or 16-channel severely shifted speech.Results: In experiment 1, at the end of the adaptation period, there was no significant difference between 8-channel speech that was spectrally matched and that shifted by 3.6 mm. There was a significant, but less-complete, adaptation to the 6 mm shift and no adaptation to the 8.3 mm shift. In experiment 2, for the mixed exposure group, there was significant adaptation to severely shifted speech with 8 channels and even greater adaptation with 16 channels. For the exclusive exposure group, there was no significant adaptation to severely shifted speech with either 8 or 16 channels.Conclusions: These findings suggest that listeners are able to passively adapt to spectral shifts up to 6 mm. For spectral shifts beyond 6 mm, some passive adaptation was observed with mixed exposure to a smaller spectral shift, even at the expense of some low frequency information. Mixed exposure to the smaller shift may have enhanced listeners' access to spectral envelope details that were not accessible when listening exclusively to severely shifted speech. The results suggest that the range of spectral mismatch that can support passive adaptation may be larger than previously reported. Some amount of passive adaptation may be possible with severely shifted speech byexposing listeners to a relatively small mismatch in conjunction with the severe mismatch.
机译:目标:耳蜗植入听众能够通过每天的暴露和/或明确的训练,至少部分适应与植入设备和语音处理器相关的频谱失配。这项研究的总体目标是调查短期无监督学习(即被动适应)与正常听觉的听众对频谱移位元音的适应程度之间频谱不匹配的程度。设计:正常听觉的受试者在聆听时进行测试进行声学耳蜗植入仿真。通过在5天内反复测试元音识别来测量无监督学习。没有提供反馈或显式培训。在实验1中,受试者听了8声道正弦波声控语音。压缩频谱包络以模拟16 mm的人工耳蜗植入电极阵列。固定分析带,将压缩后的频谱包络线向底部线性移动3.6、6或8.3 mm,以模拟电极阵列的不同插入深度,从而导致轻微,中度或严重的频谱移动。在实验2中,一半的受试者仅暴露于8或16通道的严重移位(不包括组),一半的受试者暴露于8通道的严重移位语音,16通道的严重移位语音和8通道的中等程度语音移位的语音,在每个测试会话(混合组)中交替显示。耳蜗中的刺激区域是固定的(范围为16毫米,距顶点15毫米),并操纵分析带以创建光谱移动条件。为了确定增加的频谱分辨率是否会改善适应性,将受试者暴露于8通道或16通道严重移位的语音中。结果:在实验1中,在自适应周期结束时,8通道语音与8通道语音之间没有显着差异。光谱匹配,并且移动了3.6毫米。对6毫米偏移有明显但不完全的适应,而对8.3毫米偏移没有适应。在实验2中,对于混合曝光组,对于8个通道的严重移位语音具有明显的适应性,而对于16个通道则具有更大的适应性。对于排他性接触人群,并没有明显适应具有8个或16个通道的严重移位语音的结论。结论:这些发现表明,听众能够被动适应最大6 mm的频谱移位。对于超过6毫米的频谱偏移,即使以一些低频信息为代价,也可以观察到一些被动适应性,同时混合了较小的频谱偏移。混合使用较小的偏移可能会增强听众对频谱包络细节的访问权限,而这些频谱包络细节仅在收听严重偏移的语音时无法访问。结果表明,可以支持被动适应的光谱失配范围可能比以前报道的要大。通过使收听者暴露于与严重失配相伴的相对较小的失配中,在语音严重偏移的情况下,可能会有一定数量的被动适应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号