no-sensitive to phase distortion, phase signal of noise is implemented when phase is restored [5][6]。 In the frequency domain, equation (5) is expressed as

5。2。 Proposed VAD based on improved    spectral

transforms   (DFT)   of  y(i) , s(i)  and d (i)  , respectively。

Because s(i) and d (i) are independent and Nk is gauss distribution, equation (6) is expressed as (7) in frequency domain。

On the basis of section above, we can firstly apply spectral subtraction for noisy sound of CVR to reducing noise and enhance speech, and then enhanced signal is filtered by a preceding filter, finally cockpit voice is extracted by means of double thresholds VAD。 Figure    2

shows the flow chart of proposed VAD。 The preceding



filter is a high-pass filter, such as10。9375z1 ,   which

can filter low-frequency interference, especially

For a frame of speech signal, we

interference of frequency 50Hz or 60Hz, and advance spectrum of  high  frequency which  is  useful  for cockpit

For a wonderful VAD, two requirements must be taken into considered comprehensively: to detect  more  speech

where n (k ) is  statistical  mean  of  unvoiced speech,

2

Sk   is amplitude of enhanced speech。

However, basic SS can generate much musical noises in residual noises。 Some modified SS are proposed to reduce effect。 Weighting factorand power coefficient 

sections and more unvoiced speech sections。 However, when VAD tries to detect more speech frames, it misjudges silence as speech or otherwise。 The latter is ever worse than the former for accident investigation。 Therefore, two evaluation standards are compared to weighing  quantificationally  the  performance  of   VAD:

are introduced into SS, so equation (8) is modified as

probability  of  correctly  detecting  speech  frame Pcs

 

probability of correctly detecting noise frame Pcn  ,   which are expressed as

Modified SS is degraded  to  basic  SS  when =2 and =1[5]。  Other  modified  SS  is  showed  in    relative

references [6][7]。 Better enhancement performance can be gained by adjusting two parameters suitably, but voice

where  N handand  N handare   relatively   the  overall

distortion becomes severer as the degree of noise reduction is larger。

5。Proposed VAD based on improved spectral subtraction

5。1。Improved spectral subtraction

In this paper, we propose iterative spectral subtraction to formerly reducing noise and enhancing speech。 This method uses basic SS or modified SS for appropriate times。 The former enhanced speech becomes latter input signal, so music noise is seen as input noise to reduce again。

number of hand-labeling speech frames and noise  frames

by hand-labeling, N1 and N0 are relatively number of being detected correctly by VAD。

6。2。 Experiment results

In this paper, a section of speech in car (SNR =8) from standard voice bank Aurora2 and a section of true cockpit sound are used, simulation experiments based on traditional double thresholds VAD only and the proposed VAD are carried out。 Figure 3 and table 2 compare the performance of various methods in different environment。 Due to former spectral subtraction, the SNR increases, the curves of STE and ZCR become smoother and proper probability increases。

上一篇:可重构机床设计英文文献和中文翻译
下一篇:模糊TOPSIS方法对初级破碎机英文文献和中文翻译

新課改下小學语文洧效阅...

安康汉江网讯

LiMn1-xFexPO4正极材料合成及充放电性能研究

我国风险投资的发展现状问题及对策分析

麦秸秆还田和沼液灌溉对...

老年2型糖尿病患者运动疗...

张洁小说《无字》中的女性意识

网络语言“XX体”研究

互联网教育”变革路径研究进展【7972字】

ASP.net+sqlserver企业设备管理系统设计与开发