next up previous
Next: Frequency to eigendomain transform Up: Speech Enhancement Summaries Previous: Bayes optimal decision rule

Ephraim and Malah suppression rule [3]

This involves deriving the MMSE STSA estimator using a complex Gaussian model of the a priori probability distribution of speech and noise Fourier expansion coefficients. If $y[n]=x[n]+b[n]$ and $X(k)=A_k exp(j\alpha_k)$, then the MMSE estimator of $A_k$ is

\begin{displaymath}
\begin{array}{lll}
\hat{A}_k & = & \mathcal{E}[A_k\vert Y_...
...\alpha_k) P(a_k, \alpha_k) d \alpha_k d a_k }\\
\end{array}
\end{displaymath}

With the assumption of Fourier coefficients having a Gaussian distribution, the polar form of the coefficients have the following marginal distribution

\begin{displaymath}
P(a_k)=\left\{
\begin{array}{ll}
\frac{2a_k}{\lambda_x(k...
...0,\infty)\\
0 & \textrm{ otherwise }
\end{array}
\right.
\end{displaymath}

and

\begin{displaymath}
p(\alpha_k)=\left\{
\begin{array}{ll}
\frac{1}{2\pi} & \t...
...[-\pi,\pi)\\
0 & \textrm{ otherwise }
\end{array}
\right.
\end{displaymath}

The prior pdf is

\begin{displaymath}
P(Y_k\vert a_k,\alpha_k) = \frac{1}{\pi \lambda_b(k)} exp\l...
...{1}{\lambda_b(k)} \vert Y_k-a_k e^{j\alpha_k}\vert^2 \right\}
\end{displaymath}

The joint pdf is

\begin{displaymath}
P(a_k,\alpha_k)=\frac{a_k}{\pi \lambda_x(k)} exp(-\frac{a_k^2}{\lambda_x(k)})
\end{displaymath}

The posterior density can be worked out to be

\begin{displaymath}
P(a_k\vert Y_k)=\frac{a_k}{\sigma_k^2} exp\left(-\frac{a_k^...
... \sigma_k^2}\right) I_0\left(\frac{a_ks_k}{\sigma_k^2}\right)
\end{displaymath}

where

\begin{displaymath}
\begin{array}{lll}
\frac{1}{\lambda(k)} & = & \frac{1}{\la...
...k & = & \frac{\epsilon_k}{1+\epsilon_k} \gamma_k
\end{array}
\end{displaymath}

The authors use the first moment of the posterior distribution giving

\begin{displaymath}
\hat{A}_k=\lambda(k)^{1/2}\Gamma(1.5)\Phi(-0.5,1;-v_k)
\end{displaymath}

They also extend the amplitude estimator under signal presence uncertainty (see for example Maximum Likelihood estimator) but this is beyond the scope of this summary.


next up previous
Next: Frequency to eigendomain transform Up: Speech Enhancement Summaries Previous: Bayes optimal decision rule
Vinesh Bhunjun 2004-09-17