One noisy speech clip, cleaned with DeepFilterNet3 and then sent through two low-bitrate voice codecs. Listen at each stage, see it in the spectrograms, and check the numbers below.
On this clip, DeepFilterNet3 makes the speech clearly cleaner. The full +6.4 dB SNR gain applies before transmission; through Codec2 and Opus it narrows to about +1 to +2.5 dB as the codec takes over. Denoising runs faster than real time, so it is cheap to add ahead of a live link.
One 5-second example, so treat it as an early signal to test more widely, not a finished benchmark.
Measured in each path against the noisy signal carried the same way.
In each stage, compare Noisy (the input) with DeepFilterNet3 (the denoised version, marked Under test). Clean is the ideal reference.
DeepFilterNet3 applied straight to the noisy capture, before any codec.
Each signal through Codec2 at 3200 bps, a very low bitrate vocoder used on HF/VHF radio that rebuilds speech from parameters.
Each signal through Opus wideband at 12 kbps with 15% packet loss, rebuilt by Opus's neural concealment (FARGAN).
SNR and loudness are measured against the clean reference. The denoised row is highlighted in each path.
| Track | Duration (s) | Loudness (dBFS) | SNR (dB) | Δ SNR | SI-SDR (dB) | Δ SI-SDR |
|---|---|---|---|---|---|---|
| 1Direct denoising | ||||||
| Noisy | 5.00 | -15.4 | -0.3 | baseline | -0.3 | baseline |
| DeepFilterNet3 | 4.97 | -20.9 | 6.1 | ▲ +6.4 | 5.2 | ▲ +5.5 |
| Clean | 5.00 | -18.6 | reference | n/a | reference | n/a |
| 2After Codec2 (3200 bps) | ||||||
| Noisy | 5.00 | -17.7 | -3.3 | baseline | -29.8 | n/a |
| DeepFilterNet3 | 4.96 | -21.0 | -2.0 | ▲ +1.3 | -39.5 | see note * |
| Clean | 5.00 | -20.1 | -2.5 | n/a | -30.6 | n/a |
| 3After Opus (12 kbps, 15% loss) | ||||||
| Noisy | 5.04 | -16.2 | -4.5 | baseline | -29.6 | n/a |
| DeepFilterNet3 | 4.98 | -20.7 | -2.1 | ▲ +2.5 | -52.7 | see note * |
| Clean | 5.04 | -18.9 | -2.9 | n/a | -35.1 | n/a |
Δ is the denoised minus the noisy track in the same path; positive means denoising helped. * SI-SDR is unreliable through a codec, which rebuilds the waveform, so judge the codec stages by SNR and by ear.