listening tests of opus at googlelistening tests of opus at google fall 2011 jan skoglund . 2 ......

10
Listening tests of Opus at Google Fall 2011 Jan Skoglund

Upload: others

Post on 30-Mar-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

1

Listening tests of Opus at Google Fall 2011 Jan Skoglund

Page 2: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

2 2

Introduction

• Four MUSHRA-type tests performed in Aug-Sep 2011 at Google • Two tests of coding Mandarin speech • Two tests of transcoding English speech • Both trained and untrained English-speaking listeners • Only untrained Mandarin-speaking listeners • All tests presented on Windows PC with headphones

Page 3: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

3 3

Test 1 – Narrowband coding of Mandarin speech •  4 different male and 4 different female speakers

–  2 male and 2 female speakers from ITU-T P.501 –  2 male and 2 female speakers recorded at Google

• Reference files sampled at 48 kHz in low background noise •  2 anchors

–  Reference file lowpass-filtered at 3.5 kHz –  Reference file resampled at 8 kHz, with MNRU at 15 dB SNR

•  21 listeners after post-screening –  No listeners rejected

•  3 narrowband codecs –  Opus NB at 11 kbps, variable bit rate –  Speex NB at 11 kbps, variable bit rate –  iLBC at 15.2 kbps, constant bit rate

Page 4: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

4 4

Overall result – Narrowband Mandarin speech

99.8  

76.3  

22.8  

76.8   77.9  

63.6  

0.0  

10.0  

20.0  

30.0  

40.0  

50.0  

60.0  

70.0  

80.0  

90.0  

100.0  

•  Opus at 11 kbps is comparable to iLBC at 15 kbps

•  Opus at 11 kbps is better than Speex at 11 kbps

Page 5: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

5 5

Test 2 – Wideband and fullband coding of Mandarin speech •  4 different male and 4 different female speakers

–  2 male and 2 female speakers from ITU-T P.501 –  2 male and 2 female speakers recorded at Google

• Reference files sampled at 48 kHz in low background noise •  2 anchors: lowpass-filtered at 3.5 kHz and 7.0 kHz •  19 listeners after post-screening

–  Rejected 3 listeners having score correlation with the total average lower than 0.8

•  3 wideband codecs –  Opus WB at 19.85 kbps, variable bit rate –  Speex WB at 23.8 kbps, constant bit rate –  G.722.1 at 24 kbps, constant bit rate

•  2 fullband codecs –  Opus FB at 32 kbps, constant bit rate –  G.719 at 32 kbps, constant bit rate

Page 6: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

6 6

Overall result – Wideband and fullband Mandarin speech

•  Opus at 32 kbps is better than G.719 at 32 kbps

•  Opus at 20 kbps is better than Speex and G.722.1 at 24 kbps

99.0  

54.6  

79.5   81.6  

53.6  

72.6  

98.1  93.4  

0.0  

10.0  

20.0  

30.0  

40.0  

50.0  

60.0  

70.0  

80.0  

90.0  

100.0  

Page 7: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

7 7

Test 3 – Narrowband transcoding of English speech •  4 different male and 4 different female speakers

–  2 male and 2 female speakers from ITU-T P.501 –  2 male and 2 female speakers from McGill database

• Reference files sampled at 48 kHz in low background noise •  2 anchors

–  Reference file lowpass-filtered at 3.5 kHz –  Reference file resampled at 8 kHz, with MNRU at 15 dB SNR

•  19 listeners after post-screening –  No listeners rejected

•  5 narrowband transcoding scenarios –  G.711 at 64 kbps -> Opus NB at 12.2 kbps, variable bit rate –  G.711 at 64 kbps -> AMR NB at 12.2 kbps, constant bit rate –  AMR NB at 12.2 kbps -> G.711 at 64 kbps -> Opus NB at 12.2 kbps –  Opus NB at 12.2 kbps -> G.711 at 64 kbps -> AMR NB at 12.2 kbps –  AMR NB at 12.2 kbps -> G.711 at 64 kbps -> AMR NB at 12.2 kbps

Page 8: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

8 8

Overall result – Narrowband transcoding

•  Opus NB pre-coded with G.711 is comparable to AMR NB pre-coded with G.711

•  Opus NB transcoded to AMR NB via G.711 is better than AMR NB tandem-coded via G.711

99.5  

63.5  

14.9  

54.5  51.1  

54.1  50.9  

47.8  

0.0  

10.0  

20.0  

30.0  

40.0  

50.0  

60.0  

70.0  

80.0  

90.0  

100.0  

Page 9: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

9 9

Test 4 – Wideband transcoding of English speech •  4 different male and 4 different female speakers

–  2 male and 2 female speakers from ITU-T P.501 –  2 male and 2 female speakers from McGill database

• Reference files sampled at 48 kHz in low background noise •  2 anchors: lowpass-filtered at 3.5 kHz and 7 kHz •  18 listeners after post-screening

–  No listeners rejected •  4 wideband single coding and transcoding scenarios

–  Opus WB at 19.85 kbps, variable bit rate –  AMR WB at 19.85 kbps, constant bit rate –  AMR WB at 19.85 kbps -> Opus WB at 19.85 kbps –  Opus WB at 19.85 kbps -> AMR WB at 19.85 kbps

Page 10: Listening tests of Opus at GoogleListening tests of Opus at Google Fall 2011 Jan Skoglund . 2 ... Test 1 – Narrowband coding of Mandarin speech ... • 19 listeners after post-screening

10 10

Overall result – Wideband transcoding

•  Single-coded Opus WB is better than single-coded AMR WB

•  Single-coded AMR WB is slightly better than transcoding AMR WB -> Opus WB and Opus WB -> AMR WB (statistically significant)

99.4  

37.0  

74.2  78.4  

64.0   65.3  62.8  

0.0  

10.0  

20.0  

30.0  

40.0  

50.0  

60.0  

70.0  

80.0  

90.0  

100.0