Skip to main content

The 2nd Clarity Prediction Challenge

info

The 2nd Clarity Prediction Challenge has now finished. For the details of the systems that were submitted and to see the table of results, please visit the Clarity-2023 Workshsop website.

For details of information on forthcoming challenge see here.

To allow the development of better hearing aids, we need ways to evaluate the speech intelligibility of audio signals automatically. We need a prediction model that takes the audio produced by a hearing aid and the listener's characteristics (e.g. audiogram) and estimates the speech intelligibility score that the listener would achieve in a listening test.

Last year we ran the CPC1 Challenge to develop such a model. The challenge was presented at an online workshop and a special session of Interspeech 2022. We are now running the 2nd round of this challenge, which builds on the first by using more complex signals and a larger set of listening test data for training and evaluating the prediction systems.

The outputs of the new challenge will be presented at an ISCA workshop that is being run as a satellite event to Interspeech 2023 in Dublin on 19th August 2023.

This site provides entrants with what they need to know to take part in the challenge.

Short Description

The task involves estimating the intelligibility of speech-in-noise signals that have been processed by hearing aid algorithms and presented to listeners with hearing loss. Each signal contains a short sentence that the listeners were asked to repeat. The system you build needs to be able to predict how many of the words were recognised correctly by the listeners. It is not expected that systems can do this accurately on a per sentence basis, but rather we will rank systems on this basis of how well they perform over a large evaluation set, i.e., which system produces the lowest average estimation error.

The hearing aid signals being assessed vary widely in quality. Examples of good, fair and poor signals are provided below. Your prediction algorithm needs to be able to cope with this variation.

GoodFairPoor

The data

You will be provided with a set of training data which you can use to develop your systems. This data consists of

  • Audio produced by a variety of (simulated) hearing aids for speech-in-noise;
  • The corresponding clean reference signals (the original speech);
  • Characteristics of the listeners (pure tone audiograms, etc);
  • The measured speech intelligibility scores from listening tests, where the listener was asked to say what they heard after listening to the hearing aid processed signal.

For full details of the data see the Data page.

The task

You will be provided with an evaluation set containing

  • Audio produced by a variety of (simulated) hearing aids for speech-in-noise;
  • The audiogram of a listener;
  • The clean reference signal (the original speech).

Your task will be to produce a score (0.0 to 1.0), which should predict the proportion of words in the reference signal that the listener would be able to repeat correctly after listening to the hearing aid processed signal.

We will be considering two types of system: intrusive and non-intrusive. Intrusive systems (also known as double-ended) are those that require a clean speech reference, and non-intrusive systems (also known as single-ended) are those that use the hearing aid output alone.

Intrusive and non-intrusive systems will be separately ranked according to the RMSE between their predictions and the true values.

To help you get started we have provided a baseline system that uses the HASPI metric to predict the speech intelligibility score. Details of this system are available on the Baseline page.

For full details of the task see the rules page.

Registering and submitting

To take part in the challenge you will need to register your team and download the data. Entrants will have until 31st July to complete their submissions. Full instructions for submission are available on the Submission page.