![]() |
ICASSP 2023 SPGC Challenge:
|
![]() ![]() |
News:
The ADReSS-M Signal Processing Grand Challenge targets a difficult automatic prediction problem of societal and medical relevance, namely, the detection of Alzheimer's Dementia (AD). Dementia is a category of neurodegenerative diseases that entails a long-term and usually gradual decrease of cognitive functioning. While there has been much interest in automated methods for cognitive impairment detection by the signal processing and machine learning communities (de La Fuente, Ritchie and Luz, 2020), most of the proposed approaches have not investigated which speech features can be generalised and transferred across languages for AD prediction, and to the best of our knowledge no work has investigated acoustic features of the speech signal in multilingual AD detection. The ADReSS-M Challenge targets this issue by defining a prediction task whereby participants train their models based on English speech data and assess their models' performance on spoken Greek data. It is expected that the models submitted to the challenge will focus on acoustic features of the speech signal and discover features whose predictive power is preserved across languages, but other approaches can be considered.
In keeping with the objectives of AD prediction evaluation, the ADReSS-M challenge's dataset is statistically matched so as to mitigate common biases often overlooked in evaluations of AD detection methods, including repeated occurrences of speech from the same participant (common in longitudinal datasets), variations in audio quality, and imbalances of gender, age and educational level. By focusing on AD recognition using spontaneous speech, we depart from neuropsychological and clinical evaluation approaches, as spontaneous speech analysis has the potential to enable novel applications for speech technology in longitudinal, unobtrusive monitoring of cognitive health.
This challenge aims to provide a platform for contributions and discussions on applying signal processing and machine learning methods for Multilingual Alzheimer's Dementia Recognition through Spontaneous Speech. We invite the submission of papers describing system ideas (machine learning architectures), novel signal processing features, feature selection, and feature extraction methods in the context of ADReSS-M Challenge.
The organizers have recently conducted an extensive systematic review of the scientific literature on speech and language processing AI methods for detection of cognitive decline in AD (de La Fuente, Ritchie and Luz, 2020) which might offer participants a handy overview of approaches and results in this field.
The ADReSS-M challenge consists of the following tasks:
You may choose to do one or both of these tasks. You will be provided with access to a training set (see relevant section below), and two weeks prior to the paper submission deadline you will be provided with test sets on which you can test your models.
You may send up to five sets of results to us for scoring for each task. You are required to submit all your attempts together, in separate files named: madress_results_task1.txt, madress_results_task2.txt (or one of these, should you choose not to enter both tasks). These must contains the IDs of the test files and your model's predictions. You will be provided with README files in the test sets archives with further details. The test sets will contain README.md files with further details.
As the broader scientific goal of ADReSS-M is to gain insight into the nature of the relationship between speech and cognitive function across different languages, we encourage you to upload a paper describing your approaches and results to a pre-print repository such as arXiv or medRxiv regardless of your ranking in the Challenge, and to share your code through a publicly accessible repository, if possible using a literate programming "notebook" environment such as R Markdown or Jupyter Notebook.
The top 5 ranked teams will be invited to submit a 2-page paper describing their approach and present it at ICASSP-2023 . Accepted papers will be in the ICASSP proceedings. The teams that present their work at ICASSP are also invited to submit a full paper about their work to to the IEEE Open Journal of Signal Processing (OJ-SP).
In order to gain access to the ADReSS-M dataset, please email madress2023@ed.ac.uk with your contact information and affiliation, as well as a general statement on how you plan to use the data, with a specific mention to the ADReSS-M challenge. If you are a student, please ask your supervisor to join as well. This membership will give you full access to the dataset through DementiaBank, where the ADReSS-M dataset will be available.
Note: Please do not use any other data from DementiaBank to train your models (e.g. for augmentation), as the task dataset may contain files from that repository.
MADReSS ├── sample-gr ├── sample-gr-groundtruth.csv ├── train └── training-groundtruth.csv Important: note that the sample and test data are now (as of 6-1-23) in Greek, rather than Spanish. If you already downloaded the Spanish sample data, please replace it by the sample-gr data provided. The test data will be similar to these samples.The groundtruth.csv files contain the participant's cognitive status (Control or ProbableAD, for cognitively normal and patients diagnosed with probable Alzheimer's dementia), and the results of their mini-mental state examination (MMSE) test, which is widely used for cognitive accessment. The train directory contains a training set of 237 picture description recordings in English. The sample contains a small sample of 8 picture descriptions in Greek. The test set will consist of 48 recordings, all in Greek.
The ADReSS-M training dataset has been balanced with respect to age and gender in order to eliminate potential confunding and bias. We employed a propensity score approach to matching (Rosenbaum & Rubin, 1983; Rubin 1973; Ho et al. 2007). The dataset was checked for matching according to scores defined in terms of the probability of an instance being treated as AD given covariates age and gender estimated through logistic regression, and matching instances were selected. All standardized mean differences for the covariates were well below 0.1 and all standardized mean differences for squares and two-way interactions between covariates were well below 0.15, indicating adequate balance for the covariates. The propensity score was estimated using a probit regression of the treatment on the covariates age and gender (probit generated a better balanced than logistic regression).
Task 1: AD classification will be evaluated through the accuracy metric: \[ \displaystyle \operatorname {A} = {\frac { TN + TP }{N} } \] Specificity, sensitivity (\(\rho\)) and \(F_1\) scores for the AD class will also be reported on the ranked list to be published on this web site. These metrics will be computed as follows: \[ \displaystyle \operatorname{Sp} = { \frac { TN }{TN + FP} }, \] and \[ \displaystyle \operatorname {F_1} = { 2 \frac { \pi \times \rho }{\pi + \rho} } \] where \[ \displaystyle \operatorname {\pi} = { \frac { TP }{TP + FP} }, \] \[ \displaystyle \operatorname {\rho} = { \frac { TP }{TP + FN} }, \] N is the number of patients, TP is the number of true positives, TN is the number of true negatives, FP is the number of false positives and FN the number of false negatives. You will also be asked to submit prediction probabilities, so that area under the ROC curve scores can also be published.
Task 2 (MMSE prediction) will be evaluated using the coefficient of determination: \[ \displaystyle \operatorname {R^2} =1 - \frac {\sum_{i=1}^N(\hat{y}_{i} - y_{i})^2} {\sum_{i=1}^N(\hat{y}_{i} - \bar{y})^2} \] and the root mean squared error: \[ \displaystyle \operatorname {RMSE} ={\sqrt {\frac {\sum _{i=1}^{N}({\hat {y}}_{i}-y_{i})^{2}}{N}}} \] where \(\hat{y}\) is the predicted MMSE score, \(y\) is the patient's actual MMSE score, and \(\bar{y}\) is the mean score.
When more than one attempt is submitted for scoring against the test set, all results should be considered (not only the best result overall) and reported in the paper.
The ranking of submissions will be done based on accuracy scores for the classification task (task 1), and on RMSE scores for the MMSE score regression task (task 2). The top 5 models will consist of:A paper describing this Signal Processing Grand Challenge and its dataset more fully, along with a basic set of baseline results will be posted to https://arxiv.org/ shortly, and linked to this web page. Papers submitted to this Challenge using the MADReSS dataset should cite this paper as follows
We encourage you to submit papers describing your approaches to the
tasks set here to https://arxiv.org/,
regardless of your ranking in the Challenge, and to share your code
through open-source repositories. Please note that the intellectual
property (IP) related to your submission is not transferred to the
challenge organizers, i.e., if code is shared/submitted, the
participants remain the owners of their code. When the code is made
publicly available, an appropriate license should be added.
Important Dates
Please format your paper following the ICASSP 2023 guidelines, except for the page limit, which for the SPGC is 2 pages. Further instructions will be given here in due time.
Papers submitted to this Challenge should refer to the ADReSS-M Challenge descrition paper (see reference above for citation).
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |