APROCSA dataset

An open dataset of connected speech in aphasia with consensus ratings of auditory-perceptual features

Zoe Ezzes, Sarah M. Schneck, Marianne Casilio, Davida Fromm, Antje S. Mefford, Michael de Riesthal, Stephen M. Wilson

Language Neuroscience Laboratory
Department of Hearing and Speech Sciences
Vanderbilt University Medical Center

Video 1. Audiovisual recording of participant 1738.
1738 transcript
Video 2. Audiovisual recording of participant 1944.
1944 transcript
Video 3. Audiovisual recording of participant 1713.
1713 transcript
Video 4. Audiovisual recording of participant 1554.
1554 transcript
Video 5. Audiovisual recording of participant 1833.
1833 transcript
Video 6. Audiovisual recording of participant 1731.
1731 transcript
Table 1. Demographic, neurological, and behavioral characteristics of the participants
                              1738    1944    1713    1554    1833    1731
Age                           72      71      63      46      67      48
Sex                           M       F       F       F       M       M
Handedness                    R       R       R       R       A       R
Education (years)             14      16      14      15      14      18
Race                          W       B       W       W       W       W
Time post onset (months)12015123351852
Stroke etiology               I       I       I       I       H       I
Lesion extent (cm³)           147.2   51.1    29.2*   17.8    9.7*    218.6
Quick Aphasia Battery
  Word comprehension          9.38    10.00   10.00   10.00   10.00   8.54
  Sentence comprehension      9.38    8.13    9.58    9.58    7.71    2.71
  Word finding                7.00    5.50    9.00    8.00    7.00    1.50
  Grammatical construction    7.75    7.13    7.50    5.13    5.75    0.75
  Speech motor programming    5.00    7.50    7.50    7.50    7.50    5.00
  Repetition                  7.50    8.75    9.17    7.08    7.92    4.58
  Reading                     7.50    9.17    9.17    8.75    7.92    0.83
  Overall                     7.72    7.69    8.84    7.96    7.52    3.74

M = Male; F = Female; R = Right; A = Ambidextrous; W = White; B = Black; I = Ischemic; H = Hemorrhagic; * = Acute lesion extent.

Table 2. Consensus ratings of APROCSA features for the 6 participants
Feature                               1738    1944    1713    1554    1833    1731
Anomia                                1       3       2       2       2       3
Abandoned utterances                  0       2       1       1       2       1
Empty speech                          0       2       0       1       1       1
Semantic paraphasias                  0       0       1       1       1       2
Phonemic paraphasias                  0       0       1       0       0       1
Neologisms                            0       0       0       0       0       0
Jargon                                0       0       0       0       0       0
Perseverations                        0       0       0       0       0       1
Stereotypies and automatisms          0       0       0       0       0       2
Short and simplified utterances       0       1       0       2       1       4
Omission of bound morphemes           0       1       1       1       0       3
Omission of function words            0       0       1       2       2       4
Paragrammatism                        1       1       1       1       1       1
Pauses between utterances             1       2       0       2       1       1
Pauses within utterances              2       3       2       2       2       2
Halting and effortful                 2       1       1       1       1       2
Reduced speech rate                   2       3       1       2       2       2
Retracing                             1       3       1       1       2       1
False starts                          1       2       1       1       2       1
Conduite d’approche                   1       0       1       0       1       0
Target unclear                        1       1       0       0       0       1
Meaning unclear                       1       1       0       1       1       3
Off-topic                             0       0       0       0       0       1
Expressive aphasia                    1       2       1       2       2       3
Apraxia of speech                     2       1       1       1       1       2
Dysarthria                            1       0       0       0       0       0
Overall communication impairment      2       2       1       2       2       3
Sample duration (total; min:sec)      39:07   56:50   36:15   58:03   46:22   74:26
Sample duration (analyzed; min:sec)   6:56    6:02    5:54    8:48    7:20    7:23

0: Not present; 1: Mild; 2: Moderate; 3: Marked; 4: Severe. See Casilio et al. (2019) for detailed definitions of connected speech features and scores.
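
The scoring legend above can be applied programmatically. The following is a minimal sketch, not part of the dataset itself: it copies a few rows of Table 2 by hand, maps each 0-4 score to its label, and converts a min:sec duration to seconds. The names SCALE, RATINGS, ANALYZED, describe, and to_seconds are hypothetical and chosen only for illustration.

```python
# Severity labels from the legend of Table 2.
SCALE = {0: "Not present", 1: "Mild", 2: "Moderate", 3: "Marked", 4: "Severe"}

# Participant IDs in the column order used by Tables 1 and 2.
PARTICIPANTS = ["1738", "1944", "1713", "1554", "1833", "1731"]

# A few rows copied by hand from Table 2 (illustrative subset, not the full table).
RATINGS = {
    "Anomia": [1, 3, 2, 2, 2, 3],
    "Omission of function words": [0, 0, 1, 2, 2, 4],
    "Overall communication impairment": [2, 2, 1, 2, 2, 3],
}

# Analyzed sample durations (min:sec) from the last row of Table 2.
ANALYZED = ["6:56", "6:02", "5:54", "8:48", "7:20", "7:23"]


def describe(feature, participant):
    """Return the severity label for one feature and one participant."""
    score = RATINGS[feature][PARTICIPANTS.index(participant)]
    return SCALE[score]


def to_seconds(min_sec):
    """Convert a 'min:sec' string to a whole number of seconds."""
    minutes, seconds = min_sec.split(":")
    return int(minutes) * 60 + int(seconds)


if __name__ == "__main__":
    print(describe("Omission of function words", "1731"))  # Severe
    print(to_seconds(ANALYZED[0]))  # 416
```

Keeping the ratings as integers and looking up labels only for display preserves the ordinal scale for any later comparison across participants.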