READ ME File For 'Measuring Listening Effort' Dataset DOI: https://doi.org/10.5258/SOTON/D3777 Date that the file was created: December 2025 ------------------- GENERAL INFORMATION ------------------- Title of Dataset: Author/Principal Investigator Information Name: Xiaoxue Wang ORCID: 0009-0003-8894-2376 Institution: University of Southampton Address: University Road, Southampton, UK, SO17 1BJ Email: xw1u20@soton.ac.uk yuki.s.wang@outlook.com - Date of data collection: From 2023-12 to 2024-02 - Geographic location of data collection: University of Southampton, Southampton, UK - Information about funding sources that supported the collection of the data: Self-fund -------------------------- SHARING/ACCESS INFORMATION -------------------------- Licenses/restrictions placed on the data, or limitations of reuse: N/A Recommended citation for the data: https://doi.org/10.5258/SOTON/D3777 This dataset supports the publication: N/A Links to other publicly accessible locations of the data: N/A Links/relationships to ancillary or related data sets: N/A -------------------- DATA & FILE OVERVIEW -------------------- This dataset contains: - ECG_SingleFiles (folder) Contains data of ECG of each participant - EEG_SingleFiles (folder) Contains data of EEG of each participant - Resp_SingleFiles (filder) Contains data of Respiration of each participant - GSR_All.mat (mat file) - Pupil_data (folder) Contains data of Pupillometry of each participant - Matrix_All.mat (mat file) Contains data of behaviour response for each participant Relationship between files, if important for context: Those data listed below were collected with the same device: Biopac 100C system (hardware), and AcqKnowledge software, with the same sampling rate of 1000Hz - ECG_SingleFiles (folder) - EEG_SingleFiles (folder) - Resp_SingleFiles (filder) - GSR_All.mat (mat file) This data was collected with laptop and Pupil Core glasses (hardware) with Pupil Lab software: - Pupil_data (folder), with the sampling rate of 100Hz This data was collected with laptop and Matlab software: - Matrix_All.mat (mat file) Additional related data collected that was not included in the current data package: - Pilot data which was used to improve experiment design. They were not included in data analysis. If data was derived from another source, list source: N/A If there are there multiple versions of the dataset, list the file updated, when and why update was made: In the uploaded data, EEG, ECG, Respiration, and GSR was extracted from original data into a separate file for each participant. GSR data was further combined into one single dataset for all participants, and each participant occupy one row (instead of one file) in this combined data. -------------------------- METHODOLOGICAL INFORMATION -------------------------- Description of methods used for collection/generation of data: Pupillometry was collected through PupilCore device, at a sampling rate of 100 Hz. ---------------------------------------------------------------------------------------------------------- Methods for processing the data: GSR_All.mat: Instead of having separate data sets for each participant, GSR was combined into one dataset, with participant Matrix_All.mat: All the other data are raw data. ---------------------------------------------------------------------------------------------------------- Software- or Instrument-specific information needed to interpret the data, including software and hardware version numbers: Matlab2024b and Python (integrated in PyCharm) was used to analyse data. Standards and calibration information, if appropriate: N/A Environmental/experimental conditions: Semi anechoic chamber (the floor is normal flat floor instead of specially made for sound insulation) ---------------------------------------------------------------------------------------------------------- Describe any quality-assurance procedures performed on the data: The uploaded data are largely raw data used for later data analysis. There’s no specific quality-assurance procedure performed on the data. The quality assurance were performed before data was collected to make sure the collected data is with good quality. ---------------------------------------------------------------------------------------------------------- People involved with sample collection, processing, analysis and/or submission: Main Researcher: Xiaoxue Wang Supervisor: David Simpson, Stefan Bleeck -------------------------- DATA-SPECIFIC INFORMATION ————————————— ============================================================= - Explanation of Clicks (as a variable called clickIdx in the data file): - Each experiment contains 1155 clicks, which records specific event for each trial. The full experiment contains: 5 (training trials) + 40 (full experiment) trials. Each trial contains 7 clicks, the first 5 clicks record the start of the each words in the stimulus presented, and the last 2 clicks records where the retention start and end. The number of clicks for the full experiment is (5 + 40) * 7 = 1155. The number inside the variable means the sample number (time = sample number / sampling frequency) of the clicks ============================================================= Data collected: GSR (galvanic skin response) ECG (Electrocardiogram) EEG (Electroencephalogram) Respiration Pupil diametre Number of cases/rows: 33 (with empty data in between) ============================================================= Variable list, defining any abbreviations, units of measure, codes or symbols used: GSR (galvanic skin response) In data name “GSR_All.mat”: Contains variable: “Clicks_All”: The index where the click appears In sample “GSR_All_cleaned”: GSR data (unit: microsiemens (µS)) ---------------------------------------------------------------------------------------------------------- ECG (Electrocardiogram) Each subject has two files from two repeated experiments. Each file contains three variables: clickIdx (details see “Explanation of Clicks) data fs (sampling frequency) ---------------------------------------------------------------------------------------------------------- EEG (Electroencephalogram) Each subject has two files from two repeated experiments. Each file contains three variables: clickIdx (details see “Explanation of Clicks) data fs (sampling frequency) ---------------------------------------------------------------------------------------------------------- Respiration Each subject has two files from two repeated experiments. Each file contains three variables: clickIdx (details see “Explanation of Clicks) data fs (sampling frequency) ---------------------------------------------------------------------------------------------------------- Pupil diametre Each subject has four files from two repeated experiments, and from each eye. Each file contains three variables: clickIdx (details see “Explanation of Clicks) data fs (sampling frequency) Missing data codes: Missing data would be empty inside the dataset