The University of Southampton
University of Southampton Institutional Repository

Constructing theoretically informed measures of pause duration in experimentally manipulated writing

Constructing theoretically informed measures of pause duration in experimentally manipulated writing
Constructing theoretically informed measures of pause duration in experimentally manipulated writing

This paper argues that traditional threshold-based approaches to the analysis of pauses in writing fail to capture the complexity of the cognitive processes involved in text production. It proposes that, to capture these processes, pause analysis should focus on the transition times between linearly produced units of text. Following a review of some of the problematic features of traditional pause analysis, the paper is divided into two sections. These are designed to demonstrate: (i) how to isolate relevant transitions within a text and calculate their durations; and (ii) the use of mixture modelling to identify structure within the distributions of pauses at different locations. The paper uses a set of keystroke logs collected from 32 university students writing argumentative texts about current affairs topics to demonstrate these methods. In the first section, it defines how pauses are calculated using a reproducible framework, explains the distinction between linear and non-linear text transitions, and explains how relevant sections of text are identified. It provides Excel scripts for automatically identifying relevant pauses and calculating their duration. The second section applies mixture modelling to linear transitions at sentence, sub sentence, between-word and within-word boundaries for each participant. It concludes that these transitions cannot be characterised by a single distribution of “cognitive” pauses. It proposes, further, that transitions between words should be characterised by a three-component distribution reflecting lexical, supra-lexical and reflective processes, while transitions at other text locations can be modelled by two-component distributions distinguishing between fluent and less fluent or more reflective processing. The paper concludes by recommending that, rather than imposing fixed thresholds to distinguish processes, researchers should instead impose a common set of theoretically informed distributions on the data and estimate how the parameters of these distributions vary for different individuals and under different conditions.

Keystroke analysis, Pause analysis, Writing processes
0922-4777
Hall, Sophie, Marie
07207cf8-85cb-4bea-a4a7-c368fc1a60cc
Baaijen, Veerle
e59bf4fe-848f-4d42-a47f-ed3cf0ecf028
Galbraith, David
c4914b0d-4fd1-4127-91aa-4e8afee72ff1
Hall, Sophie, Marie
07207cf8-85cb-4bea-a4a7-c368fc1a60cc
Baaijen, Veerle
e59bf4fe-848f-4d42-a47f-ed3cf0ecf028
Galbraith, David
c4914b0d-4fd1-4127-91aa-4e8afee72ff1

Hall, Sophie, Marie, Baaijen, Veerle and Galbraith, David (2022) Constructing theoretically informed measures of pause duration in experimentally manipulated writing. Reading and Writing. (doi:10.1007/s11145-022-10284-4).

Record type: Article

Abstract

This paper argues that traditional threshold-based approaches to the analysis of pauses in writing fail to capture the complexity of the cognitive processes involved in text production. It proposes that, to capture these processes, pause analysis should focus on the transition times between linearly produced units of text. Following a review of some of the problematic features of traditional pause analysis, the paper is divided into two sections. These are designed to demonstrate: (i) how to isolate relevant transitions within a text and calculate their durations; and (ii) the use of mixture modelling to identify structure within the distributions of pauses at different locations. The paper uses a set of keystroke logs collected from 32 university students writing argumentative texts about current affairs topics to demonstrate these methods. In the first section, it defines how pauses are calculated using a reproducible framework, explains the distinction between linear and non-linear text transitions, and explains how relevant sections of text are identified. It provides Excel scripts for automatically identifying relevant pauses and calculating their duration. The second section applies mixture modelling to linear transitions at sentence, sub sentence, between-word and within-word boundaries for each participant. It concludes that these transitions cannot be characterised by a single distribution of “cognitive” pauses. It proposes, further, that transitions between words should be characterised by a three-component distribution reflecting lexical, supra-lexical and reflective processes, while transitions at other text locations can be modelled by two-component distributions distinguishing between fluent and less fluent or more reflective processing. The paper concludes by recommending that, rather than imposing fixed thresholds to distinguish processes, researchers should instead impose a common set of theoretically informed distributions on the data and estimate how the parameters of these distributions vary for different individuals and under different conditions.

Text
Hall2022_Article_ConstructingTheoreticallyInfor - Version of Record
Available under License Creative Commons Attribution.
Download (1MB)

More information

Accepted/In Press date: 7 March 2022
e-pub ahead of print date: 18 April 2022
Published date: 18 April 2022
Additional Information: Funding Information: This work was supported through funding from the ESRC South Coast Doctoral Training Partnership. Publisher Copyright: © 2022, The Author(s). Copyright: Copyright 2022 Elsevier B.V., All rights reserved.
Keywords: Keystroke analysis, Pause analysis, Writing processes

Identifiers

Local EPrints ID: 457616
URI: http://eprints.soton.ac.uk/id/eprint/457616
ISSN: 0922-4777
PURE UUID: 3039c568-adb4-4368-b64d-0e381100150c
ORCID for Sophie, Marie Hall: ORCID iD orcid.org/0000-0002-5318-6721
ORCID for David Galbraith: ORCID iD orcid.org/0000-0003-4195-6386

Catalogue record

Date deposited: 14 Jun 2022 16:42
Last modified: 18 Mar 2024 03:21

Export record

Altmetrics

Contributors

Author: Sophie, Marie Hall ORCID iD
Author: Veerle Baaijen
Author: David Galbraith ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×