TY - JOUR
T1 - Sequence labelling and sequence classification with gaze
T2 - Novel uses of eye‐tracking data for Natural Language Processing
AU - Barrett, Maria Jung
AU - Hollenstein, Nora
PY - 2020/11/5
Y1 - 2020/11/5
N2 - Eye‐tracking data from reading provide a structured signal with a fine‐grained temporal resolution which closely follows the sequential structure of the text. It is highly correlated with the cognitive load associated with different stages of human, cognitive text processing. While eye‐tracking data have been extensively studied to understand human cognition, it has only recently been considered for Natural Language Processing (NLP). In this review, we provide a comprehensive overview of how gaze data are being used in data‐driven NLP, in particular for sequence labelling and sequence classification tasks. We argue that eye‐tracking may effectively counter one of the core challenges of machine‐learning‐based NLP: the scarcity of annotated data. We outline the recent advances in gaze‐augmented NLP to discuss how the gaze signal from human readers can be leveraged while also considering the potentials and limitations of this data source.
AB - Eye‐tracking data from reading provide a structured signal with a fine‐grained temporal resolution which closely follows the sequential structure of the text. It is highly correlated with the cognitive load associated with different stages of human, cognitive text processing. While eye‐tracking data have been extensively studied to understand human cognition, it has only recently been considered for Natural Language Processing (NLP). In this review, we provide a comprehensive overview of how gaze data are being used in data‐driven NLP, in particular for sequence labelling and sequence classification tasks. We argue that eye‐tracking may effectively counter one of the core challenges of machine‐learning‐based NLP: the scarcity of annotated data. We outline the recent advances in gaze‐augmented NLP to discuss how the gaze signal from human readers can be leveraged while also considering the potentials and limitations of this data source.
U2 - https://doi.org/10.1111/lnc3.12396
DO - https://doi.org/10.1111/lnc3.12396
M3 - Journal article
SN - 1749-818X
VL - 14
SP - 1
EP - 16
JO - Language and Linguistics Compass
JF - Language and Linguistics Compass
IS - 11
ER -