A Versatile Dataset of Mouse and Eye Movements on Search Engine Results Pages

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 13 Jul 2025Embargo end date: 01 Jan 2025Publisher:ACMJournal:Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval

Authors: Kayhan Latifzadeh; Jacek Gwizdka; Luis A. Leiva;

doi: 10.1145/3726302.3730325 , 10.48550/arxiv.2507.08003

arXiv: 2507.08003

A Versatile Dataset of Mouse and Eye Movements on Search Engine Results Pages

- Summary
- Subjects
- Metrics

Abstract

We contribute a comprehensive dataset to study user attention and purchasing behavior on Search Engine Result Pages (SERPs). Previous work has relied on mouse movements as a low-cost large-scale behavioral proxy but also has relied on self-reported ground-truth labels, collected at post-task, which can be inaccurate and prone to biases. To address this limitation, we use an eye tracker to construct an objective ground-truth of continuous visual attention. Our dataset comprises 2,776 transactional queries on Google SERPs, collected from 47 participants, and includes: (1) HTML source files, with CSS and images; (2) rendered SERP screenshots; (3) eye movement data; (4) mouse movement data; (5) bounding boxes of direct display and organic advertisements; and (6) scripts for further preprocessing the data. In this paper we provide an overview of the dataset and baseline experiments (classification tasks) that can inspire researchers about the different possibilities for future work.

Related Organizations

The University of Texas at Austin
United States
University of Luxembourg
Luxembourg

Keywords

Human-Computer Interaction, FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Information Retrieval, Computer Vision and Pattern Recognition, Information Retrieval (cs.IR), Human-Computer Interaction (cs.HC)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Green