<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Analyzing Unconstrained Reading Patterns of Digital Documents Using Eye Tracking</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>09/01/2021</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10313300</idno>
					<idno type="doi">10.1109/JCDL52503.2021.00036</idno>
					<title level='j'>2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)</title>
<idno></idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Bhanuka Mahanama</author><author>Gavindya Jayawardena</author><author>Sampath Jayarathna</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Researchers read scientific literature to keep currentin the field and find state-of-the-art solutions to various scientificproblems. Prior work suggests that reading patterns may varywith the researcher’s domain expertise and on the content ofthe digital document. In this work, we present a pilot study ofeye-tracking measures during a reading task with the options forzooming and panning of the reading material. The main goal isto analyze unconstrained reading patterns of digital documentsusing eye movement fixations and dwell time on various sectionsof a digital document. Our results indicate that participantsmostly focused on methodology and results sections, which isconsistent with the prior work with constrained reading patterns.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>The pattern of reading digital documents varies from person to person. When reading digital documents one may zoom and pan into different sections during focused reading tasks. While this behavior seems stochastic, prior work <ref type="bibr">[1]</ref> suggests that individuals with similar expertise in reading digital documents may exhibit common reading patterns. Jayawardena et al. <ref type="bibr">[2]</ref>, for instance, used eye tracking to analyze the reading patterns of novice researchers while reading scientific articles. In this study, all participants spent the longest time reading the methodology section, with a comparatively low cognitive load. Their experimental setup, however, did not allow participants to zoom or pan into the document being read. Therefore, the natural behaviour of reading a digital document has not been captured.</p><p>In this paper, we examine the reading patterns of novice readers allowing participants to read scientific literature in an unconstrained manner; i.e., with the freedom to zoom and pan into the document.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. METHODOLOGY</head><p>For each participant, we conduct an eye movement analysis to obtain, 1) eye movement fixation count, and 2) dwell time, within various predefined areas of interest (AOI).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Data Collection</head><p>We recruited three doctoral students as novice researchers (1F, 2M) in the field of Computer Science. All participants were in their second or third year in the respective doctoral programs. They were aged 20-30 years, with normal or corrected-tonormal vision. We verified the vision of the participants by conducting a simple visual acuity test. Participants verbally confirmed their familiarity of reading scientific literature.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Reading Task</head><p>Participants were asked to read a preselected two-page, peerreviewed publication. Simultaneously, their eye movements were recorded using a GazePoint GP3 desktop eye-tracker 1 , which has a sampling frequency of 60 Hz and an angular accuracy of 0.5&#176;-1&#176;. Prior to each recording, we used the nine-point calibration, and verified the accuracy of readings through visual inspection.</p><p>Next, we introduced the digital document viewing tool we used in the experiment to the participants. We explained the controls of the digital document viewing tool such as zooming and panning, using a different document than the reading material used in the experiment.</p><p>While the participants were reading the document, we captured the screen content and the gaze information. We used the index of the frame of the screen recording to aggregate the screen recording and the gaze position. Further, we used a single image containing the entire paper as the primary reference for mapping of an AOI.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Data Processing</head><p>We started the gaze processing by combining the gaze positions with each frame of the screen recording. Based on the features found in the main image and the content in a given frame, we matched the local features in the screen recording frame with the global features in the main image. We used Scale Invariant Feature Transform (SIFT) <ref type="bibr">[3]</ref> on both images to identify key features. Based on the descriptors and the key points identified by the SIFT, we generated a list of potential matches between the features of the screen recording frame and the paper content. For the generation of matches, we used Brute-force matching with L1 norm.</p><p>Then we sorted the possible key-point matches based on the L1 distance to get the most resembling key-points between screen capture and the document. Using the two most resembling points (matches with lowest L1 distance), we mapped the gaze coordinates from the screen capture frame to the main image. For the mapping, we used linear interpolation to compute the coordinates on the main image. Through the analysis of the behavior of the gaze locations, we computed two selected eye-tracking metrics; fixation count and dwell time.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>D. Eye Movements Processing</head><p>For the eye movements analysis, we generated fixation count and dwell time of participants during the reading task from the eye movements recordings. Fixation count indicates the number of times that the eyes fixated on an AOI, whereas the dwell time indicates the total amount of time spent on reading or skimming an AOI.</p><p>We calculated fixations counts and dwell times of the participants using RAEMAP <ref type="bibr">[4]</ref> eye movement analysis pipeline where the eye movements are classified using I-VT algorithm with a threshold of 18&#176; <ref type="bibr">[5]</ref>. We specified six AOIs: (1) title, (2) abstract, (3) introduction, (4) methodology, (5) results, (6) conclusions, and (7) references on the selected scientific article to analyze the eye-movements of participants. Then we used the fixation data and mapped them with the specific locations of the paper. The average fixation count and dwell time (seconds) of participants on AOIs suggests that participants preferred to fixate more on both methodology and results sections. Participants spent about same amount of time reading methodology and results sections compared to the other sections (see Table <ref type="table">I</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. CONCLUSIONS</head><p>The primary objective of this study is to develop a methodology to capture reading patterns of novice readers in an unconstrained environment. We formulated a method to generate AOI-based eye-tracking metrics from a stream of gaze data and its corresponding screen capture. By computing eye movement measures within various sections of the document, we were able to obtain results consistent with the prior work <ref type="bibr">[2]</ref> on reading pattern analysis without the option to pan and zoom. This indicates the applicability of this method for unconstrained reading scenarios.</p><p>In the future, we will focus on limiting false-positive key point matching between the on-screen content and the paper content. This is in particular useful when assessing the unconstrained reading patterns with less distinctive features such as images, charts, and tables. Further, we will incorporate additional metrics such as gaze transition and pupillometry from recorded eye-movement data.</p></div></body>
		</text>
</TEI>
