Top-Down and Bottom-Up Processes in Web Search Navigation
Top-Down and Bottom-Up Processes in Web Search Navigation
Top-Down and Bottom-Up Processes in Web Search Navigation
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Top</strong>-<strong>Down</strong> <strong>and</strong> <strong>Bottom</strong>-<strong>Up</strong> <strong>Processes</strong> <strong>in</strong> <strong>Web</strong> <strong>Search</strong> <strong>Navigation</strong><br />
Shu-Chieh Wu (shu-chieh.wu@nasa.gov)<br />
NASA Ames Research Center <strong>and</strong> San Jose State University, MS 262-4<br />
Moffett Field, CA 94035 USA<br />
Abstract<br />
In current theories of web navigation, l<strong>in</strong>k evaluation has<br />
been treated primarily as a bottom-up process <strong>in</strong>volv<strong>in</strong>g<br />
assess<strong>in</strong>g the semantic distance between a search target <strong>and</strong> a<br />
given l<strong>in</strong>k <strong>in</strong> the <strong>in</strong>formation architecture. We <strong>in</strong>vestigated<br />
whether there exists top-down <strong>in</strong>fluence from familiarity with<br />
the search target <strong>and</strong>/or the <strong>in</strong>formation architecture. We<br />
compared search performance on targets that varied <strong>in</strong> the<br />
level of category ambiguity <strong>and</strong> the presence of category<br />
names. We found that categorically unambiguous search<br />
targets resulted <strong>in</strong> fewer categories be<strong>in</strong>g evaluated, fewer<br />
fixations <strong>and</strong> shorter fixation durations, <strong>and</strong> overall fast <strong>in</strong>itial<br />
classification regardless of the presence of category names,<br />
suggest<strong>in</strong>g proactive use of top-down knowledge <strong>in</strong> guid<strong>in</strong>g<br />
search behavior. Our results have implications on design<strong>in</strong>g<br />
<strong>in</strong>formation architectures that support efficient top-down<br />
strategies <strong>in</strong> searches for menu items <strong>and</strong> web l<strong>in</strong>ks.<br />
Keywords: <strong>Web</strong> navigation; web search; eye movements;<br />
eye track<strong>in</strong>g; visual scann<strong>in</strong>g; menu organization.<br />
Introduction<br />
The prevalence <strong>and</strong> improved accuracy of web search<br />
technologies have transformed the process of how users<br />
seek <strong>in</strong>formation. While keyword search has arguably<br />
become the most dom<strong>in</strong>ant method for locat<strong>in</strong>g <strong>in</strong>formation<br />
on the <strong>in</strong>ternet, search through navigation rema<strong>in</strong>s an<br />
important element of web user experience. <strong>Navigation</strong><br />
offers the ability to locate unfamiliar targets <strong>in</strong> the absence<br />
of proper keywords <strong>and</strong> to browse all available <strong>in</strong>stances on<br />
a particular theme. Outside of the web environment,<br />
navigation through multiple levels of menus also rema<strong>in</strong>s<br />
the primary method for locat<strong>in</strong>g <strong>in</strong>formation on personal<br />
digital devices such as digital cameras, personal digital<br />
assistants (PDAs) <strong>and</strong> cellular phones, where implement<strong>in</strong>g<br />
a search function is impractical given the limitation <strong>in</strong><br />
display areas <strong>and</strong> <strong>in</strong>put methods. Different from keyword<br />
search where results are collated <strong>and</strong> returned, search<br />
through navigation requires users to click through a series of<br />
l<strong>in</strong>ks <strong>in</strong> order to navigate themselves toward search goals.<br />
How users at any given moment search among available<br />
l<strong>in</strong>ks <strong>and</strong> choose one that they believe will lead them to their<br />
goals is a question of both theoretical <strong>in</strong>terest <strong>and</strong> practical<br />
implications on the design of efficient <strong>in</strong>formation<br />
architecture.<br />
In this paper, we exam<strong>in</strong>e the process underly<strong>in</strong>g l<strong>in</strong>k<br />
evaluation, a critical element of navigation. The result of<br />
l<strong>in</strong>k evaluation dictates how a user navigates, whether it is<br />
to proceed with a l<strong>in</strong>k or to backtrack from a dead-end page.<br />
Craig S. Miller (cmiller@cs.depaul.edu)<br />
DePaul University, 243 S. Wabash Avenue<br />
Chicago, IL 60657 USA<br />
1848<br />
Theories of web navigation have focused on predict<strong>in</strong>g<br />
among a group of l<strong>in</strong>ks which one will more likely be<br />
selected <strong>in</strong> actual user behavior but with little attention paid<br />
to the evaluative process, per se, of a s<strong>in</strong>gle l<strong>in</strong>k. This paper<br />
aims to address possible top-down <strong>and</strong> bottom-up factors<br />
that <strong>in</strong>fluence l<strong>in</strong>k evaluation.<br />
L<strong>in</strong>k Evaluation<br />
Despite its central role <strong>in</strong> navigation, the process underly<strong>in</strong>g<br />
l<strong>in</strong>k evaluation is not always explicitly def<strong>in</strong>ed <strong>in</strong> current<br />
theories of web navigation. Some models like MESA<br />
(Method for Evaluat<strong>in</strong>g Site Architectures) have l<strong>in</strong>k<br />
evaluation outcomes generated outside the model by<br />
separate procedures <strong>and</strong> fed <strong>in</strong>to the model (Miller &<br />
Rem<strong>in</strong>gton, 2004). Others like CoLiDeS (Comprehensionbased<br />
L<strong>in</strong>ked model of Deliberate <strong>Search</strong>) (Kitajima,<br />
Blackmon, & Polson, 2000) <strong>and</strong> SNIF-ACT (Scent-based<br />
<strong>Navigation</strong> <strong>and</strong> Information Forag<strong>in</strong>g <strong>in</strong> the ACT<br />
architecture) (Fu & Pirolli, 2007) model l<strong>in</strong>k evaluation as<br />
measur<strong>in</strong>g the semantic distance between an encountered<br />
l<strong>in</strong>k <strong>and</strong> a search target. There is no question that l<strong>in</strong>k<br />
evaluation <strong>in</strong>volves some form of semantic comparison. The<br />
question is where the comparison takes place; that is,<br />
whether the comparison really is between the particular l<strong>in</strong>k<br />
be<strong>in</strong>g evaluated <strong>and</strong> the search target, an assumption shared<br />
by both models.<br />
One reason to question such an assumption of l<strong>in</strong>k<br />
evaluation processes is that users often possess a certa<strong>in</strong><br />
amount of knowledge, not only of their search targets (i.e.,<br />
queries) but also of the likely options from the <strong>in</strong>formation<br />
architecture. For example, a shopper who wishes to<br />
purchase an MP3 player from an onl<strong>in</strong>e store would not<br />
necessarily expect to f<strong>in</strong>d the word MP3 on the front page of<br />
the store. Rather, a savvy shopper may expect to f<strong>in</strong>d l<strong>in</strong>ks<br />
like electronics or portable audios. In other words, to<br />
facilitate search, users are likely to take <strong>in</strong>to account what<br />
opportunities are available based on their prior experience<br />
<strong>and</strong> apply that knowledge to rephrase their search targets <strong>in</strong><br />
ways conform<strong>in</strong>g to what they believe to be conventional<br />
contents of the <strong>in</strong>formation architecture.<br />
What is be<strong>in</strong>g suggested here is that l<strong>in</strong>k evaluation likely<br />
<strong>in</strong>volves not only a bottom-up process <strong>in</strong> which the mean<strong>in</strong>g<br />
of an encountered l<strong>in</strong>k is compared to that of the search<br />
target but also a top-down process that precedes evaluation<br />
<strong>in</strong> which a user rephrases the search target <strong>in</strong> languages<br />
closer to those available from the <strong>in</strong>formation architecture.
They differ <strong>in</strong> the level at which a user <strong>in</strong>jects his/her prior<br />
knowledge of both the search goal (i.e., query) <strong>and</strong> the<br />
available l<strong>in</strong>ks <strong>in</strong>to the search process, <strong>and</strong> consequently the<br />
nature of the evaluative process. The difference between our<br />
stipulation of the top-down processes (rephras<strong>in</strong>g a search<br />
target to match available l<strong>in</strong>ks) <strong>and</strong> assess<strong>in</strong>g semantic<br />
distance between a given l<strong>in</strong>k <strong>and</strong> words related to the<br />
search target (as <strong>in</strong> <strong>in</strong>formation forag<strong>in</strong>g) is that the former<br />
could lead to cases where no evaluative process takes place<br />
because the user simply knows what he/she is look<strong>in</strong>g for<br />
<strong>and</strong> where it is. In top-down l<strong>in</strong>k evaluation, the nature of<br />
the evaluative process is simplified from comparison to<br />
recognition. Note that Hornof (2004) <strong>and</strong> Fleetwood <strong>and</strong><br />
Byrne (2006) study processes where the user is search<strong>in</strong>g<br />
for a specific str<strong>in</strong>g. This is ak<strong>in</strong> to what happens <strong>in</strong> a topdown<br />
strategy.<br />
Figure 1 proposes an idealized process that uses two<br />
dist<strong>in</strong>ct strategies for search<strong>in</strong>g a menu item depend<strong>in</strong>g on<br />
the ability to first recall a category phrase that might appear<br />
<strong>in</strong> the desired selection. After acquir<strong>in</strong>g the search target,<br />
the first condition box <strong>in</strong>dicates that a user tries to retrieve a<br />
relevant phrase that likely appears <strong>in</strong> the menu. If the user is<br />
able to retrieve a relevant phrase, it employs a top-down<br />
strategy. That is, it prepares a search for the physical<br />
properties of that phrase. If a relevant phrase cannot be<br />
retrieved, it employs a bottom-up strategy, which requires<br />
an item-by-item calculation of category membership. In this<br />
way, the bottom-up strategy <strong>in</strong>curs a greater time cost when<br />
search<strong>in</strong>g through menu items, not only because it likely<br />
requires more iterations of l<strong>in</strong>k visitation <strong>and</strong> evaluation<br />
before a f<strong>in</strong>al selection can be made but also because the<br />
evaluative process itself takes longer than that <strong>in</strong> the topdown<br />
strategy. Note that for the top-down strategy, this<br />
schematic <strong>in</strong>dicates that a user still confirms category<br />
membership when f<strong>in</strong>d<strong>in</strong>g a physical match, but it is<br />
possible that a confident user may forgo this step if the<br />
<strong>in</strong>itially retrieved phrase is almost certa<strong>in</strong>ly part of the<br />
desired category.<br />
Figure 1: A schematic diagram of how search proceeds under top-down <strong>and</strong> bottom-up processes<br />
1849
The possibility of top-down strategies has at least two<br />
important implications for how users perform at web <strong>and</strong><br />
menu navigation. First, a top-down strategy would<br />
presumably be more efficient when the user can look for the<br />
physical features of a specific target by way of a faster scan<br />
of the menu items. Second, a top-down strategy may cause a<br />
user to overlook highly relevant items if their labels do not<br />
physically match the character sequence that is anticipated<br />
<strong>in</strong> the target label. Given its implications for human<br />
performance, underst<strong>and</strong><strong>in</strong>g when top-down strategies occur<br />
could be helpful for design<strong>in</strong>g <strong>in</strong>formation architectures <strong>and</strong><br />
diagnos<strong>in</strong>g navigation problems when they occur.<br />
Present Research<br />
In the present research, we <strong>in</strong>vestigate whether there is<br />
evidence for top-down process<strong>in</strong>g <strong>in</strong> <strong>in</strong>dividual l<strong>in</strong>k<br />
evaluation. We hypothesize that the dist<strong>in</strong>ction between topdown<br />
<strong>and</strong> bottom-up processes is most likely revealed <strong>in</strong> the<br />
comparison between search for categorically ambiguous <strong>and</strong><br />
categorically unambiguous goals. When the search goal<br />
clearly <strong>in</strong>dicates its own category, users are more likely to<br />
apply their prior knowledge of the <strong>in</strong>formation architecture<br />
to rephrase the search target <strong>and</strong> transform the process of<br />
l<strong>in</strong>k evaluation to recognition. Conversely, when the search<br />
goal is categorically ambiguous, users are more likely to<br />
depend on bottom-up processes <strong>and</strong> compare the search<br />
target aga<strong>in</strong>st each encountered l<strong>in</strong>k. Certa<strong>in</strong>ly categorically<br />
ambiguous targets will lead to a longer search process<br />
where<strong>in</strong> more l<strong>in</strong>ks are evaluated. The critical prediction<br />
here however is on <strong>in</strong>dividual l<strong>in</strong>k evaluation, that it should<br />
take less time with categorically unambiguous than<br />
categorically ambiguous goals. Alternatively, if search is<br />
carried out exclusively through bottom-up processes,<br />
<strong>in</strong>dividual l<strong>in</strong>k evaluation time should be equivalent <strong>in</strong><br />
searches for both categorically ambiguous <strong>and</strong> unambiguous<br />
targets because the nature of the evaluative process rema<strong>in</strong>s<br />
the same.<br />
Naturally, categorically unambiguous goals are more<br />
likely to conta<strong>in</strong> category names as part of their description.<br />
As a result, faster l<strong>in</strong>k evaluation could be equally<br />
attributable to close semantic distance. To control for this<br />
potential confound, we <strong>in</strong>dependently manipulated category<br />
ambiguity <strong>and</strong> the presence of category names. If l<strong>in</strong>k<br />
evaluation is subject to top-down <strong>in</strong>fluence, we would<br />
expect to see fast l<strong>in</strong>k evaluation <strong>in</strong> category-unambiguous<br />
search goals despite the absence of category names.<br />
Experiment<br />
Participants<br />
Ten students recruited from local colleges participated.<br />
They had no experience with the expert database <strong>and</strong> were<br />
naïve to the purpose of the study.<br />
1850<br />
Apparatus<br />
The study was carried out on a Pentium 4 PC runn<strong>in</strong>g<br />
Firefox. Eye movements were monitored us<strong>in</strong>g a headmounted<br />
high-speed eye tracker (Applied Sciences<br />
Laboratory, Model 501) with eye-head <strong>in</strong>tegration function,<br />
sampl<strong>in</strong>g at 120Hz. Gaze positions were then synchronized<br />
with recorded scenes us<strong>in</strong>g GazeTracker software (Eye<br />
Response Technology), which records video at 640x480<br />
pixel resolution <strong>and</strong> samples at 40 frames per second.<br />
Doma<strong>in</strong> for the Information Architecture<br />
The website used <strong>in</strong> the study was generated based on an<br />
“expert database” ma<strong>in</strong>ta<strong>in</strong>ed by the Media Relation<br />
Department of DePaul University, which conta<strong>in</strong>s<br />
descriptions of 970 university faculty members <strong>and</strong> their<br />
respective areas of expertise as a resource for journalists <strong>in</strong><br />
need for a subject-matter expert. The database was<br />
organized <strong>in</strong> levels of categories <strong>and</strong> sub-categories <strong>and</strong><br />
implemented <strong>in</strong> a web-based application. For details of the<br />
categories <strong>and</strong> the process by which they were derived, see<br />
Miller et al. (2007). In the present research, the expert<br />
database was reduced <strong>and</strong> restructured to conta<strong>in</strong> one level<br />
of top categories, one level of various numbers of subcategories,<br />
<strong>and</strong> at the bottom one level of content items (i.e.,<br />
expert descriptions). The result<strong>in</strong>g database had the<br />
follow<strong>in</strong>g 9 top-level categories: Arts <strong>and</strong> Literature,<br />
Bus<strong>in</strong>ess <strong>and</strong> Economics, Education, Law <strong>and</strong> Legal, Health<br />
<strong>and</strong> Medic<strong>in</strong>e, Politics <strong>and</strong> Public Policy, Religion, Society<br />
<strong>and</strong> Culture, <strong>and</strong> Science <strong>and</strong> Technology.<br />
Task <strong>and</strong> Design<br />
Participants were given descriptions of experts <strong>and</strong> asked<br />
to locate them <strong>in</strong> the web application. The descriptions<br />
chosen as search targets varied accord<strong>in</strong>g to two factors:<br />
category ambiguity <strong>and</strong> the presence of category names. We<br />
h<strong>and</strong>-selected the descriptions that represented clear cases<br />
for each of the four factorial conditions. The level of<br />
category ambiguity was determ<strong>in</strong>ed <strong>in</strong> a previous study<br />
(Miller et al., 2007) based on how users assigned the<br />
descriptions to the 9 top-level categories. We used the<br />
follow<strong>in</strong>g calculation to measure the distribution of category<br />
choices for a description:<br />
ambiguity = 1 −<br />
c<br />
∑<br />
i=<br />
1<br />
Here, c equals the number of categories, Si is the number of<br />
choices for a particular category <strong>and</strong> n is the total number of<br />
choices. For a fully unambiguous description, where the<br />
description was consistently assigned to only one category,<br />
the ambiguity measure is 0. For the present study, we used<br />
the ambiguity measure as a guide for select<strong>in</strong>g categorically<br />
ambiguous <strong>and</strong> unambiguous descriptions while also<br />
controll<strong>in</strong>g for their length. In the end, all descriptions<br />
classified as categorically unambiguous had an ambiguity<br />
⎛<br />
⎜<br />
⎝<br />
si<br />
n<br />
⎞<br />
⎟<br />
⎠<br />
2
measure less than .25 <strong>and</strong> all descriptions classified as<br />
categorically ambiguous had a measure greater than .45.<br />
The presence of category names was determ<strong>in</strong>ed by<br />
calculat<strong>in</strong>g the amount of co-occurrence of words between a<br />
particular expert description <strong>and</strong> the category to which most<br />
users assigned the description. Descriptions with no cooccurrence<br />
were assigned to one condition. For the opposite<br />
condition, a description was considered hav<strong>in</strong>g category<br />
names if it conta<strong>in</strong>ed words from the category to which<br />
most users assigned the description.<br />
To illustrate, here are examples of the four types of<br />
descriptions from Science <strong>and</strong> Technology category:<br />
“Use of technology <strong>in</strong>clud<strong>in</strong>g computers,<br />
telecommunications <strong>and</strong> multimedia”<br />
(categorically unambiguous with category names)<br />
“Fiber-optic communications, chaos <strong>and</strong> optical<br />
systems, lasers”<br />
(categorically unambiguous without category names)<br />
“Ethics of new technology, employee privacy <strong>and</strong><br />
technology”<br />
(categorically ambiguous with category names)<br />
“The harmful effects environmental contam<strong>in</strong>ation has<br />
on liv<strong>in</strong>g organisms <strong>and</strong> systems”<br />
(categorically ambiguous without category names)<br />
Note that <strong>in</strong> the case of descriptions that were categorically<br />
ambiguous <strong>and</strong> with category names, the particular category<br />
name conta<strong>in</strong>ed <strong>in</strong> the descriptions always corresponded to<br />
the category to which the descriptions belong. In other<br />
words, there were no mislead<strong>in</strong>g category names <strong>in</strong> the<br />
descriptions chosen.<br />
Four descriptions were selected from each category<br />
(except Religion) to cover the four factorial conditions,<br />
result<strong>in</strong>g <strong>in</strong> a total of 32 search task trials. Their length was<br />
between 46 to 89 characters (~66 characters on average).<br />
The tasks were selected so that average length was<br />
comparable across all four conditions of descriptions,<br />
average ambiguity was comparable with<strong>in</strong> the two<br />
ambiguity classes (with <strong>and</strong> without category names), <strong>and</strong><br />
average word overlap was comparable with<strong>in</strong> the two<br />
overlap classes (categorically ambiguous <strong>and</strong> categorically<br />
unambiguous).<br />
Procedure<br />
In the beg<strong>in</strong>n<strong>in</strong>g of each trial, participants were given the<br />
search target alone on a separate page. They were <strong>in</strong>structed<br />
to read the description through before click<strong>in</strong>g on a<br />
“cont<strong>in</strong>ue” l<strong>in</strong>k which, when pressed, displayed the top level<br />
categories <strong>and</strong> started the timer. Each task scenario was<br />
term<strong>in</strong>ated upon either f<strong>in</strong>d<strong>in</strong>g the target expert or after two<br />
m<strong>in</strong>utes had elapsed. Then the next trial was presented.<br />
The web application recorded <strong>and</strong> time-stamped every<br />
selection performed by the participant. The categories <strong>and</strong><br />
content items were consistently arranged <strong>in</strong> the same order<br />
on the page throughout the tasks. The order of the tasks was<br />
1851<br />
r<strong>and</strong>omized for each participant. Eye movements were<br />
recorded along with displayed web contents.<br />
Eye Movement Data Process<strong>in</strong>g<br />
The analysis of eye movements focused on identify<strong>in</strong>g<br />
fixations on category l<strong>in</strong>ks. On the top-level menu we<br />
def<strong>in</strong>ed 10 non-overlapp<strong>in</strong>g areas of <strong>in</strong>terest (AOIs) which<br />
<strong>in</strong>cluded one AOI cover<strong>in</strong>g each of the 9 category l<strong>in</strong>ks <strong>and</strong><br />
one cover<strong>in</strong>g the top area where the current task description<br />
was displayed. Fixations were def<strong>in</strong>ed as 4 or more<br />
consecutive sampled gaze po<strong>in</strong>ts fall<strong>in</strong>g with<strong>in</strong> an area of 60<br />
pixels <strong>and</strong> with a total duration of at least 100 ms. We then<br />
identified fixations with<strong>in</strong> each of the 10 AOIs. When<br />
calculat<strong>in</strong>g the evaluation time of a given category,<br />
successive fixations with<strong>in</strong> the same AOI for that category<br />
were comb<strong>in</strong>ed, along with <strong>in</strong>terven<strong>in</strong>g saccade <strong>in</strong>tervals.<br />
Results<br />
The analyses focused on the processes lead<strong>in</strong>g up to the<br />
first top-level category selection on each task, where we<br />
hypothesized the effect of top-down process<strong>in</strong>g was most<br />
likely to appear. Of particular <strong>in</strong>terests was the time spent<br />
on evaluat<strong>in</strong>g <strong>in</strong>dividual categories. We use fixation<br />
duration as the <strong>in</strong>dicator of evaluation time <strong>and</strong> hypothesize<br />
that the durations of fixations on <strong>in</strong>dividual category l<strong>in</strong>ks<br />
would be shorter <strong>in</strong> the search for categorically<br />
unambiguous targets than for categorically ambiguous<br />
targets. In addition, because the categories were fixed <strong>and</strong><br />
displayed at constant locations throughout the session, it is<br />
possible that a participant with a category <strong>in</strong> m<strong>in</strong>d may look<br />
directly toward the desired category without evaluat<strong>in</strong>g any<br />
other l<strong>in</strong>k. In that case, top-down process<strong>in</strong>g may be<br />
evidenced as select<strong>in</strong>g the first fixated l<strong>in</strong>k. Analysis to<br />
follow also exam<strong>in</strong>ed this possibility.<br />
Validation of Category Ambiguity Manipulation<br />
As a first step, we sought validation for the manipulation<br />
of category ambiguity by exam<strong>in</strong><strong>in</strong>g the percentages of<br />
correct category selections. For the unambiguous targets,<br />
the category lead<strong>in</strong>g to the target was <strong>in</strong>itially selected on<br />
88.8% of the trials for those with category names <strong>and</strong> 86.2%<br />
for those without. For the ambiguous targets, the category<br />
lead<strong>in</strong>g to the target was <strong>in</strong>itially selected on 70.0% of the<br />
trials for those with category names <strong>and</strong> 55.0% for those<br />
without.<br />
Additional validation for the manipulation of category<br />
ambiguity could be seen <strong>in</strong> the total numbers of different<br />
categories exam<strong>in</strong>ed. For the unambiguous targets,<br />
participants exam<strong>in</strong>ed on average 3.8 different categories<br />
for those with category names <strong>and</strong> 3.9 categories for those<br />
without. For the ambiguous targets, participants exam<strong>in</strong>ed<br />
on average 4.5 different categories for those with category<br />
names <strong>and</strong> 5.8 for those without. Results from a repeated<br />
Analysis of Variance (ANOVA) with factors of category<br />
ambiguity <strong>and</strong> presence of category names showed<br />
significantly more categories were fixated for categorically<br />
ambiguous targets, F(1, 9) = 22.59, p < .001.
In summary, for categorically unambiguous targets<br />
participants evaluated fewer categories <strong>and</strong> their f<strong>in</strong>al<br />
selections converged greatly on the category that would lead<br />
to the target.<br />
Category L<strong>in</strong>k Evaluative <strong>Processes</strong><br />
Next, we exam<strong>in</strong>ed the evaluative processes <strong>in</strong> terms of<br />
total time taken, numbers of fixations generated prior to<br />
category selection, <strong>and</strong> the duration of fixations. For the<br />
time taken to make the first category selection, we extracted<br />
from video record<strong>in</strong>gs the elapsed time from when the toplevel<br />
categories were first visible to when they were last<br />
seen before the page transitioned to show the sub-categories<br />
of a selected category. S<strong>in</strong>ce trials were blocked by<br />
participant, we performed mixed model analyses where the<br />
participants were modeled as a r<strong>and</strong>om effect (S<strong>in</strong>ger,<br />
1998). We then tested the fixed effects of trial number,<br />
category ambiguity, the presence of category names, <strong>and</strong> the<br />
<strong>in</strong>teraction between category ambiguity <strong>and</strong> the presence of<br />
category names. The mixed model revealed a significant<br />
effect for the trial number, F(1, 306) = 19.94, p < .0001. The<br />
effect of category ambiguity was significant, F(1, 306) =<br />
11.51, p < .0001. While the effect of the presence of<br />
category names was not significant, F(1, 306) = 2.25, p =<br />
.1350, the <strong>in</strong>teraction of category ambiguity <strong>and</strong> the<br />
presence of trigger was significant, F(1, 306) = 4.46, p =<br />
.0354. The means for the four conditions of targets are<br />
presented <strong>in</strong> Table 1 (the mixed model provided a st<strong>and</strong>ard<br />
error of .57 for all four means). The <strong>in</strong>teraction of ambiguity<br />
<strong>and</strong> presence of category names can be seen <strong>in</strong> the<br />
difference <strong>in</strong> selection times. For the ambiguous targets, the<br />
presence of category names resulted <strong>in</strong> selection times that<br />
were more than 1 second faster on average. However, the<br />
presence of category names did not yield faster selection<br />
times for the unambiguous targets.<br />
Category<br />
Ambiguity<br />
Presence<br />
of<br />
category<br />
names<br />
Mean<br />
Selection<br />
Time<br />
St<strong>and</strong>ard<br />
Error<br />
Ambiguous None 5.8 .57<br />
Ambiguous Present 4.5 .57<br />
Unambiguous None 3.1 .57<br />
Unambiguous Present 3.3 .57<br />
Table 1. Model estimated mean selection time (<strong>in</strong> seconds)<br />
<strong>and</strong> st<strong>and</strong>ard error of first l<strong>in</strong>k selection times.<br />
Although time to first category selection provides a<br />
general measure of search efficiency, faster selection times<br />
observed for categorically unambiguous targets than<br />
categorically ambiguous targets do not necessarily suggest<br />
different processes, especially <strong>in</strong> light of the f<strong>in</strong>d<strong>in</strong>g that<br />
fewer categories were exam<strong>in</strong>ed <strong>in</strong> the former case. An<br />
exam<strong>in</strong>ation of the fixation data showed that efficient search<br />
of categorically unambiguous targets was marked by both<br />
fewer fixations <strong>and</strong> shorter fixation durations, provid<strong>in</strong>g<br />
some <strong>in</strong>dication of differences <strong>in</strong> process<strong>in</strong>g. Table 2 shows<br />
1852<br />
<strong>in</strong> each condition average numbers of fixations made prior<br />
to the first category selection <strong>and</strong> their average durations. 1<br />
On average, fixation durations (i.e., l<strong>in</strong>k evaluation times)<br />
were approximately 20 ms shorter for categorically<br />
unambiguous targets. Although this decrease <strong>in</strong> l<strong>in</strong>k<br />
evaluation time was consistent with the use of a top-down<br />
strategy, results from a repeated ANOVA on fixation<br />
durations with factors of category ambiguity <strong>and</strong> presence<br />
of triggers failed to f<strong>in</strong>d it to be statistically reliable, F(1, 9)<br />
= 2.91, p > 0.12.<br />
Category<br />
Ambiguity<br />
Presence<br />
of<br />
category<br />
names<br />
Number<br />
of fixations<br />
Duration<br />
of fixations<br />
(ms)<br />
Ambiguous None 17.1 283<br />
Ambiguous Present 11.7 279<br />
Unambiguous None 8.7 267<br />
Unambiguous Present 8.8 256<br />
Table 2. Average number of fixations made dur<strong>in</strong>g the<br />
first visit to the top-level category menu <strong>and</strong> their average<br />
durations<br />
The calculation of l<strong>in</strong>k evaluation time requires there be<br />
at least one qualified fixation fall<strong>in</strong>g with the AOI of a<br />
category l<strong>in</strong>k dur<strong>in</strong>g the first visit to the top-level category<br />
menu. As mentioned previously, top-down strategy may<br />
lead participants to saccade directly to a desired l<strong>in</strong>k without<br />
evaluat<strong>in</strong>g any other l<strong>in</strong>k <strong>in</strong> the process. In such cases, there<br />
would be no qualified fixations for analysis. To capture this<br />
aspect of top-down process<strong>in</strong>g, we exam<strong>in</strong>ed the number of<br />
trials where the target category was visited on the very first<br />
fixation, out of the eight total trials <strong>in</strong> each condition. For<br />
categorically unambiguous targets, participants visited the<br />
target category on their first fixation on 2.8 out of 8 trials<br />
for those with triggers <strong>and</strong> 2.8 out of 8 for those without<br />
triggers. For categorically ambiguous targets, participants<br />
visited the target category on the first fixation on 1.8 out of<br />
8 trials for those with triggers <strong>and</strong> 1.9 out of 8 for those<br />
without triggers. Clearly participants were able to quickly<br />
narrow their search down to one category immediately at<br />
least on some trials regardless whether the targets were<br />
considered categorically ambiguous or not. However, they<br />
were more likely to do so for categorically unambiguous<br />
targets.<br />
General Discussion<br />
The present f<strong>in</strong>d<strong>in</strong>gs suggest that l<strong>in</strong>k evaluation possibly<br />
<strong>in</strong>volves more than assess<strong>in</strong>g the semantic distance between<br />
the search target <strong>and</strong> each of the encountered l<strong>in</strong>ks. When<br />
the search target clearly <strong>in</strong>dicated its own category,<br />
participants evaluated fewer categories <strong>and</strong> spent less time<br />
on each. They were also more likely to fixate directly the<br />
1 Calculation of average fixation durations did not <strong>in</strong>clude the<br />
last fixation made dur<strong>in</strong>g the first visit to the top-level category<br />
menu as it is <strong>in</strong>flated by the time to make manual selections.
target category, bypass<strong>in</strong>g l<strong>in</strong>k evaluation altogether. These<br />
f<strong>in</strong>d<strong>in</strong>gs are consistent with the possibility that participants<br />
generated their own category names through top-down<br />
process<strong>in</strong>g prior to carry<strong>in</strong>g out the search process.<br />
Given the equivalent <strong>and</strong> overall fast selections <strong>in</strong> the two<br />
category-unambiguous conditions, it is reasonable to believe<br />
that participants generated their own category names even <strong>in</strong><br />
the absence of them based on their knowledge of the<br />
descriptions <strong>and</strong> the <strong>in</strong>formation architecture. In contrast,<br />
use of a bottom-up strategy seems implausible for the faster<br />
selection times (i.e. < 3 seconds). After account<strong>in</strong>g for page<br />
refresh, mouse movement <strong>and</strong> l<strong>in</strong>k selection, little time<br />
rema<strong>in</strong>s for an item-by-item evaluation of the category<br />
labels, which must take at least 230 ms per l<strong>in</strong>k to account<br />
for the time needed for each eye movement (Card, Moran,<br />
& Newell, 1983).<br />
The present f<strong>in</strong>d<strong>in</strong>gs also suggest that the presence of<br />
category names facilitates category selection at least when<br />
the search target does not have a clear category selection. In<br />
these cases, the participants may still use the more efficient<br />
top-down strategy, rely<strong>in</strong>g on the category name <strong>in</strong> the<br />
description to perform the visual search for its physical<br />
properties. Category ambiguity still <strong>in</strong>curs a greater time<br />
cost, possibly to confirm category membership or consider<br />
alternate categories before mak<strong>in</strong>g the selection.<br />
Not surpris<strong>in</strong>gly, the trial number was a significant factor<br />
for the amount of time used to select a l<strong>in</strong>k. In particular, the<br />
selection time decreased with the number of trials. This<br />
result is consistent with <strong>in</strong>creased use of top-down strategies<br />
as users become more familiar with the categories <strong>and</strong> their<br />
location on the page.<br />
The presence of efficient top-down strategies has<br />
implications for the effective design of web pages <strong>and</strong> menu<br />
systems. In applied web design usability research, the<br />
superord<strong>in</strong>ate category names are sometimes referred to as<br />
“trigger words” to signify <strong>in</strong> a post-hoc manner their be<strong>in</strong>g a<br />
user’s <strong>in</strong>ternally formulated search goal <strong>and</strong> trigger<strong>in</strong>g a<br />
selection response from the user (Spool, Perfetti, & Brittan,<br />
2004). To the extent that users form key “trigger words” <strong>and</strong><br />
then scan for their physical properties, care must be taken to<br />
<strong>in</strong>clude these words <strong>in</strong> the selection labels. Otherwise, users<br />
may miss the targeted selection even if its label is highly<br />
relevant to the user’s <strong>in</strong>formation goal. This consideration is<br />
consistent with Spool’s advice of identify<strong>in</strong>g common<br />
trigger words for users <strong>and</strong> <strong>in</strong>clud<strong>in</strong>g them <strong>in</strong> l<strong>in</strong>k labels.<br />
Certa<strong>in</strong>ly any postulation of dichotomy is prone to<br />
oversimplification. The question is whether mak<strong>in</strong>g such a<br />
dist<strong>in</strong>ction is useful <strong>in</strong> the underst<strong>and</strong><strong>in</strong>g of different<br />
mixture of processes that may go on <strong>in</strong> l<strong>in</strong>k-based search.<br />
Although computational model<strong>in</strong>g <strong>in</strong> web search has seen<br />
great improvements, empirical <strong>in</strong>vestigation of the issue <strong>in</strong><br />
our op<strong>in</strong>ion is still lack<strong>in</strong>g. The present research represents a<br />
step <strong>in</strong> f<strong>in</strong>d<strong>in</strong>g factors that elicit top-down process<strong>in</strong>g <strong>and</strong><br />
measures that <strong>in</strong>dicate when top-down process<strong>in</strong>g occurs.<br />
1853<br />
Acknowledgments<br />
This work was funded by grant NNA06CA99A from<br />
NASA’s Human Research Program. We are <strong>in</strong>debted to Jim<br />
Lofton who developed the "UI Nav Test" web application,<br />
which we used to adm<strong>in</strong>ister our navigation studies. We also<br />
thank Paul Roth, Mike Niebl<strong>in</strong>g, Ben Burton, Nils Hanson,<br />
Dev<strong>in</strong> Carter <strong>and</strong> Anna Eskra for test<strong>in</strong>g <strong>and</strong> comment<strong>in</strong>g<br />
on prelim<strong>in</strong>ary versions of the web application, Jaymie<br />
Massoletti for assistance with data analysis, Usha<br />
Viswanathan for participants recruit, <strong>and</strong> Joel Lachter for<br />
equipment setup.<br />
References<br />
Card, S. K., Moran, T.P. & Newell, A. (1983). The<br />
Psychology of Human-Computer Interaction. Hillsdale,<br />
NJ: LEA.<br />
Fleetwood, M. D., & Byrne, M. D. (2006). Model<strong>in</strong>g the<br />
visual search of displays: A revised ACT-R model of icon<br />
search based on eye-track<strong>in</strong>g data. Human-Computer<br />
Interaction, 21, 153-197.<br />
Fu, W.-T., <strong>and</strong> Pirolli, P. (2007). SNIF-ACT: A cognitive<br />
model of user navigation on the World Wide <strong>Web</strong>.<br />
Human-Computer Interaction, 22, 355-412..<br />
Hornof, A. J. (2004). Cognitive strategies for the visual<br />
search of hierarchical computer displays. Human-<br />
Computer Interaction, 19, 183-223.<br />
Kitajima, M., Blackmon, M.H., <strong>and</strong> Polson, P.G. (2000). A<br />
comprehension-based model of web navigation <strong>and</strong> its<br />
application to web usability analysis. In S. McDonald, Y.<br />
Waern, & G. Cockton (Eds.), People <strong>and</strong> Computers XIV<br />
– Usability or Else: Proceed<strong>in</strong>gs of HCI 2000 (pp. 357-<br />
373). London: Spr<strong>in</strong>ger.<br />
Miller, C. S., Fuchs, S., Anantharaman, N., & Kulkarni, P.<br />
(2007). Compar<strong>in</strong>g two methods for predict<strong>in</strong>g navigation<br />
problems <strong>in</strong> <strong>in</strong>formation hierarchies. Human Factors <strong>and</strong><br />
Ergonomics Society Annual Meet<strong>in</strong>g Proceed<strong>in</strong>gs (pp.<br />
363-367). Santa Monica, CA: HFES.<br />
Miller, C. S., <strong>and</strong> Rem<strong>in</strong>gton, R.W. (2004). Model<strong>in</strong>g<br />
<strong>in</strong>formation navigation: Implications for <strong>in</strong>formation<br />
architecture. Human-Computer Interaction, 19, 225-271.<br />
S<strong>in</strong>ger, J. D. (1998). Us<strong>in</strong>g SAS PROC MIXED to fit<br />
multilevel models, hierarchical models, <strong>and</strong> <strong>in</strong>dividual<br />
growth models. Journal of Educational <strong>and</strong> Behavioral<br />
Statistics, 23, 323-355.<br />
Spool, J. M., Perfetti, C., & Brittan, D. (2004). Design<strong>in</strong>g<br />
for the scent of <strong>in</strong>formation. North Andover, MA: User<br />
Interface Eng<strong>in</strong>eer<strong>in</strong>g.