14.07.2013 Views

Top-Down and Bottom-Up Processes in Web Search Navigation

Top-Down and Bottom-Up Processes in Web Search Navigation

Top-Down and Bottom-Up Processes in Web Search Navigation

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Top</strong>-<strong>Down</strong> <strong>and</strong> <strong>Bottom</strong>-<strong>Up</strong> <strong>Processes</strong> <strong>in</strong> <strong>Web</strong> <strong>Search</strong> <strong>Navigation</strong><br />

Shu-Chieh Wu (shu-chieh.wu@nasa.gov)<br />

NASA Ames Research Center <strong>and</strong> San Jose State University, MS 262-4<br />

Moffett Field, CA 94035 USA<br />

Abstract<br />

In current theories of web navigation, l<strong>in</strong>k evaluation has<br />

been treated primarily as a bottom-up process <strong>in</strong>volv<strong>in</strong>g<br />

assess<strong>in</strong>g the semantic distance between a search target <strong>and</strong> a<br />

given l<strong>in</strong>k <strong>in</strong> the <strong>in</strong>formation architecture. We <strong>in</strong>vestigated<br />

whether there exists top-down <strong>in</strong>fluence from familiarity with<br />

the search target <strong>and</strong>/or the <strong>in</strong>formation architecture. We<br />

compared search performance on targets that varied <strong>in</strong> the<br />

level of category ambiguity <strong>and</strong> the presence of category<br />

names. We found that categorically unambiguous search<br />

targets resulted <strong>in</strong> fewer categories be<strong>in</strong>g evaluated, fewer<br />

fixations <strong>and</strong> shorter fixation durations, <strong>and</strong> overall fast <strong>in</strong>itial<br />

classification regardless of the presence of category names,<br />

suggest<strong>in</strong>g proactive use of top-down knowledge <strong>in</strong> guid<strong>in</strong>g<br />

search behavior. Our results have implications on design<strong>in</strong>g<br />

<strong>in</strong>formation architectures that support efficient top-down<br />

strategies <strong>in</strong> searches for menu items <strong>and</strong> web l<strong>in</strong>ks.<br />

Keywords: <strong>Web</strong> navigation; web search; eye movements;<br />

eye track<strong>in</strong>g; visual scann<strong>in</strong>g; menu organization.<br />

Introduction<br />

The prevalence <strong>and</strong> improved accuracy of web search<br />

technologies have transformed the process of how users<br />

seek <strong>in</strong>formation. While keyword search has arguably<br />

become the most dom<strong>in</strong>ant method for locat<strong>in</strong>g <strong>in</strong>formation<br />

on the <strong>in</strong>ternet, search through navigation rema<strong>in</strong>s an<br />

important element of web user experience. <strong>Navigation</strong><br />

offers the ability to locate unfamiliar targets <strong>in</strong> the absence<br />

of proper keywords <strong>and</strong> to browse all available <strong>in</strong>stances on<br />

a particular theme. Outside of the web environment,<br />

navigation through multiple levels of menus also rema<strong>in</strong>s<br />

the primary method for locat<strong>in</strong>g <strong>in</strong>formation on personal<br />

digital devices such as digital cameras, personal digital<br />

assistants (PDAs) <strong>and</strong> cellular phones, where implement<strong>in</strong>g<br />

a search function is impractical given the limitation <strong>in</strong><br />

display areas <strong>and</strong> <strong>in</strong>put methods. Different from keyword<br />

search where results are collated <strong>and</strong> returned, search<br />

through navigation requires users to click through a series of<br />

l<strong>in</strong>ks <strong>in</strong> order to navigate themselves toward search goals.<br />

How users at any given moment search among available<br />

l<strong>in</strong>ks <strong>and</strong> choose one that they believe will lead them to their<br />

goals is a question of both theoretical <strong>in</strong>terest <strong>and</strong> practical<br />

implications on the design of efficient <strong>in</strong>formation<br />

architecture.<br />

In this paper, we exam<strong>in</strong>e the process underly<strong>in</strong>g l<strong>in</strong>k<br />

evaluation, a critical element of navigation. The result of<br />

l<strong>in</strong>k evaluation dictates how a user navigates, whether it is<br />

to proceed with a l<strong>in</strong>k or to backtrack from a dead-end page.<br />

Craig S. Miller (cmiller@cs.depaul.edu)<br />

DePaul University, 243 S. Wabash Avenue<br />

Chicago, IL 60657 USA<br />

1848<br />

Theories of web navigation have focused on predict<strong>in</strong>g<br />

among a group of l<strong>in</strong>ks which one will more likely be<br />

selected <strong>in</strong> actual user behavior but with little attention paid<br />

to the evaluative process, per se, of a s<strong>in</strong>gle l<strong>in</strong>k. This paper<br />

aims to address possible top-down <strong>and</strong> bottom-up factors<br />

that <strong>in</strong>fluence l<strong>in</strong>k evaluation.<br />

L<strong>in</strong>k Evaluation<br />

Despite its central role <strong>in</strong> navigation, the process underly<strong>in</strong>g<br />

l<strong>in</strong>k evaluation is not always explicitly def<strong>in</strong>ed <strong>in</strong> current<br />

theories of web navigation. Some models like MESA<br />

(Method for Evaluat<strong>in</strong>g Site Architectures) have l<strong>in</strong>k<br />

evaluation outcomes generated outside the model by<br />

separate procedures <strong>and</strong> fed <strong>in</strong>to the model (Miller &<br />

Rem<strong>in</strong>gton, 2004). Others like CoLiDeS (Comprehensionbased<br />

L<strong>in</strong>ked model of Deliberate <strong>Search</strong>) (Kitajima,<br />

Blackmon, & Polson, 2000) <strong>and</strong> SNIF-ACT (Scent-based<br />

<strong>Navigation</strong> <strong>and</strong> Information Forag<strong>in</strong>g <strong>in</strong> the ACT<br />

architecture) (Fu & Pirolli, 2007) model l<strong>in</strong>k evaluation as<br />

measur<strong>in</strong>g the semantic distance between an encountered<br />

l<strong>in</strong>k <strong>and</strong> a search target. There is no question that l<strong>in</strong>k<br />

evaluation <strong>in</strong>volves some form of semantic comparison. The<br />

question is where the comparison takes place; that is,<br />

whether the comparison really is between the particular l<strong>in</strong>k<br />

be<strong>in</strong>g evaluated <strong>and</strong> the search target, an assumption shared<br />

by both models.<br />

One reason to question such an assumption of l<strong>in</strong>k<br />

evaluation processes is that users often possess a certa<strong>in</strong><br />

amount of knowledge, not only of their search targets (i.e.,<br />

queries) but also of the likely options from the <strong>in</strong>formation<br />

architecture. For example, a shopper who wishes to<br />

purchase an MP3 player from an onl<strong>in</strong>e store would not<br />

necessarily expect to f<strong>in</strong>d the word MP3 on the front page of<br />

the store. Rather, a savvy shopper may expect to f<strong>in</strong>d l<strong>in</strong>ks<br />

like electronics or portable audios. In other words, to<br />

facilitate search, users are likely to take <strong>in</strong>to account what<br />

opportunities are available based on their prior experience<br />

<strong>and</strong> apply that knowledge to rephrase their search targets <strong>in</strong><br />

ways conform<strong>in</strong>g to what they believe to be conventional<br />

contents of the <strong>in</strong>formation architecture.<br />

What is be<strong>in</strong>g suggested here is that l<strong>in</strong>k evaluation likely<br />

<strong>in</strong>volves not only a bottom-up process <strong>in</strong> which the mean<strong>in</strong>g<br />

of an encountered l<strong>in</strong>k is compared to that of the search<br />

target but also a top-down process that precedes evaluation<br />

<strong>in</strong> which a user rephrases the search target <strong>in</strong> languages<br />

closer to those available from the <strong>in</strong>formation architecture.


They differ <strong>in</strong> the level at which a user <strong>in</strong>jects his/her prior<br />

knowledge of both the search goal (i.e., query) <strong>and</strong> the<br />

available l<strong>in</strong>ks <strong>in</strong>to the search process, <strong>and</strong> consequently the<br />

nature of the evaluative process. The difference between our<br />

stipulation of the top-down processes (rephras<strong>in</strong>g a search<br />

target to match available l<strong>in</strong>ks) <strong>and</strong> assess<strong>in</strong>g semantic<br />

distance between a given l<strong>in</strong>k <strong>and</strong> words related to the<br />

search target (as <strong>in</strong> <strong>in</strong>formation forag<strong>in</strong>g) is that the former<br />

could lead to cases where no evaluative process takes place<br />

because the user simply knows what he/she is look<strong>in</strong>g for<br />

<strong>and</strong> where it is. In top-down l<strong>in</strong>k evaluation, the nature of<br />

the evaluative process is simplified from comparison to<br />

recognition. Note that Hornof (2004) <strong>and</strong> Fleetwood <strong>and</strong><br />

Byrne (2006) study processes where the user is search<strong>in</strong>g<br />

for a specific str<strong>in</strong>g. This is ak<strong>in</strong> to what happens <strong>in</strong> a topdown<br />

strategy.<br />

Figure 1 proposes an idealized process that uses two<br />

dist<strong>in</strong>ct strategies for search<strong>in</strong>g a menu item depend<strong>in</strong>g on<br />

the ability to first recall a category phrase that might appear<br />

<strong>in</strong> the desired selection. After acquir<strong>in</strong>g the search target,<br />

the first condition box <strong>in</strong>dicates that a user tries to retrieve a<br />

relevant phrase that likely appears <strong>in</strong> the menu. If the user is<br />

able to retrieve a relevant phrase, it employs a top-down<br />

strategy. That is, it prepares a search for the physical<br />

properties of that phrase. If a relevant phrase cannot be<br />

retrieved, it employs a bottom-up strategy, which requires<br />

an item-by-item calculation of category membership. In this<br />

way, the bottom-up strategy <strong>in</strong>curs a greater time cost when<br />

search<strong>in</strong>g through menu items, not only because it likely<br />

requires more iterations of l<strong>in</strong>k visitation <strong>and</strong> evaluation<br />

before a f<strong>in</strong>al selection can be made but also because the<br />

evaluative process itself takes longer than that <strong>in</strong> the topdown<br />

strategy. Note that for the top-down strategy, this<br />

schematic <strong>in</strong>dicates that a user still confirms category<br />

membership when f<strong>in</strong>d<strong>in</strong>g a physical match, but it is<br />

possible that a confident user may forgo this step if the<br />

<strong>in</strong>itially retrieved phrase is almost certa<strong>in</strong>ly part of the<br />

desired category.<br />

Figure 1: A schematic diagram of how search proceeds under top-down <strong>and</strong> bottom-up processes<br />

1849


The possibility of top-down strategies has at least two<br />

important implications for how users perform at web <strong>and</strong><br />

menu navigation. First, a top-down strategy would<br />

presumably be more efficient when the user can look for the<br />

physical features of a specific target by way of a faster scan<br />

of the menu items. Second, a top-down strategy may cause a<br />

user to overlook highly relevant items if their labels do not<br />

physically match the character sequence that is anticipated<br />

<strong>in</strong> the target label. Given its implications for human<br />

performance, underst<strong>and</strong><strong>in</strong>g when top-down strategies occur<br />

could be helpful for design<strong>in</strong>g <strong>in</strong>formation architectures <strong>and</strong><br />

diagnos<strong>in</strong>g navigation problems when they occur.<br />

Present Research<br />

In the present research, we <strong>in</strong>vestigate whether there is<br />

evidence for top-down process<strong>in</strong>g <strong>in</strong> <strong>in</strong>dividual l<strong>in</strong>k<br />

evaluation. We hypothesize that the dist<strong>in</strong>ction between topdown<br />

<strong>and</strong> bottom-up processes is most likely revealed <strong>in</strong> the<br />

comparison between search for categorically ambiguous <strong>and</strong><br />

categorically unambiguous goals. When the search goal<br />

clearly <strong>in</strong>dicates its own category, users are more likely to<br />

apply their prior knowledge of the <strong>in</strong>formation architecture<br />

to rephrase the search target <strong>and</strong> transform the process of<br />

l<strong>in</strong>k evaluation to recognition. Conversely, when the search<br />

goal is categorically ambiguous, users are more likely to<br />

depend on bottom-up processes <strong>and</strong> compare the search<br />

target aga<strong>in</strong>st each encountered l<strong>in</strong>k. Certa<strong>in</strong>ly categorically<br />

ambiguous targets will lead to a longer search process<br />

where<strong>in</strong> more l<strong>in</strong>ks are evaluated. The critical prediction<br />

here however is on <strong>in</strong>dividual l<strong>in</strong>k evaluation, that it should<br />

take less time with categorically unambiguous than<br />

categorically ambiguous goals. Alternatively, if search is<br />

carried out exclusively through bottom-up processes,<br />

<strong>in</strong>dividual l<strong>in</strong>k evaluation time should be equivalent <strong>in</strong><br />

searches for both categorically ambiguous <strong>and</strong> unambiguous<br />

targets because the nature of the evaluative process rema<strong>in</strong>s<br />

the same.<br />

Naturally, categorically unambiguous goals are more<br />

likely to conta<strong>in</strong> category names as part of their description.<br />

As a result, faster l<strong>in</strong>k evaluation could be equally<br />

attributable to close semantic distance. To control for this<br />

potential confound, we <strong>in</strong>dependently manipulated category<br />

ambiguity <strong>and</strong> the presence of category names. If l<strong>in</strong>k<br />

evaluation is subject to top-down <strong>in</strong>fluence, we would<br />

expect to see fast l<strong>in</strong>k evaluation <strong>in</strong> category-unambiguous<br />

search goals despite the absence of category names.<br />

Experiment<br />

Participants<br />

Ten students recruited from local colleges participated.<br />

They had no experience with the expert database <strong>and</strong> were<br />

naïve to the purpose of the study.<br />

1850<br />

Apparatus<br />

The study was carried out on a Pentium 4 PC runn<strong>in</strong>g<br />

Firefox. Eye movements were monitored us<strong>in</strong>g a headmounted<br />

high-speed eye tracker (Applied Sciences<br />

Laboratory, Model 501) with eye-head <strong>in</strong>tegration function,<br />

sampl<strong>in</strong>g at 120Hz. Gaze positions were then synchronized<br />

with recorded scenes us<strong>in</strong>g GazeTracker software (Eye<br />

Response Technology), which records video at 640x480<br />

pixel resolution <strong>and</strong> samples at 40 frames per second.<br />

Doma<strong>in</strong> for the Information Architecture<br />

The website used <strong>in</strong> the study was generated based on an<br />

“expert database” ma<strong>in</strong>ta<strong>in</strong>ed by the Media Relation<br />

Department of DePaul University, which conta<strong>in</strong>s<br />

descriptions of 970 university faculty members <strong>and</strong> their<br />

respective areas of expertise as a resource for journalists <strong>in</strong><br />

need for a subject-matter expert. The database was<br />

organized <strong>in</strong> levels of categories <strong>and</strong> sub-categories <strong>and</strong><br />

implemented <strong>in</strong> a web-based application. For details of the<br />

categories <strong>and</strong> the process by which they were derived, see<br />

Miller et al. (2007). In the present research, the expert<br />

database was reduced <strong>and</strong> restructured to conta<strong>in</strong> one level<br />

of top categories, one level of various numbers of subcategories,<br />

<strong>and</strong> at the bottom one level of content items (i.e.,<br />

expert descriptions). The result<strong>in</strong>g database had the<br />

follow<strong>in</strong>g 9 top-level categories: Arts <strong>and</strong> Literature,<br />

Bus<strong>in</strong>ess <strong>and</strong> Economics, Education, Law <strong>and</strong> Legal, Health<br />

<strong>and</strong> Medic<strong>in</strong>e, Politics <strong>and</strong> Public Policy, Religion, Society<br />

<strong>and</strong> Culture, <strong>and</strong> Science <strong>and</strong> Technology.<br />

Task <strong>and</strong> Design<br />

Participants were given descriptions of experts <strong>and</strong> asked<br />

to locate them <strong>in</strong> the web application. The descriptions<br />

chosen as search targets varied accord<strong>in</strong>g to two factors:<br />

category ambiguity <strong>and</strong> the presence of category names. We<br />

h<strong>and</strong>-selected the descriptions that represented clear cases<br />

for each of the four factorial conditions. The level of<br />

category ambiguity was determ<strong>in</strong>ed <strong>in</strong> a previous study<br />

(Miller et al., 2007) based on how users assigned the<br />

descriptions to the 9 top-level categories. We used the<br />

follow<strong>in</strong>g calculation to measure the distribution of category<br />

choices for a description:<br />

ambiguity = 1 −<br />

c<br />

∑<br />

i=<br />

1<br />

Here, c equals the number of categories, Si is the number of<br />

choices for a particular category <strong>and</strong> n is the total number of<br />

choices. For a fully unambiguous description, where the<br />

description was consistently assigned to only one category,<br />

the ambiguity measure is 0. For the present study, we used<br />

the ambiguity measure as a guide for select<strong>in</strong>g categorically<br />

ambiguous <strong>and</strong> unambiguous descriptions while also<br />

controll<strong>in</strong>g for their length. In the end, all descriptions<br />

classified as categorically unambiguous had an ambiguity<br />

⎛<br />

⎜<br />

⎝<br />

si<br />

n<br />

⎞<br />

⎟<br />

⎠<br />

2


measure less than .25 <strong>and</strong> all descriptions classified as<br />

categorically ambiguous had a measure greater than .45.<br />

The presence of category names was determ<strong>in</strong>ed by<br />

calculat<strong>in</strong>g the amount of co-occurrence of words between a<br />

particular expert description <strong>and</strong> the category to which most<br />

users assigned the description. Descriptions with no cooccurrence<br />

were assigned to one condition. For the opposite<br />

condition, a description was considered hav<strong>in</strong>g category<br />

names if it conta<strong>in</strong>ed words from the category to which<br />

most users assigned the description.<br />

To illustrate, here are examples of the four types of<br />

descriptions from Science <strong>and</strong> Technology category:<br />

“Use of technology <strong>in</strong>clud<strong>in</strong>g computers,<br />

telecommunications <strong>and</strong> multimedia”<br />

(categorically unambiguous with category names)<br />

“Fiber-optic communications, chaos <strong>and</strong> optical<br />

systems, lasers”<br />

(categorically unambiguous without category names)<br />

“Ethics of new technology, employee privacy <strong>and</strong><br />

technology”<br />

(categorically ambiguous with category names)<br />

“The harmful effects environmental contam<strong>in</strong>ation has<br />

on liv<strong>in</strong>g organisms <strong>and</strong> systems”<br />

(categorically ambiguous without category names)<br />

Note that <strong>in</strong> the case of descriptions that were categorically<br />

ambiguous <strong>and</strong> with category names, the particular category<br />

name conta<strong>in</strong>ed <strong>in</strong> the descriptions always corresponded to<br />

the category to which the descriptions belong. In other<br />

words, there were no mislead<strong>in</strong>g category names <strong>in</strong> the<br />

descriptions chosen.<br />

Four descriptions were selected from each category<br />

(except Religion) to cover the four factorial conditions,<br />

result<strong>in</strong>g <strong>in</strong> a total of 32 search task trials. Their length was<br />

between 46 to 89 characters (~66 characters on average).<br />

The tasks were selected so that average length was<br />

comparable across all four conditions of descriptions,<br />

average ambiguity was comparable with<strong>in</strong> the two<br />

ambiguity classes (with <strong>and</strong> without category names), <strong>and</strong><br />

average word overlap was comparable with<strong>in</strong> the two<br />

overlap classes (categorically ambiguous <strong>and</strong> categorically<br />

unambiguous).<br />

Procedure<br />

In the beg<strong>in</strong>n<strong>in</strong>g of each trial, participants were given the<br />

search target alone on a separate page. They were <strong>in</strong>structed<br />

to read the description through before click<strong>in</strong>g on a<br />

“cont<strong>in</strong>ue” l<strong>in</strong>k which, when pressed, displayed the top level<br />

categories <strong>and</strong> started the timer. Each task scenario was<br />

term<strong>in</strong>ated upon either f<strong>in</strong>d<strong>in</strong>g the target expert or after two<br />

m<strong>in</strong>utes had elapsed. Then the next trial was presented.<br />

The web application recorded <strong>and</strong> time-stamped every<br />

selection performed by the participant. The categories <strong>and</strong><br />

content items were consistently arranged <strong>in</strong> the same order<br />

on the page throughout the tasks. The order of the tasks was<br />

1851<br />

r<strong>and</strong>omized for each participant. Eye movements were<br />

recorded along with displayed web contents.<br />

Eye Movement Data Process<strong>in</strong>g<br />

The analysis of eye movements focused on identify<strong>in</strong>g<br />

fixations on category l<strong>in</strong>ks. On the top-level menu we<br />

def<strong>in</strong>ed 10 non-overlapp<strong>in</strong>g areas of <strong>in</strong>terest (AOIs) which<br />

<strong>in</strong>cluded one AOI cover<strong>in</strong>g each of the 9 category l<strong>in</strong>ks <strong>and</strong><br />

one cover<strong>in</strong>g the top area where the current task description<br />

was displayed. Fixations were def<strong>in</strong>ed as 4 or more<br />

consecutive sampled gaze po<strong>in</strong>ts fall<strong>in</strong>g with<strong>in</strong> an area of 60<br />

pixels <strong>and</strong> with a total duration of at least 100 ms. We then<br />

identified fixations with<strong>in</strong> each of the 10 AOIs. When<br />

calculat<strong>in</strong>g the evaluation time of a given category,<br />

successive fixations with<strong>in</strong> the same AOI for that category<br />

were comb<strong>in</strong>ed, along with <strong>in</strong>terven<strong>in</strong>g saccade <strong>in</strong>tervals.<br />

Results<br />

The analyses focused on the processes lead<strong>in</strong>g up to the<br />

first top-level category selection on each task, where we<br />

hypothesized the effect of top-down process<strong>in</strong>g was most<br />

likely to appear. Of particular <strong>in</strong>terests was the time spent<br />

on evaluat<strong>in</strong>g <strong>in</strong>dividual categories. We use fixation<br />

duration as the <strong>in</strong>dicator of evaluation time <strong>and</strong> hypothesize<br />

that the durations of fixations on <strong>in</strong>dividual category l<strong>in</strong>ks<br />

would be shorter <strong>in</strong> the search for categorically<br />

unambiguous targets than for categorically ambiguous<br />

targets. In addition, because the categories were fixed <strong>and</strong><br />

displayed at constant locations throughout the session, it is<br />

possible that a participant with a category <strong>in</strong> m<strong>in</strong>d may look<br />

directly toward the desired category without evaluat<strong>in</strong>g any<br />

other l<strong>in</strong>k. In that case, top-down process<strong>in</strong>g may be<br />

evidenced as select<strong>in</strong>g the first fixated l<strong>in</strong>k. Analysis to<br />

follow also exam<strong>in</strong>ed this possibility.<br />

Validation of Category Ambiguity Manipulation<br />

As a first step, we sought validation for the manipulation<br />

of category ambiguity by exam<strong>in</strong><strong>in</strong>g the percentages of<br />

correct category selections. For the unambiguous targets,<br />

the category lead<strong>in</strong>g to the target was <strong>in</strong>itially selected on<br />

88.8% of the trials for those with category names <strong>and</strong> 86.2%<br />

for those without. For the ambiguous targets, the category<br />

lead<strong>in</strong>g to the target was <strong>in</strong>itially selected on 70.0% of the<br />

trials for those with category names <strong>and</strong> 55.0% for those<br />

without.<br />

Additional validation for the manipulation of category<br />

ambiguity could be seen <strong>in</strong> the total numbers of different<br />

categories exam<strong>in</strong>ed. For the unambiguous targets,<br />

participants exam<strong>in</strong>ed on average 3.8 different categories<br />

for those with category names <strong>and</strong> 3.9 categories for those<br />

without. For the ambiguous targets, participants exam<strong>in</strong>ed<br />

on average 4.5 different categories for those with category<br />

names <strong>and</strong> 5.8 for those without. Results from a repeated<br />

Analysis of Variance (ANOVA) with factors of category<br />

ambiguity <strong>and</strong> presence of category names showed<br />

significantly more categories were fixated for categorically<br />

ambiguous targets, F(1, 9) = 22.59, p < .001.


In summary, for categorically unambiguous targets<br />

participants evaluated fewer categories <strong>and</strong> their f<strong>in</strong>al<br />

selections converged greatly on the category that would lead<br />

to the target.<br />

Category L<strong>in</strong>k Evaluative <strong>Processes</strong><br />

Next, we exam<strong>in</strong>ed the evaluative processes <strong>in</strong> terms of<br />

total time taken, numbers of fixations generated prior to<br />

category selection, <strong>and</strong> the duration of fixations. For the<br />

time taken to make the first category selection, we extracted<br />

from video record<strong>in</strong>gs the elapsed time from when the toplevel<br />

categories were first visible to when they were last<br />

seen before the page transitioned to show the sub-categories<br />

of a selected category. S<strong>in</strong>ce trials were blocked by<br />

participant, we performed mixed model analyses where the<br />

participants were modeled as a r<strong>and</strong>om effect (S<strong>in</strong>ger,<br />

1998). We then tested the fixed effects of trial number,<br />

category ambiguity, the presence of category names, <strong>and</strong> the<br />

<strong>in</strong>teraction between category ambiguity <strong>and</strong> the presence of<br />

category names. The mixed model revealed a significant<br />

effect for the trial number, F(1, 306) = 19.94, p < .0001. The<br />

effect of category ambiguity was significant, F(1, 306) =<br />

11.51, p < .0001. While the effect of the presence of<br />

category names was not significant, F(1, 306) = 2.25, p =<br />

.1350, the <strong>in</strong>teraction of category ambiguity <strong>and</strong> the<br />

presence of trigger was significant, F(1, 306) = 4.46, p =<br />

.0354. The means for the four conditions of targets are<br />

presented <strong>in</strong> Table 1 (the mixed model provided a st<strong>and</strong>ard<br />

error of .57 for all four means). The <strong>in</strong>teraction of ambiguity<br />

<strong>and</strong> presence of category names can be seen <strong>in</strong> the<br />

difference <strong>in</strong> selection times. For the ambiguous targets, the<br />

presence of category names resulted <strong>in</strong> selection times that<br />

were more than 1 second faster on average. However, the<br />

presence of category names did not yield faster selection<br />

times for the unambiguous targets.<br />

Category<br />

Ambiguity<br />

Presence<br />

of<br />

category<br />

names<br />

Mean<br />

Selection<br />

Time<br />

St<strong>and</strong>ard<br />

Error<br />

Ambiguous None 5.8 .57<br />

Ambiguous Present 4.5 .57<br />

Unambiguous None 3.1 .57<br />

Unambiguous Present 3.3 .57<br />

Table 1. Model estimated mean selection time (<strong>in</strong> seconds)<br />

<strong>and</strong> st<strong>and</strong>ard error of first l<strong>in</strong>k selection times.<br />

Although time to first category selection provides a<br />

general measure of search efficiency, faster selection times<br />

observed for categorically unambiguous targets than<br />

categorically ambiguous targets do not necessarily suggest<br />

different processes, especially <strong>in</strong> light of the f<strong>in</strong>d<strong>in</strong>g that<br />

fewer categories were exam<strong>in</strong>ed <strong>in</strong> the former case. An<br />

exam<strong>in</strong>ation of the fixation data showed that efficient search<br />

of categorically unambiguous targets was marked by both<br />

fewer fixations <strong>and</strong> shorter fixation durations, provid<strong>in</strong>g<br />

some <strong>in</strong>dication of differences <strong>in</strong> process<strong>in</strong>g. Table 2 shows<br />

1852<br />

<strong>in</strong> each condition average numbers of fixations made prior<br />

to the first category selection <strong>and</strong> their average durations. 1<br />

On average, fixation durations (i.e., l<strong>in</strong>k evaluation times)<br />

were approximately 20 ms shorter for categorically<br />

unambiguous targets. Although this decrease <strong>in</strong> l<strong>in</strong>k<br />

evaluation time was consistent with the use of a top-down<br />

strategy, results from a repeated ANOVA on fixation<br />

durations with factors of category ambiguity <strong>and</strong> presence<br />

of triggers failed to f<strong>in</strong>d it to be statistically reliable, F(1, 9)<br />

= 2.91, p > 0.12.<br />

Category<br />

Ambiguity<br />

Presence<br />

of<br />

category<br />

names<br />

Number<br />

of fixations<br />

Duration<br />

of fixations<br />

(ms)<br />

Ambiguous None 17.1 283<br />

Ambiguous Present 11.7 279<br />

Unambiguous None 8.7 267<br />

Unambiguous Present 8.8 256<br />

Table 2. Average number of fixations made dur<strong>in</strong>g the<br />

first visit to the top-level category menu <strong>and</strong> their average<br />

durations<br />

The calculation of l<strong>in</strong>k evaluation time requires there be<br />

at least one qualified fixation fall<strong>in</strong>g with the AOI of a<br />

category l<strong>in</strong>k dur<strong>in</strong>g the first visit to the top-level category<br />

menu. As mentioned previously, top-down strategy may<br />

lead participants to saccade directly to a desired l<strong>in</strong>k without<br />

evaluat<strong>in</strong>g any other l<strong>in</strong>k <strong>in</strong> the process. In such cases, there<br />

would be no qualified fixations for analysis. To capture this<br />

aspect of top-down process<strong>in</strong>g, we exam<strong>in</strong>ed the number of<br />

trials where the target category was visited on the very first<br />

fixation, out of the eight total trials <strong>in</strong> each condition. For<br />

categorically unambiguous targets, participants visited the<br />

target category on their first fixation on 2.8 out of 8 trials<br />

for those with triggers <strong>and</strong> 2.8 out of 8 for those without<br />

triggers. For categorically ambiguous targets, participants<br />

visited the target category on the first fixation on 1.8 out of<br />

8 trials for those with triggers <strong>and</strong> 1.9 out of 8 for those<br />

without triggers. Clearly participants were able to quickly<br />

narrow their search down to one category immediately at<br />

least on some trials regardless whether the targets were<br />

considered categorically ambiguous or not. However, they<br />

were more likely to do so for categorically unambiguous<br />

targets.<br />

General Discussion<br />

The present f<strong>in</strong>d<strong>in</strong>gs suggest that l<strong>in</strong>k evaluation possibly<br />

<strong>in</strong>volves more than assess<strong>in</strong>g the semantic distance between<br />

the search target <strong>and</strong> each of the encountered l<strong>in</strong>ks. When<br />

the search target clearly <strong>in</strong>dicated its own category,<br />

participants evaluated fewer categories <strong>and</strong> spent less time<br />

on each. They were also more likely to fixate directly the<br />

1 Calculation of average fixation durations did not <strong>in</strong>clude the<br />

last fixation made dur<strong>in</strong>g the first visit to the top-level category<br />

menu as it is <strong>in</strong>flated by the time to make manual selections.


target category, bypass<strong>in</strong>g l<strong>in</strong>k evaluation altogether. These<br />

f<strong>in</strong>d<strong>in</strong>gs are consistent with the possibility that participants<br />

generated their own category names through top-down<br />

process<strong>in</strong>g prior to carry<strong>in</strong>g out the search process.<br />

Given the equivalent <strong>and</strong> overall fast selections <strong>in</strong> the two<br />

category-unambiguous conditions, it is reasonable to believe<br />

that participants generated their own category names even <strong>in</strong><br />

the absence of them based on their knowledge of the<br />

descriptions <strong>and</strong> the <strong>in</strong>formation architecture. In contrast,<br />

use of a bottom-up strategy seems implausible for the faster<br />

selection times (i.e. < 3 seconds). After account<strong>in</strong>g for page<br />

refresh, mouse movement <strong>and</strong> l<strong>in</strong>k selection, little time<br />

rema<strong>in</strong>s for an item-by-item evaluation of the category<br />

labels, which must take at least 230 ms per l<strong>in</strong>k to account<br />

for the time needed for each eye movement (Card, Moran,<br />

& Newell, 1983).<br />

The present f<strong>in</strong>d<strong>in</strong>gs also suggest that the presence of<br />

category names facilitates category selection at least when<br />

the search target does not have a clear category selection. In<br />

these cases, the participants may still use the more efficient<br />

top-down strategy, rely<strong>in</strong>g on the category name <strong>in</strong> the<br />

description to perform the visual search for its physical<br />

properties. Category ambiguity still <strong>in</strong>curs a greater time<br />

cost, possibly to confirm category membership or consider<br />

alternate categories before mak<strong>in</strong>g the selection.<br />

Not surpris<strong>in</strong>gly, the trial number was a significant factor<br />

for the amount of time used to select a l<strong>in</strong>k. In particular, the<br />

selection time decreased with the number of trials. This<br />

result is consistent with <strong>in</strong>creased use of top-down strategies<br />

as users become more familiar with the categories <strong>and</strong> their<br />

location on the page.<br />

The presence of efficient top-down strategies has<br />

implications for the effective design of web pages <strong>and</strong> menu<br />

systems. In applied web design usability research, the<br />

superord<strong>in</strong>ate category names are sometimes referred to as<br />

“trigger words” to signify <strong>in</strong> a post-hoc manner their be<strong>in</strong>g a<br />

user’s <strong>in</strong>ternally formulated search goal <strong>and</strong> trigger<strong>in</strong>g a<br />

selection response from the user (Spool, Perfetti, & Brittan,<br />

2004). To the extent that users form key “trigger words” <strong>and</strong><br />

then scan for their physical properties, care must be taken to<br />

<strong>in</strong>clude these words <strong>in</strong> the selection labels. Otherwise, users<br />

may miss the targeted selection even if its label is highly<br />

relevant to the user’s <strong>in</strong>formation goal. This consideration is<br />

consistent with Spool’s advice of identify<strong>in</strong>g common<br />

trigger words for users <strong>and</strong> <strong>in</strong>clud<strong>in</strong>g them <strong>in</strong> l<strong>in</strong>k labels.<br />

Certa<strong>in</strong>ly any postulation of dichotomy is prone to<br />

oversimplification. The question is whether mak<strong>in</strong>g such a<br />

dist<strong>in</strong>ction is useful <strong>in</strong> the underst<strong>and</strong><strong>in</strong>g of different<br />

mixture of processes that may go on <strong>in</strong> l<strong>in</strong>k-based search.<br />

Although computational model<strong>in</strong>g <strong>in</strong> web search has seen<br />

great improvements, empirical <strong>in</strong>vestigation of the issue <strong>in</strong><br />

our op<strong>in</strong>ion is still lack<strong>in</strong>g. The present research represents a<br />

step <strong>in</strong> f<strong>in</strong>d<strong>in</strong>g factors that elicit top-down process<strong>in</strong>g <strong>and</strong><br />

measures that <strong>in</strong>dicate when top-down process<strong>in</strong>g occurs.<br />

1853<br />

Acknowledgments<br />

This work was funded by grant NNA06CA99A from<br />

NASA’s Human Research Program. We are <strong>in</strong>debted to Jim<br />

Lofton who developed the "UI Nav Test" web application,<br />

which we used to adm<strong>in</strong>ister our navigation studies. We also<br />

thank Paul Roth, Mike Niebl<strong>in</strong>g, Ben Burton, Nils Hanson,<br />

Dev<strong>in</strong> Carter <strong>and</strong> Anna Eskra for test<strong>in</strong>g <strong>and</strong> comment<strong>in</strong>g<br />

on prelim<strong>in</strong>ary versions of the web application, Jaymie<br />

Massoletti for assistance with data analysis, Usha<br />

Viswanathan for participants recruit, <strong>and</strong> Joel Lachter for<br />

equipment setup.<br />

References<br />

Card, S. K., Moran, T.P. & Newell, A. (1983). The<br />

Psychology of Human-Computer Interaction. Hillsdale,<br />

NJ: LEA.<br />

Fleetwood, M. D., & Byrne, M. D. (2006). Model<strong>in</strong>g the<br />

visual search of displays: A revised ACT-R model of icon<br />

search based on eye-track<strong>in</strong>g data. Human-Computer<br />

Interaction, 21, 153-197.<br />

Fu, W.-T., <strong>and</strong> Pirolli, P. (2007). SNIF-ACT: A cognitive<br />

model of user navigation on the World Wide <strong>Web</strong>.<br />

Human-Computer Interaction, 22, 355-412..<br />

Hornof, A. J. (2004). Cognitive strategies for the visual<br />

search of hierarchical computer displays. Human-<br />

Computer Interaction, 19, 183-223.<br />

Kitajima, M., Blackmon, M.H., <strong>and</strong> Polson, P.G. (2000). A<br />

comprehension-based model of web navigation <strong>and</strong> its<br />

application to web usability analysis. In S. McDonald, Y.<br />

Waern, & G. Cockton (Eds.), People <strong>and</strong> Computers XIV<br />

– Usability or Else: Proceed<strong>in</strong>gs of HCI 2000 (pp. 357-<br />

373). London: Spr<strong>in</strong>ger.<br />

Miller, C. S., Fuchs, S., Anantharaman, N., & Kulkarni, P.<br />

(2007). Compar<strong>in</strong>g two methods for predict<strong>in</strong>g navigation<br />

problems <strong>in</strong> <strong>in</strong>formation hierarchies. Human Factors <strong>and</strong><br />

Ergonomics Society Annual Meet<strong>in</strong>g Proceed<strong>in</strong>gs (pp.<br />

363-367). Santa Monica, CA: HFES.<br />

Miller, C. S., <strong>and</strong> Rem<strong>in</strong>gton, R.W. (2004). Model<strong>in</strong>g<br />

<strong>in</strong>formation navigation: Implications for <strong>in</strong>formation<br />

architecture. Human-Computer Interaction, 19, 225-271.<br />

S<strong>in</strong>ger, J. D. (1998). Us<strong>in</strong>g SAS PROC MIXED to fit<br />

multilevel models, hierarchical models, <strong>and</strong> <strong>in</strong>dividual<br />

growth models. Journal of Educational <strong>and</strong> Behavioral<br />

Statistics, 23, 323-355.<br />

Spool, J. M., Perfetti, C., & Brittan, D. (2004). Design<strong>in</strong>g<br />

for the scent of <strong>in</strong>formation. North Andover, MA: User<br />

Interface Eng<strong>in</strong>eer<strong>in</strong>g.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!