32 4. FACETED FRAMEWORK OF INTERACTIVE IR USER STUDIES
goals and assessed the eectiveness of the innovative retrieval and ranking algorithms in improving
participants’ word-learning outcomes.
User studies that manipulate system and/or interface features can oer empirical basis for
IIR system evaluation and also shed light on the development of search systems and the design
of user interfaces in various settings (e.g., home setting, workplace, public and school libraries). In
addition to these subfacets, system features also involve other important aspects, such as the devices
on which a search system can operate (e.g., desktop computer, tablet, smartphones), test collections
and corpora. Particularly, some IIR research focus on the dierences in user search behavior across
dierent types of devices and suggest that users often adjust their search strategies according to
some specic interface features (e.g., search result representation, SERP size, information scent
level) of the systems running on dierent devices (e.g., Harvey and Pointon, 2017; Ong et al., 2017;
Song et al., 2013). ese studies can provide actionable design implications for system and interface
designers and facilitate user-centered evaluations of personalized search systems.
4.1.4 STUDY PROCEDURE AND EXPERIMENTAL DESIGN
As Figure 4.2 illustrates, the procedure of an IIR user study is largely aected by the proposed re-
search questions, externally assigned or self-generated task types, user characteristics, as well as the
system and interface features manipulated or created by researchers. Based on the proposed research
questions and the related variables, researchers need to decide other components and steps of the
user study and to gure out the design decisions and compromises they have to make in order to
properly address the research problems with available resources.
Due to the nature of dierent research questions, some of the user studies adapted experi-
mental design from psychology and applied the associated methods in studying the search behaviors
of users with dierent characteristics and in evaluating innovative search systems. e treatments
in IIR user studies often involve the main facet(s) or core factors, such as task facet and task type
(e.g., factual task vs. intellectual task), user group (e.g., expert group vs. novice group), and system
features (e.g., experimental system group vs. baseline system group). To create a suitable, reliable
experimental setting, researchers have to not only design and rene the treatment condition(s), but
also think carefully about how to dene a reasonable baseline condition. us, the justication of
a user study (especially experimental) design should detailedly describe and explain both the treat-
ment part and baseline part. On top of that, researchers also have the obligation to come up with
a reasonable ground truth if the user study involves user-centered evaluation of any kind, such as
interface evaluation, new search tool evaluation, and ranking algorithm evaluation.
In addition to the basic components discussed above, a user study researcher also needs to
adjust the values of other subfacets associated with the study procedure and experimental design.
For instance, as a preparation work for many user studies, pilot study and pre-study training or
tutorial should be properly reported and discussed in user study papers. ese pre-study parts are