10 Evaluation Flashcards

1
Q

Why Evaluate?

A

Well-designed products sell
To ensure that the system matches the users' needs
To discover unforeseen problems
To compare your solution against competitors ("We are x% better than…")

2
Q

Where to Evaluate?

A

Naturalistic Approach: Field Studies

Usability Lab

3
Q

When to Evaluate, and Who Evaluates When?

A

Evaluation should happen throughout the entire software development process

Early designs: evaluated by the design team, analytically and informally
Later implementations: evaluated by users, experimentally and formally

4
Q

Evaluation Methods

A
  1. Determine the Goals
  2. Explore the Questions
  3. Choose the Approach and Methods
  4. Evaluate, Interpret & Present Data
5
Q

Important aspects in creating an evaluation process?

A

Reliability: can the study be replicated?
Validity: is it measuring what you expected?
Biases: is the process creating biases?
Scope: can the findings be generalized?
Ethics: are ethics standards met?

6
Q

External vs Internal Validity

A

External validity
-> confidence that results apply to real situations
-> usually good in natural settings
Internal validity
-> confidence in our explanation of experimental results
-> usually good in experimental settings

7
Q

Ethics Approval

A

Researchers must respect the safety, welfare, and dignity of human participants in their research and treat them equally and fairly

Criteria for approval:

  • research methodology
  • risks or benefits
  • the right not to participate, to terminate participation, etc.
  • the right to anonymity and confidentiality
8
Q

Ethics - Before the test (5 things)

A
Only use volunteers
Inform the user
Maintain privacy
Make users feel comfortable
Don’t waste the user’s time
9
Q

Ethics - During the test (4 things)

A

Maintain privacy
Make users feel comfortable
Don’t waste the user’s time
Ensure participant health and safety

10
Q

Ethics - After the test

A

Inform the user
Maintain privacy
Make users feel comfortable

11
Q

Usability Testing

A

Focus on: how well users perform tasks with the product (time to complete a task and number & type of errors)

-> Controlled environmental settings

12
Q

Signal & Noise Metaphor

A

Experiment design seeks to enhance the signal (the variable of interest)
while minimizing the noise (everything else, i.e., random influences)

13
Q

Controlled Experiment: Steps

A
  1. Determine the goals, explore the questions, then formulate hypothesis
  2. Design experiment, define experimental variables
  3. Choose subjects
  4. Run pilot experiment
  5. Iteratively improve experiment design
  6. Run experiment
  7. Interpret results to accept or reject hypothesis
14
Q

Experimental Variables

A
  • Independent Variables
  • Dependent Variables
  • Control Variables
  • Random Variables
  • Confounding Variables
15
Q

Independent Variable - Definition & Examples

A

An independent variable is under your control
Independent because it is independent of participant behavior

Interface, device, button layout, visual layout, feedback mode, age, gender, background noise, expertise, etc.

Must have at least two levels (values/settings) -> test conditions

16
Q

Dependent Variable - Definition & Examples

A

measured human behavior, depends on what the participant does
is measured during the experiment

Task completion time, speed, accuracy, error rate, throughput, target re-entries, task retries, presses of backspace, etc.

17
Q

Control Variable - Definition & Examples

A

a circumstance that is kept constant

more control -> less variability, less generalizable

18
Q

Random Variable - Definition & Examples

A

circumstance that is allowed to vary randomly -> more variability (bad), but more generalizable

19
Q

Confounding Variable - Definition & Examples

A

circumstance that varies systematically with an independent variable

20
Q

Experiment Task - Good Task Qualities:

A

Represent activities people do with the interface

Discriminate among the test conditions

21
Q

Hypothesis vs Claim

A

A claim predicts the outcome of an experiment
Example: Reading a text in upper case takes longer than reading it in sentence case
A hypothesis claims that changing independent variables influences dependent variables
Example: Changing the case (independent variable) influences reading time (dependent variable)

-> Experiment goal: confirm the hypothesis
-> Statistical approach: reject the null hypothesis
22
Q

Statistical Tests - 2 Types

A

Parametric
-> Data are assumed to come from a distribution, such as the normal distribution, t-distribution, etc.
Non-parametric
-> Data are not assumed to come from a distribution

23
Q

Statistical Tests - Which test for nominal and ordinal (gender, age groups, …)

A

Non-parametric tests (e.g., Chi-square test)
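As a sketch of what such a test computes, the chi-square statistic for a contingency table can be worked out by hand; the counts below are invented for illustration, and in practice a library routine such as scipy.stats.chi2_contingency would be used:

```python
# Chi-square test of independence for a 2x2 contingency table,
# implemented from scratch for illustration. Counts are made up.
observed = [
    [30, 10],  # e.g., group A: success / failure
    [20, 20],  # e.g., group B: success / failure
]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

# Sum (observed - expected)^2 / expected over all cells.
chi2 = 0.0
for i, row in enumerate(observed):
    for j, obs in enumerate(row):
        expected = row_totals[i] * col_totals[j] / grand_total
        chi2 += (obs - expected) ** 2 / expected

print(round(chi2, 3))  # compare against the chi-square distribution
```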

24
Q

Statistical Tests - Which test for Interval and Ratio (temperature in C or K, …)

A

Parametric tests (e.g., t-test, ANOVA), or Non-parametric tests
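For interval/ratio data, the core of a t-test can be sketched with the standard library alone (Welch's two-sample formulation; the task-completion times below are made up, and a real analysis would also compute degrees of freedom and a p-value):

```python
# Two-sample t statistic (Welch's), standard library only.
# Hypothetical task-completion times in seconds for two interfaces.
from statistics import mean, variance

a = [12.1, 11.8, 13.0, 12.5, 11.9]  # interface A
b = [13.4, 13.1, 14.0, 13.6, 13.2]  # interface B

# t = difference of means over the standard error of that difference.
t = (mean(a) - mean(b)) / ((variance(a) / len(a) + variance(b) / len(b)) ** 0.5)
print(round(t, 2))
```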

25
Q

too few vs too many participants?

A

Too few: experimental effects fail to achieve statistical significance
Too many: statistical significance even for very small effect sizes

26
Q

Within-subjects, Between-subjects

A

Within-subjects: each participant is tested on each condition
Between-subjects: each participant is tested on one condition only

27
Q

Order Effects and how to avoid them

A

Order effects / learning effects can occur when the same participant is doing a similar task multiple times

-> only relevant for within-subject factors

Avoid them by:

  • participants are divided into groups, with different orders for the test conditions (Latin square)
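The Latin-square idea can be sketched in code; the construction below is the standard balanced Latin square for an even number of conditions (the condition labels A-D are hypothetical):

```python
# Balanced Latin square for counterbalancing condition order across
# participant groups (standard construction for an even number of
# conditions: each condition appears once per row and column, and
# precedes every other condition equally often).
def balanced_latin_square(conditions):
    n = len(conditions)
    # First row follows the pattern 0, 1, n-1, 2, n-2, ...
    first, lo, hi = [0], 1, n - 1
    for i in range(1, n):
        if i % 2 == 1:
            first.append(lo)
            lo += 1
        else:
            first.append(hi)
            hi -= 1
    # Each later row shifts every index by the group number (mod n).
    return [[conditions[(x + r) % n] for x in first] for r in range(n)]

for order in balanced_latin_square(["A", "B", "C", "D"]):
    print(order)
```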
28
Q

Longitudinal Studies

A

research that seeks to promote and investigate learning

-> practice is the independent variable

29
Q

Analytical Evaluation Methods (2)

A

heuristic evaluation

cognitive walkthrough

30
Q

Golden rules of UI design

A
  1. Keep the interface simple
  2. Speak the user's language
  3. Be consistent and predictable
  4. Make things visible and provide feedback
  5. Minimize the user's memory load
  6. Design for error: Avoid errors, help to recover from errors, offer undo
  7. Design clear exits and closed dialogs
  8. Include help and documentation
  9. Offer shortcuts for experts
  10. Make the system responsive
31
Q

Heuristic Evaluation- How many evaluators?

A

3-5 evaluators

32
Q

Cognitive Walkthrough

A

Experts “walk” through the design prototype with usage scenario(s)

Experts analyze each task following 3 questions:

  1. Will the correct action be sufficiently evident to the user?
  2. Will the user notice that the correct action is available?
  3. Will the user associate and interpret the response from the action correctly?
33
Q

Model-Based Evaluation - 3 Examples

A

GOMS
Keystroke Level Model ("daughter" model of GOMS)
Fitts' Law
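Fitts' law is simple enough to show directly; this sketch uses the Shannon formulation MT = a + b · log2(D/W + 1), with made-up values for the regression coefficients a and b:

```python
from math import log2

def movement_time(d, w, a=0.1, b=0.15):
    """Predicted movement time (s) for distance d and target width w.
    a and b are illustrative values normally fit by linear regression."""
    return a + b * log2(d / w + 1)

# A far, small target takes longer to acquire than a near, large one.
print(round(movement_time(256, 16), 3))  # index of difficulty log2(17)
print(round(movement_time(64, 32), 3))   # index of difficulty log2(3)
```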

34
Q

GOMS - Name and Main Principle

A

uses a model of execution times for basic tasks to predict how long a sequence of actions takes

GOMS = Goals, Operators, Methods, Selection rules

(Selection rules decide which method to select when there is more than one)

35
Q

Keystroke Level Model

A

refinement of GOMS that provides a quantitative model of execution times
assigns each operator a context-independent average duration
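A minimal sketch of a KLM prediction, using the classic operator durations from Card, Moran & Newell; the task encoding at the end is invented for illustration:

```python
# Keystroke-Level Model: sum context-independent average operator
# durations to predict execution time for an error-free expert task.
KLM = {
    "K": 0.2,   # keystroke (skilled typist)
    "P": 1.1,   # point at a target with the mouse
    "H": 0.4,   # home hands between keyboard and mouse
    "M": 1.35,  # mental preparation
    "B": 0.1,   # press or release the mouse button
}

def predict_time(operators):
    """Total predicted time (s) for a string of KLM operator codes."""
    return sum(KLM[op] for op in operators)

# e.g., move hand to mouse, think, point at a menu item, click (press+release)
print(round(predict_time("HMPBB"), 2))
```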