Kahn Research Group - Validity 101
free knowledge > validity 101
 

Note: This essay does not reflect our in store offerings. While we hope to update it soon, it is now a part of graduate and undergraduate curriculum at over 30 universities worldwide, leading us to decide to post it as is for the time being.

Validity 101 – What it is, why it is essential, where others fail, and how we succeed

What it is

Validity is the most important concept in research, determining whether or not the findings have any value in the real world. Without validity, research is futile, resulting in useless information that serves no purpose. The heart of validity lies in the methodology; research methods lacking validity produce invalid results, regardless of how diligently the methodology is followed.

There are a number of different types of validity, but for our purposes we will focus on two, internal and external. Internal validity refers to the relationship among variables, essentially whether or not a cause and effect relationship can be established. While there are numerous threats to internal validity, two are particularly relevant to behavioral research/usability testing, selection and experimenter bias. Selection refers to the sample used, becoming a threat when participants are not chosen at random. Experimenter bias comes into play when the experimenter must evaluate the subject but is unable to do so objectively and either overestimates or underestimates the participant in some way.

External validity refers to whether or not the findings are applicable in the real world, and it can be compromised by almost anything – improper sampling, research setting, and pretest knowledge to name a few. One notable threat to external validity is the Hawthorne effect, a term used to describe the changes in a subject’s behavior when he or she is aware that he or she is being observed. In essence, the Hawthorne effect tells us that people do not behave in the same manner when they know they are being watched as compared to when they do not know they are being watched. This concept is crucial in observational research, as it presents a huge threat to validity.

Why it is essential

A research methodology must maintain both internal and external validity for both it and its results to have any value. While conditions surrounding a research study often require that internal validity be sacrificed for external validity (and vice-versa), every effort must be made to maintain as much validity as possible. Every compromise deflates the value of the research, eventually resulting in useless data.

Where others fail

Most usability testing/customer experience evaluation methodologies suffer from serious validity problems, both internal and external.

The problems start before the research does; participants are not selected at random, but culled from records of people who have signed up in search of incentive, or rewards for participation. To start, these participants had to go through the process of signing up online, a time-consuming process that requires a certain degree of technological know-how in addition to a patient temperament. The simple fact that they have the time to do this, nonetheless go to the lab for the testing, suggests that they are far from the average sample of Internet users. Some at-home non-observational testing where participants are asked a series of questions while browsing a site requires participants to change their proxy settings, a process that the average internet user would be at least uncomfortable doing, and likely unable. This leads to a very skewed sample of tech-savvy users, not the ideal participant for testing of this type.

The next validity problem plaguing all other usability/customer experience firms is the lab setting, an unnatural and foreign environment for participants. First, their motivation for visiting the site is very different from that of the normal site visitor because they are there for the incentive, not a need to shop. This motivational difference has profound effects on the subjects’ behavior, altering the speed at which they move through a site as well as how they approach the entire exercise. Additionally, the labs are equipped with different hardware than what they are accustomed to, and, as anyone who has ever borrowed someone else’s computer will tell you, this change is quite disorienting, leading to altered behavior.

Finally, the Hawthorne effect runs rampant in these evaluations. The subjects know they are being observed and approach the entire task in a different manner than they normally would. Attention is more focused, speed increases, more attempts are made before giving up, and careless mistakes such as typos decrease. When subjects interact with experimenters during the evaluation, they often attempt to please the experimenter by conforming to expectations and being overly complimentary. The same problem can impair the experimenter as well, preventing him or her from conducting an objective assessment.

These validity issues plague all usability testing/customer experience firms. All, that is, except Kahn Research Group.

How we succeed

Kahn Research Group addresses all of these validity issues with a methodology that cannot be beat. To start, we do not rely on just one source of data, but three: surveys, site logs, and observational data. This proprietary methodology (patent pending), the Triangulation Method, provides the most accurate view of the user experience available today.

The Triangulation Method

Surveys

We start by designing two questionnaires, one for successful site visitors (purchase or registration or however you define success) and one for unsuccessful visitors. We then integrate these questionnaires into your system, programming them to “pop-up” when visitors either complete a purchase or leave the site. The questionnaires appear at random, generating a random, and hence valid, sample. To increase participation rates and honesty, an incentive is offered for completing the survey.

Site logs

Additionally, we mine the site logs for data on our respondents, again ensuring a random sample. We examine this log extraction for numerous data points, looking at, for example, time spent on pages, pathways through the site, and numerous other variables. By tying this data to the attitudinal data, we can understand how behavior differs by psychographic, demographic, and technological profile, as well as any other factor chosen.

Observational Data

The observational research, however, is where we really shine above our competitors. We utilize one of two methodologies depending upon the goals of the study and resources available to the client.

Our first method eliminates the Hawthorne effect, and, while it does take place outside of the subject’s home or office, it is conducted in an environment that is far less foreign than a lab. Rather than bring people into a lab, we bring the lab to the people, setting up web-enabled kiosks in locations appropriate for the client. For example, a bank’s informational website would be tested from within a bank branch, a travel site in an airport, or a sports-related site at a sporting event. With cameras focused to capture body, facial, and vocal language, the subject’s behavior is documented unbeknownst to him or her. In addition to providing the most valid observational data available, this method has a very positive side effect: it promotes the site directly to your target audience. When the subject is done interacting with the website he or she is intercepted and surveyed on the experience.

Our second method eliminates the lab setting concerns while minimizing the Hawthorne effect by going out and observing people using the site in their natural setting, be it home or office. Participants are chosen from registration information (ensuring a sample of actual users), by selecting demographically appropriate people that have never used the site before, or both. Due to the somewhat invasive nature of this approach, a large incentive is offered in exchange for participation. The subject is recorded for one hour while the researcher leaves the location. By leaving the premises, the researcher significantly reduces the impact of the Hawthorne effect while increasing the comfort level of the subject, thereby increasing the validity of the results as well. Upon returning to the subject’s home or office, the researcher conducts a long, open-ended interview in order to ascertain the participant’s perception of the site.

Both methods have distinct advantages and disadvantages, and one is usually more appropriate for a site than the other. Our first method is particularly effective if you have a targeted audience that can be found at a particular location. However, service-oriented sites such as online banking and online stamps might be better served if observed in the place where they are most frequently used. The cost of in-home or in-office testing must be considered as well; with a $50 or more incentive being offered, the number of subjects should be kept to a minimum, resulting in data that is more qualitative than quantitative. Our kiosk method, in contrast, is relatively inexpensive, even free when exit interviews are not conducted, and yields both qualitative and quantitative data. But either way, our methodology yields the most valid usability/customer experience findings available.

 

 

 

Kahn Research Group - Validity 101
 
 

| Home | In-store Behavioral Tracking | In-store Videography | In-store Attitudinal Research | Online Behavioral Research | Online Attitudinal Research | Online Observational Research | Expert Evaluations & Consulting | Free Knowledge | Contact Us | About Us |

© 2000 Kahn Research Group LLC | All rights reserved.