In this article we'll take a look at the different components that make up the experiment health score
Understanding Your Experiment Scorecard
Your experiment health score is designed to track critical parts of the experimentation program and to ensure that the most important principles are being adhered to. This ranges from having proper sources for your experiments, prioritising and hypothesising at the right stages and not tampering with experiments post-completion.
A Multi-Stage Scoring System
There are 3 stages in the typical experiment's lifecycle and that is; pre-live, live and post-live. The experiment health score is focused on 2 of these phases:
Pre-Live Scoring (Total: 60 points)
Here we track all your planning, prioritisation and critical info added to your experiment and the scoring goes as follows:
[Coming Soon] Objectives preset
Linked to objectives & targets: 10,
Retrospectively linked to objectives & targets: 5,
No linking (orphaned experiment): 0
Source of Experiment
Research: 10
Ideas: 5
Ad-hoc: 0
Prioritisation Completed
Fully done: 10,
Partially done: 5,
Not done at all: 0
Experiment Planning
Clear dates defined and went to plan: 10,
Clear dates defined but didn't follow that plan: 0,
Hypothesis Added
Added anytime during ideas & hypothesis stage: 10,
Added later once experiment has already been created: 5,
Process Adherence
Followed kanban flow and process: 10,
Skipped various statuses: 5,
Off course (kept going back and forth): 0
Post-Live Scoring (Total Score: 40 points)
This stage is all about capturing learnings & recommendations and ensuring winning experiments get implemented
Analysis (Insights & Learnings)
Present: 5,
Absent: 0
Next Steps (Recommendations)
Present: 5,
Absent: 0,
Experiment Completion
Ended on or days within expected end-date: 10,
Ended too early: 5,
Ended significantly earlier than expected end-date: 0
Rollouts
Set as Implemented or Not Implemented: 10,
Set as Pending or Not Sure: 5,
Not Set: 0,
Dependencies
Finished without unresolved blockers: 10,
Finished with unresolved blockers: 0
No dependencies: Skip (auto-10)
Thus, an experiment that perfectly followed process and best practice, will have the ability to reach a 100/100 score
Tracking High Alert Integrity Issues
Introducing the new integrity checker. With the sole focus of improving your overall experiment quality, the integrity checker will point out some critical issues that might compromise the integrity of your experiment as well as warning you not to tamper with finalised experiments
Pre-Emptive Warning
As you're about to edit or update a finalised experiment, a warning will pop up warning you that this might affect the integrity of your experiment.
Post-Completion Changes
Once an experiment is finalised, it's highly recommended not to make any further changes as to avoid corrupting the integrity of your data. Any changes to key fields made once an experiment is finalised, will result in point deductions:
Changed or altered hypothesis: -10,
Changed, removed or altered goals: -10,
Changed the already set outcomes: -10,
Removing previously added Insights or Recommendations: -10,
Changed, replaced or removed variation screenshots: -10
Highlighting Critical Integrity Issues
HARKing
To ensure the integrity of your experiment, your hypothesis should never change once the experiment goes live. It's for this reason that we will check for changes made to your hypothesis.
You will be warned that possible Hypothesising After Results Are Known has happened. This will mark your experiment with an integrity issue and subtract from your overall health score
Peeking
We all get excited when our experiment starts out really promising with an increase in conversion of 150%. However... It's never not how the experiment starts out, but how it finishes.
Your experiment needs to run long enough to reach a significance level to ensure that your results are no fluke. It's for this reason that we will check how early you finished your experiment
You will be warned that possible Result Peeking has happened. This is caused when an experiment stops significantly earlier than anticipated. Your experiment will be marked with an integrity issue and subtract from your overall health score
Instantly Spot Problematic Areas
Should any changes be made to the critical fields, as mentioned above, not only will the experiment be deducted but you'll also be able to see the field changes as visualised on your Experiment Timeline. This will help to track, train and reinforce best practices throughout your team.