Skip to main content

How do I validate Strength Score using Power BI?

Evaluate whether Strength Score accurately reflects workflow quality using real production data.

Written by Lauren Baird
Updated this week

Answer

Validate Strength Score in Power BI using real production submissions. The goal is not just to confirm that a score exists. The goal is to confirm that the score actually reflects workflow quality in a way that makes sense for your program.

A good validation process checks whether:

  • high-quality submissions score higher

  • rushed or low-effort submissions score lower

  • the score distribution feels reasonable

  • the results match what reviewers would expect

Because Strength Score does not appear in ANVL Insights, validation must be done in Power BI after the workflow has been live in production long enough to collect meaningful data.


Steps

Confirm the workflow is ready for validation

  1. Make sure Strength Score is already configured on the workflow.

  2. Make sure the workflow is active in a production site.

  3. Make sure users are completing the workflow as part of normal work.

  4. Do not try to validate Strength Score using only UAT or test submissions.

Allow real usage time

  1. Let the workflow run in production for about 2 weeks.

  2. Allow normal user behavior during this time.

  3. Try to collect enough submissions to see variation in user behavior and completion quality.

Strength Score needs real behavior patterns to be meaningful.

Review the score distribution in Power BI

  1. Open the Trending Metrics or Weekly Summary report in Power BI.

  2. Filter to:

    • the specific workflow

    • the desired time range

    • the relevant site or sites

  3. Review the overall distribution of scores.

  4. Look for:

    • high scores

    • mid-range scores

    • low scores

You are looking for patterns, not perfection.

Sample real workflows and compare quality

  1. Select a sample of workflows from each score range:

    • about 5 high-scoring workflows

    • about 5 mid-range workflows

    • about 5 low-scoring workflows

  2. Manually review those workflows.

  3. Ask:

    • do the high scores match clearly strong submissions?

    • do the low scores match rushed or incomplete submissions?

    • do the mid-range scores feel reasonable?

    • is the score rewarding the behavior you intended to encourage?

Look for misalignment

  1. Identify whether the score is behaving in a way that does not match expectations.

  2. Common warning signs include:

    • strong workflows scoring too low

    • weak workflows scoring too high

    • scores clustering too tightly with little variation

    • one behavior dominating the score too heavily

Adjust and re-evaluate if needed

  1. Adjust weights first if the score is overemphasizing the wrong behavior.

  2. Adjust parameter thresholds if they are too strict or too lenient.

  3. Avoid adding more parameters unless needed.

  4. Republish the workflow if changes are made.

  5. Allow more real usage time.

  6. Review the updated results again in Power BI.

Iteration is normal.

What good looks like

  1. Score ranges feel intuitive.

  2. High-quality completion is consistently rewarded.

  3. Low-effort submissions score meaningfully lower.

  4. Reviewers and stakeholders trust the metric.

Strength Score does not need to be perfect to be useful, but it should feel fair and credible.


Did this answer your question?