Skip to main content

Reporting on Agentic Campaigns

Written by Charley Bader

The Agentic Report is your live view of how the campaign is performing, how the agent is distributing traffic, and where performance is being created across audiences and content.

Use it to answer three questions:

  • Is it working?

  • How reliable is the performance signal?

  • Where should traffic be focused next?

For Agentic Campaigns, you are also able to see our standard campaign reporting, at a global level and in real-time.


The Three Tabs

Overview

Impact, trends, traffic allocations, and variant outcomes. This is your go-to view for understanding results.

Insights

A deeper view of how the agent is making decisions, and where performance is being created across your audience.

Activity

A log of every optimisation run, with transparent reasoning behind each decision the agent made.


Recommended Reading Order

  1. Header - check the phase and what the agent is optimising for

  2. Performance - see top line metrics about how your campaign is performing

  3. Performance Breakdown - expand to see uplift metrics and revenue projections

  4. Allocations - see where the agent is routing traffic

  5. Secondary Goal Effects - confirm nothing important is going backwards

  6. Insights tab - when you want to go deeper on why the agent is making the decisions it is

  7. Activity tab - when you want to see the run log


Key Terms

Control: Baseline visitors who didn't receive a personalised experience.

Experience: Visitors who were shown a personalised experience.

Phase: Where the agent is in its learning lifecycle: New → Broad Exploration → Narrowing In → Optimisation → Always On.

Probability to Beat Control: The likelihood that the observed uplift is real, not just noise.

Certainty: A measure of how confident the model is in its current results, based on the volume and consistency of data collected so far.

Analysis Window: The time window the headline metrics are calculated over (default: All Time).

Annual Projected Uplift: The projected annualised value of the campaign. Hidden until the selected window is more likely than not to beat control.

Learning Milestone: A point in the trend chart marking when the agent crossed a meaningful learning threshold.

Credible Interval: The range within which the true value of a metric most likely falls, based on current data. Early in a campaign the range will be wide, reflecting limited certainty. As more data is collected, the range narrows.


Detailed Reporting Pages & Features

Campaign Details

The top of the Overview tab summarises how the campaign was set up and where the agent is in its learning:

  • Agent Strategy - the strategy the campaign was set up with (e.g. Experience Optimisation).

  • Optimising For - the primary goal the agent is optimising against (e.g. Revenue Per User).

  • Days Live - how long the campaign has been running.

  • Experience Users - total number of users who have received an experience.

  • Phase - where the agent is in its learning lifecycle, with an estimate of how long until it reaches Always On (shown while in Optimisation).

Below this row sits a plain-language description of what the current phase means, followed by a progress bar across the five phases: New → Broad Exploration → Narrowing In → Optimisation → Always On. These phases are based on volume of traffic in the first instance. In the absence of sufficient volume the phase will become based on length of time the campaign has been live.

Performance

The Performance section opens with a toolbar:

  • Performance Metric - the metric being shown (defaults to your primary goal; switch to view performance against another goal).

  • Analysis Window - the time window the metric is calculated over (default: All Time). You can also switch to look at a variety of time periods.

  • CSV download - exports the current performance view.

  • Window Days - number of days in the selected time period.

  • Probability to Beat Control - how likely it is that the observed uplift is real, not noise.

  • Certainty - how confident the model is in its current results.

Below the toolbar are headline cards comparing experience versus control:

  • Experience Users vs Control Users - how many visitors fell into each group.

  • Traffic Split - cumulative split across the selected analysis window period.

  • Primary Goal Metrics vs Control

The two primary goal cards change to match the primary goal - e.g. Orders / Conversion Rate for a conversion goal.

Below the cards is the Performance Metric trend chart - control vs experience, with Learning Milestone markers showing the points at which the agent crossed a meaningful learning threshold. anchored either on time live (e.g. after the first 7 days of learning, after the first 14 days) or on volume reached (e.g. after the first 25,000 users, after the first 250,000 users). You can switch the reporting to scope to any of these milestones - for example, measuring performance only from the moment the agent reached 25k users, or only across the last 14 days. Use them to strip out the noisier exploration period early in a campaign, or to compare like-for-like windows across campaigns at different stages of maturity.

Performance Breakdown

Click View Performance Breakdown under the trend chart to expand a fuller view of the selected analysis window. The breakdown is split into two columns.

Performance for the selected window:

  • Total Uplift with a credible interval range.

  • Relative Uplift with a credible interval range.

  • Performance Metric

  • Probability to Beat Control.

  • Certainty.

Alongside, Annual Projected Uplift turns the extra revenue already measured into a projection for the next 12 months. It stays hidden until the selected window is more likely than not to beat control. Once visible, it walks through three assumptions used to estimate extra revenue over the next 12 months.

  1. How much of a year did we measure? - defaults to the share of yearly traffic estimated from the calendar days measured. Refined once a full year of session history is available.

  2. How much traffic will be held back as control? - defaults to your control share minimum; everyone else sees the experience.

  3. How confident are you in the uplift? - defaults to the campaign's measured certainty. Raise for the full observed uplift, or lower for a more conservative estimate.

These can be edited manually to show how revenue projections vary in line with changes.

Allocations

Allocations are the proportion of eligible traffic directed to each experience. Unlike fixed split testing, these percentages adjust continuously as the agent gathers evidence.

The Cumulative Allocation % by Day chart shows each experience's share of traffic over time. The current allocation percentage for each experience is displayed alongside its name above the chart. For relevant strategies you can also toggle between experience and timing.

Seeing allocations shift is a normal, healthy sign that the agent is learning:

  • The agent increases share to experiences showing stronger performance.

  • It keeps some exploration active so alternatives can still be tested.

  • Daily shifts are expected as new evidence is incorporated.

Focus on trajectory, not isolated single-day movements. Expect more fluctuation during exploration, and more stability as the campaign matures into Optimisation.

Secondary Goal Effects

Secondary Goal Effects shows how the campaign is impacting other goal metrics beyond the primary goal.

At the top of the section are filter tabs by goal type: All, Transactional, Custom, Engagement, Funnel, Intent, Affinities.

Below the tabs is the table, with columns for Metric, Control, Experience, Relative Uplift, Probability to Beat Control, Certainty.

At the bottom of the table is the Show Credible Intervals toggle - switch this on to show the credible interval range alongside each metric instead of just the point estimate. This is the range within which the true value of a metric most likely falls, based on current data. Early in a campaign the range will be wide, reflecting limited certainty. As more data is collected, the range narrows.

This is also where transactional metrics like Conversion Rate and Revenue Per User appear when you're optimising for something else - they continue to be tracked here automatically. Each section of the report also has its own CSV download button at the top right.

Insights Tab

The Insights tab is where you go to understand why the agent is making the decisions it's making. The Performance section on the Overview tab tells you whether the campaign is working; Insights tells you who it's working for and which signals the agent is leaning on.

Use it when you want to:

  • Understand which user contexts are driving the agent's allocations

  • Spot audiences the agent has no strong preference for, a sign you may need to add a new variant aimed at them

  • See how confident the agent is in the decisions it's making for each experience

  • Find candidates for follow-up campaigns by identifying segments where one experience consistently wins

The Insights tab also surfaces the groupings the agent has chosen for itself. For example, if the agent has identified "Top Product Price Band Affinity" as a signal worth using, you'll see how it has bucketed users inside that signal (e.g. budget, mid-range), these groupings come from the agent, not from segments you've set up.

At the top of the tab is a filter icon - every panel below scopes to whatever audience you set. The total user count for the current filter appears on the right.

Quick Insights

Three quick-question cards. Pick one to drive the panel below:

  • Top Signals - These are the context points the agent has chosen as most important. Which user contexts drive allocations the most? Lists the strong signals: contexts where allocations swing significantly (for example Intent Stage, Shopper Mindset, Likelihood to Abandon Session). Use this when you want to understand what's really driving the agent's decisions, rather than what you assumed would matter.

  • Best Fit - Which users should get a given experience? Spotlight an experience and see the contexts where the agent allocates it most heavily versus baseline. Use this to identify the audience an experience performs best with, often a signal that there's a follow-up campaign worth running for that segment specifically.

  • No Preference - Which users don't resonate with anything we've shown them? Audiences where allocation is split close to evenly across experiences, a signal that none of your variants are landing strongly. Use this to find audiences where you may need to add a new variant.

Allocations vs Baseline

Spotlight an experience and compare its allocations against the campaign baseline across each user context. The table shows Context, % Users, Best Experience, Allocation, and the difference vs Baseline.

Allocation Decision Confidence

For each experience served, how confident the model was that it was the best choice for that user. Distribution is bucketed across Low, Moderate, and High.

Agent Activity Tab

A transparent log of what the agent has done within the campaign.

Agent Activity Summary

Three summary cards at the top:

  • Agent Runs - total optimisation cycles.

  • Agent Decisions - total tasks performed across all runs.

  • Est. Time Saved - an estimate of the manual analyst time the agent has saved.

Activity Log

A detailed breakdown of every activity the agent has run, listed most-recent first. Each entry shows the activity name, a short description, an optional "time saved" estimate, and a timestamp.

Did this answer your question?