Insights

The Hidden Cost of Off-the-Shelf Simulations

Off-the-shelf simulations can deliver a good day in the room and still fail three months later. The reason is structural — and it's identifiable before you commission.

For most L&D teams, off-the-shelf looks like the responsible default. A proven product. Known pricing. Facilitators who have run it before. No three-month development cycle, no bespoke price tag, no risk of commissioning something that arrives and doesn't work.

You can book it in for next quarter and have the participant packs by the time the invoice clears. And in the room, off-the-shelf usually delivers. Participants engage. The debrief surfaces useful reflections. The post-session feedback scores look healthy. By any reasonable measure, it was a good day. The measure that matters, though, is the one nobody audits — what participants actually do differently three months later.

For a specific class of training objective, the honest answer is: not much.

The appeal is real

It's worth being straight about why off-the-shelf is popular. A lot of it is earned. Mature off-the-shelf simulations have been refined across hundreds of deliveries — the facilitator script is tight, the timings are reliable, the common failure modes have been designed out. For some programmes, that operational polish is genuinely the product you want. This article isn't an argument against the category.

It's an argument about where the category's hidden cost lives, and why that cost is invisible until it's already been paid.

Behaviour change is cued by context, not content

The finding at the centre of learning-transfer research — running from Thorndike's work on identical elements at the start of the twentieth century through Baldwin and Ford's landmark review in the late 1980s — is that skills learnt in one context transfer to another in proportion to how much the two contexts share. The more a training environment resembles the environment where the skill is used, the more likely the behaviour carries across. The less it resembles it, the more the learning stays stranded where it was acquired.

The implication for simulation design is unflattering to generic scenarios. Participants who spend a day working through the fortunes of a fictional widget manufacturer have learnt to make decisions in the context of that fictional widget manufacturer. Their brains do not silently translate the pattern into “and therefore in our regional loan-approvals workflow.” Translation is a conscious act, and in most cases it isn't performed — the lessons stay with the widgets.

What generic scenarios strip out

The decisions people actually face at work are loaded with organisational texture. It isn't “balance cost against growth” in the abstract. It's “do I push back on the regional MD's Q3 number when HR flagged retention risk last week and the finance director is privately briefing that we'll miss the close-out anyway.” The feel of competing, partial, politically-weighted pressures is where judgement lives. It's also the texture that an off-the-shelf product cannot reproduce without destroying its own marketability.

So it doesn't try. The scenarios are pitched at a level of abstraction where the widest possible audience can nod along — cost versus growth, speed versus quality, short-term revenue versus long-term brand. These are real tensions. They're also not decisions; they're frames. The exercise that results is recognisable but not transferable.

The structural bind

This is the point most critiques of off-the-shelf miss. The vendors aren't being lazy, cheap, or cynical. The economics of their category require generality.

For an off-the-shelf simulation to be commercially viable, the same artefact has to sell into many organisations. The more specific a scenario is to one organisation's reality, the fewer organisations it fits. Generality and contextual fit are therefore inverse properties of a single product: a simulation cannot be maximally specific and maximally saleable at the same time. The generic feel of off-the-shelf isn't a flaw in the execution. It's the defining feature of the business model.

That matters, because it means the fit problem can't be fixed by a better off-the-shelf product. It's structural. The best generic simulation in the world is still a generic simulation.

Customisation as cosmetic fit

The usual response to this — from buyers and vendors alike — is customisation. Swap the company name, re-skin the branding, tune a few parameters, adjust the industry flavour of the scenarios. This addresses cosmetic fit. It does not address mechanical fit.

The skeleton of the simulation — the decisions available, the consequences attached to them, the scoring model, the assumptions baked into the scenario events — remains generic. The skin is bespoke; the bones are not. Participants are still being asked to make decisions inside a generic business, in a generic sector, responding to generic market events. Their own name over the door doesn't change what they're rehearsing.

The sharper version of the problem

If generic simulations only failed to teach the right lesson, they'd be an inefficient use of a training day. The sharper concern is that in the wrong setting, they can reinforce the wrong one.

A simulation's mechanics encode an implicit model of how the simulated world rewards behaviour. If those mechanics reward acting fast, overruling the cautious voice, and committing resources aggressively — and the participant's real organisation is compliance-heavy, consensus-driven, and heavily regulated — the simulation has rehearsed a strategy that will misfire when it meets reality. The participant returns to work more confident in a mental model that doesn't fit their environment. Something has changed; it just isn't what anybody intended.

This doesn't happen with every off-the-shelf engagement, and it isn't a case against the category. It's a case for matching the training to the environment the training is supposed to affect.

When off-the-shelf is the right call

There are objectives where generality is a feature, not a limitation. Programmes aimed at universal principles — negotiation fundamentals, basic resource allocation under uncertainty, exposure to systems thinking — often benefit from being stripped of contextual noise. Early-career audiences who need to encounter a way of thinking, rather than apply judgement to their specific environment, are well served. Programmes at scale — thousands of managers across multiple geographies in a single year — may have no bespoke alternative that is both affordable and logistically feasible. And simulations positioned explicitly as the generic first rung of a longer ladder, ending in applied work, do exactly what they're meant to.

The failure mode isn't using off-the-shelf. It's using off-the-shelf for an objective that quietly depended on contextual specificity, and discovering that dependency only at the point where the behaviour change doesn't show up.

How to know which side your objective sits on

Three questions, asked of your own brief before you commission anything, surface the answer.

Are the objectives about generic decision-making, or about decisions specific to this organisation's environment? If the former, off-the-shelf is probably a reasonable fit. If the latter, generality becomes a liability.
Could the same programme be delivered, word for word, to a competitor and produce the same outcome? If yes, the programme isn't leaning on contextual specificity — and off-the-shelf will likely perform as well as bespoke. If no, you're buying context, and generic content cannot supply it.
What do you expect participants to do differently on Monday — and would the simulated scenarios have rehearsed that behaviour in this environment? This is the question that matters most, because it forces the objective into a specific, observable form before any money is committed.

Reframing the hidden cost

The licence fee isn't the hidden cost. The hidden cost is the sum of everything that accrues when the simulation is a poor fit for the objective: a training day at full loaded rate for every participant, a facilitator doing in-room translation work that the artefact should have done, behaviour change that doesn't show up at the six-month mark, and the slow erosion of L&D's internal credibility when “that programme we ran” has no visible tail.

Off-the-shelf is not the wrong answer. It's the wrong answer for the wrong objective — and the difference between the two is testable before anyone signs a purchase order. The choice isn't between generic-and-affordable or bespoke-and-expensive. It's between buying generality and buying specificity, and the question worth asking is which of those the programme actually needs.

Weighing up off-the-shelf?

If you're weighing up off-the-shelf for a programme where the objective is genuinely context-dependent, a conversation before you commission is often worth it — just a clearer read on whether generic will carry the weight you need. No commitment.

Get in touch →