Incident Investigation

How to Run a Barrier Failure Review After a Serious Incident

A Headline Podcast guide for reviewing barrier failure after a serious incident, so leaders can separate broken controls from weak explanations.

By 9 min read

Key takeaways

  1. 01A barrier failure review starts by naming the unwanted energy or exposure, because injury descriptions alone do not reveal which control failed.
  2. 02Useful findings classify the barrier as absent, failed, bypassed, or ineffective, since each category requires a different action.
  3. 03Field verification must be tested against the actual failure mode, not against the existence of a signature, checklist, or permit.
  4. 04Corrective actions should be assigned to the true barrier owner, such as engineering, maintenance, operations, supervision, procurement, or EHS governance.
  5. 05Senior leaders should close the review by asking where else the same barrier could fail under similar operational pressure.

A serious incident review often starts with the wrong question. Leaders ask who missed the procedure before they ask which barrier was expected to stop the energy, isolate the exposure, interrupt the sequence, or force a pause.

This guide shows how a senior EHS leader can run a barrier failure review after a serious incident without collapsing the investigation into blame, paperwork, or generic retraining. The aim is to identify the control that failed, the condition that weakened it, and the management decision that allowed the weakness to remain present.

On Headline Podcast, Andreza Araujo and Dr. Megan Tranter often describe safety as a real leadership conversation, not a vocabulary exercise. Barrier failure review is one of those conversations because it tests whether the organization wants an explanation that feels clean or a control decision that changes the work.

What you need before starting

Before the first review meeting, gather the incident sequence, photographs, permits, isolation records, inspection history, maintenance notes, training records, supervision notes, design drawings if applicable, shift handover records, and the first witness accounts. If the incident involved high energy, mobile equipment, confined space, hot work, work at height, chemicals, stored pressure, or electrical exposure, the review also needs someone who understands the technical control path.

The review owner should define one practical boundary. The barrier failure review does not replace the full investigation. It gives the investigation a disciplined control lens so the team does not spend two weeks debating behavior while the failed barrier remains available for the next shift.

That distinction matters because first-hour incident evidence ages quickly. A damaged guard is repaired, a bypass is removed, a permit is rewritten, and the scene starts to look more controlled than it was when the event occurred. The review must protect the original condition before good intentions erase it.

Step 1. Define the unwanted energy or exposure

Start with the energy or exposure that reached the person, asset, environment, or process. Name it plainly. Mechanical movement, gravity, pressure, electricity, chemical release, heat, traffic movement, suspended load, stored energy, or absence of oxygen each requires a different control logic.

The group should write a one-sentence exposure statement before discussing causes. For example, a worker entered the line of fire of a suspended load, or a technician contacted equipment that still held stored hydraulic pressure. This sentence keeps the review anchored in physics and work design rather than personality.

James Reason's work on organizational accidents is useful here because it separates the visible event from the latent conditions that allowed defenses to line up poorly. If the team cannot name the exposure, it will usually name a person instead.

The common error is starting with the injury description. Injuries describe consequence. Barrier reviews need the exposure pathway, because the same exposure may produce no injury on Monday and a fatality on Friday.

Step 2. Identify the barrier that should have stopped the sequence

Once the exposure is clear, ask what should have stopped it. The answer may be a physical barrier, engineering interlock, isolation point, permit condition, atmospheric test, traffic separation, lifting exclusion zone, supervision hold point, alarm, procedure step, or competence requirement.

Do not accept awareness, care, attention, or common sense as a barrier. Those words may describe personal expectations, but they do not define a control that can be verified before work starts. A useful barrier can be inspected, tested, observed, assigned, maintained, or stopped when unavailable.

This is where the review can link with barrier decay. A barrier rarely disappears in one dramatic moment. It is usually normalized through missing parts, informal workarounds, rushed verification, unclear ownership, production pressure, or inspection routines that never test the condition that matters.

Co-host Andreza Araujo has explored this management pattern in Safety Culture: From Theory to Practice, where culture is treated as what an organization permits, measures, corrects, and rewards under pressure. A barrier that fails repeatedly is often a cultural signal before it becomes an investigation finding.

Step 3. Classify the barrier as absent, failed, bypassed, or ineffective

The review should not treat all barrier problems as the same. An absent barrier means the control was not in place. A failed barrier means it was present but did not work. A bypassed barrier means people worked around it. An ineffective barrier means it worked as designed but was not strong enough for the real exposure.

This classification changes the action. An absent barrier may require planning and supervision changes. A failed barrier may require maintenance, testing, or design review. A bypassed barrier may expose schedule pressure, poor usability, or weak enforcement. An ineffective barrier may require engineering change because the original risk assessment underestimated the task.

Ask the team to support the classification with evidence. If the lockout was present, which isolation point failed? If the exclusion zone existed, how was it breached? If the permit required gas testing, when was the test performed, by whom, and under which atmospheric condition?

The trap is writing the finding as procedure not followed. That phrase may be accurate, although it is not yet useful. The review must explain whether the procedure was unavailable, impractical, ignored, contradicted by production reality, or insufficient for the hazard.

Step 4. Rebuild the sequence around barrier decisions

Create a timeline that marks barrier decisions, not only event timestamps. Include when the task was planned, when the hazard was recognized, when the barrier was selected, when it was verified, when it degraded, when someone could have paused, and when the exposure became active.

This timeline should sit next to the broader incident record. If the organization already uses Five Whys for SIFs, the barrier timeline keeps the method from drifting into a chain of worker choices. It forces each why to return to a control condition.

Good timelines include quiet moments. The missed inspection, the unclear handover, the accepted shortcut, and the unanswered concern may matter more than the final visible act. Those moments show where leadership systems had a chance to detect drift before harm appeared.

A useful test is simple. If a senior leader reads the timeline, can that leader see where a management system decision influenced the barrier? If not, the sequence is probably still too close to the operator's hands and too far from the organization's controls.

Step 5. Test whether the barrier was verified before work started

A barrier that exists on paper but is not verified in the field is not yet a dependable control. The review should ask who verified it, when verification occurred, what evidence proves it, and whether the verification method would have detected the failure mode involved in the incident.

For high-energy work, verification must be more than a signature. A permit signed before conditions changed, a checklist completed from memory, or a supervisor walk that never tests the critical point can create the appearance of control while leaving the exposure untouched.

The market often underestimates this step because leaders assume documentation equals protection. Documentation is evidence that a conversation or check was intended. It is not evidence that the barrier was available unless the check directly tested the condition that could kill or seriously injure someone.

Frank Bird and Herbert Heinrich are often cited for precursor thinking, and the practical lesson is still relevant. Repeated weak verification events deserve attention before the organization has to explain why the same weakness appeared in a fatal or life-altering incident.

Step 6. Separate technical failure from management tolerance

The review must distinguish a technical failure from the tolerance that allowed the failure to remain credible. A valve can fail technically, but the tolerance may sit in overdue maintenance, weak inspection quality, ignored defect reports, unavailable spare parts, or a planning system that accepts temporary fixes for months.

Ask what the organization already knew. Previous near misses, audit findings, maintenance backlogs, operator complaints, supervisor notes, and quality deviations often show that the barrier was sending signals before the serious event. If those signals existed, the finding should name the escalation failure, not only the component failure.

This step connects naturally with serious incident potential classification. A weak barrier in a low-consequence event can still reveal fatal risk if the energy, exposure, and recurrence path are credible.

The common error is treating every technical fix as closure. Replacing the broken part may restore the equipment, but it does not explain why the organization tolerated the conditions that made the broken part consequential.

Step 7. Decide what must change before restart

Barrier failure reviews should produce restart decisions, not only final report language. Before similar work resumes, leaders need to decide which interim controls are required, who approves them, how they will be verified, and what condition would stop the job again.

For some incidents, restart may require engineering repair, isolation redesign, retraining, procedure rewrite, supervisor briefing, or temporary work suspension. For others, the immediate decision may be a targeted verification campaign across similar assets, because the failed barrier may exist in more than one place.

Dr. Megan Tranter's Headline perspective on clarity in messy leadership moments fits this stage. Restart pressure is exactly when vague communication becomes dangerous. Leaders need a written condition for restart, a named owner, and a visible field check that workers can recognize.

The worst restart decision is informal confidence. If the team says the work can restart because everyone has been reminded, the review has probably missed the control issue.

Step 8. Assign actions to the true barrier owner

Corrective actions should follow the barrier owner. Engineering owns design and physical safeguards. Maintenance owns inspection and repair routines. Operations owns work sequencing and staffing. Procurement may own tools or contractor capability. Supervision owns field verification and pause authority. EHS owns method quality, governance, and challenge, but not every control.

Each action should name the barrier, the failure mode, the owner, the verification method, and the evidence required for closure. If the action says retrain workers, require the team to prove that knowledge was the dominant failure mechanism. If the proof is weak, the action is probably too convenient.

This is why corrective action aging should never be reviewed only by due date. A closed action that leaves the weak barrier in place is worse than an overdue action, because it gives leaders false confidence.

Across Headline conversations about leadership and safety, the recurring point is practical accountability. The person who can change the barrier must own the action, otherwise the investigation becomes a document that asks the wrong function to fix the wrong problem.

Step 9. Close with an executive barrier question

Before the review closes, ask one executive question: where else could this same barrier fail under similar pressure? The answer turns an incident review into a risk governance decision.

The question should not produce a vague corporate campaign. It should produce a short list of similar tasks, sites, assets, contractors, shifts, or operating modes where the same barrier logic exists. Then leaders can decide whether to inspect, pause, redesign, audit, or escalate.

This final step is where the review becomes more than an investigation technique. It gives senior leaders a way to see whether the incident is isolated, repeated, tolerated, or structurally designed into the work. That is the difference between explaining the past and protecting the next shift.

If the organization cannot answer the executive barrier question, the incident is not ready to be closed. The report may be written, but the risk conversation is unfinished.

Final checklist for a barrier failure review

Use this checklist before closing the review meeting. It keeps the process practical while protecting the technical depth that serious incidents require.

  • The unwanted energy or exposure is named in one sentence.
  • The expected barrier is identified and supported by evidence.
  • The barrier is classified as absent, failed, bypassed, or ineffective.
  • The timeline shows barrier decisions, not only event timestamps.
  • Field verification quality is tested against the actual failure mode.
  • Known warning signals are checked against previous data.
  • Restart conditions are written, owned, and field-verifiable.
  • Actions are assigned to true barrier owners.
  • Executives decide where else the same barrier could fail.

A serious incident is not closed when the report names a cause. It is closed only when the failed barrier is understood, restored, verified, and searched for elsewhere before the next exposure repeats.

Conclusion

A barrier failure review gives leaders a more demanding question than who made the last visible mistake. It asks what should have stopped the exposure, why that control was not dependable, and where the same weakness may already exist.

Use the next serious incident review to test one discipline: every finding must name a barrier, a failure mode, an owner, and a verification method. Without those four elements, the organization may have an explanation, but it does not yet have control.

Topics incident-investigation barrier-failure critical-controls serious-incidents ehs-manager safety-leadership

Frequently asked questions

What is a barrier failure review?
A barrier failure review is a structured post-incident review that identifies which control should have stopped an unwanted energy or exposure, how that control failed or degraded, who owns it, and how the organization will verify that the weakness has been corrected.
When should a company run a barrier failure review?
A company should run a barrier failure review after a serious incident, a high-potential near miss, a repeated control failure, or any event involving high energy, stored energy, mobile equipment, work at height, chemicals, confined space, hot work, or other credible fatal-risk exposure.
How is barrier failure different from human error?
Human error describes a visible action or decision. Barrier failure asks what control was expected to prevent the exposure, whether that control was present, whether it worked, whether it was bypassed, and whether leaders had tolerated conditions that made failure likely.
Who should own corrective actions after a barrier failure?
The action owner should be the function that can change the failed barrier. Engineering may own design, maintenance may own inspection routines, operations may own work sequencing, supervision may own field verification, procurement may own tools or contractor capability, and EHS may own method quality and governance.
What is the biggest mistake in a barrier failure review?
The biggest mistake is closing the review with retraining or procedure reminders before proving that knowledge was the dominant failure mechanism. Serious incidents usually require leaders to test physical controls, verification quality, ownership, recurrence, and management tolerance.

About the author

Host & Editorial Lead

Andreza Araujo is an international reference in EHS, safety culture and safe behavior, with 25+ years leading cultural transformation programs in multinational companies and impacting employees in more than 30 countries. Recognized as a LinkedIn Top Voice, she contributes to the public conversation on leadership, safety culture and prevention for a global professional audience. Civil engineer and occupational safety engineer from Unicamp, with a master's degree in Environmental Diplomacy from the University of Geneva. Author of 16 books on safety culture, leadership and SIF prevention, and host of the Headline Podcast.

  • Civil Engineer (Unicamp)
  • Occupational Safety Engineer (Unicamp)
  • Master in Environmental Diplomacy (University of Geneva)
Summarize with AI