Incident Investigation

Incident Investigation: 6 Traps That Keep the Real Cause Out of the Room

A diagnostic article for EHS managers and investigators on how first reports, witness pressure, and weak evidence handling keep the real cause out of the room.

By 9 min read
investigative scene on incident investigation 6 traps that keep the real cause out of the room — Incident Investigation: 6 Tr

Key takeaways

  1. 01The first report is the opening version of events, not the cause.
  2. 02James Reason's model matters because active failure is usually the last visible break in a longer chain of latent conditions.
  3. 03A good investigation separates chronology from explanation and maps the control failure that made the event possible.
  4. 04Precursor events, weak handovers, and repeat overrides are clues, not noise.
  5. 05The first 24 hours should protect evidence, sequence interviews, and verify which control failed before restart.

An incident investigation is a decision process, not a search for the first sentence that sounds neat. The first report gives the opening version of events, but the cause stays hidden until the team tests evidence, control failure, and precursor events against the field.

This article is for EHS managers, incident investigators, and plant leaders who need to turn a messy event into a decision the same shift can trust. Across 25+ years leading EHS in multinational operations, Andreza Araujo has seen investigations go shallow when the room rewards speed more than accuracy. In more than 250 cultural transformation projects, the same pattern repeats: the first story gets written fast, the real mechanism gets written late, and the organization learns the wrong lesson.

In Sorte ou Capacidade, Andreza Araujo argues that serious events are systemic, not random bad luck. In Um Dia Not To Forget, the lesson is even sharper, because the event only helps the organization if leadership is willing to look at the chain that made it possible. In Safety Culture: From Theory to Practice, the same theme appears from another angle, since culture is visible in the questions leaders ask after the first shock passes.

During Andreza Araujo's PepsiCo South America tenure, the accident ratio fell 50% in 6 months. That result matters here because it shows how much the quality of the first managerial response shapes the rest of the system. When leaders slow down enough to test evidence before they test blame, the next decision becomes more accurate.

The article also uses James Reason's Human Error and Managing the Risks of Organizational Accidents as the academic frame, because latent failures, active failures, and barrier drift explain why the first report is rarely the full cause. The practical job is to move from story to mechanism without losing the human facts that the field already knows.

Why the first report is not the cause

The first report is useful because it marks the starting point of attention, but it is not the same as explanation. A supervisor, operator, contractor, or witness often sees one fragment of the event, which means the first narrative is usually partial even when it is honest and well intended.

James Reason helps here because the visible action is often the final break in a longer chain of latent conditions. In other words, the person who touched the last visible step is not automatically the source of the deeper mechanism. If the investigation stops at the first account, it confuses proximity with causality.

The right question is not "who spoke first?" It is "what did this person see, what did they not see, and which controls were already weakened before the event became visible?" That shift keeps the investigation from turning one fragment into a complete explanation.

Trap 1: the first story sounds complete

A complete-sounding story is often just the story with the fewest gaps. When the room gets a single witness, one photo, and a supervisor summary, the group can feel it already knows enough, especially if the event appears simple and nobody is injured badly enough to create pressure for more work.

That feeling is dangerous because narrative closure is emotionally efficient. The team wants to move on, the manager wants to protect the schedule, and the investigator wants a clean draft. Yet a neat draft is not the same as a verified sequence of events.

That is why witness collection belongs next to evidence preservation, not after the report is half written. If you want a disciplined sequence, compare this point with how to preserve incident evidence in the first 24 hours and witness statements and the interview errors investigators miss. The first task is not to settle the story. It is to keep the story from hardening too early.

Trap 2: the active failure gets all the blame

The active failure is the visible action that finally lets the event happen, which is why people notice it first. The problem is that visibility is not importance. A dropped tool, an incorrect valve position, or a missed isolation check may be the last action in the chain, but it still does not explain why the chain was ready to fail.

This is where investigators need Reason's model and a better sense of barrier drift. If a permit, a hold point, a handover, or a verification step was already weak, then the active failure is only the final symptom of a system that had already lost margin. In Safety Culture: From Theory to Practice, Andreza Araujo treats that drift as a leadership problem because culture shows up in what leaders tolerate before the event.

Across more than 250 projects, Andreza Araujo has seen teams protect the visible person while leaving the deeper condition untouched. That is a false kindness, because it gives the organization a human target while the control gap stays open for the next shift.

Trap 3: chronology is mistaken for explanation

Time order helps, but sequence is not mechanism. A timeline tells you what happened first, next, and last. It does not tell you why the control failed, why the hazard escaped detection, or why the team accepted a condition that should have triggered a pause.

That difference matters because incident teams often build a slide deck that looks precise while remaining shallow. The event is drawn as a chain of timestamps, and the timestamps create confidence. Yet the organization still cannot answer the more important question, which is what changed in the system before the first visible break.

Chronology Explanation
What happened first, next, and last. What mechanism made the sequence possible.
Useful for context and interview order. Useful for decisions, redesign, and prevention.
Can be built from one strong witness. Needs several sources of evidence and control review.

If you need a practical method for this gap, compare the timeline with how to build an incident evidence map in 48 hours. An evidence map forces the team to test whether the timeline has a mechanism behind it, or whether it is still only a chronology.

Trap 4: precursor events stay unnamed

Heinrich and Bird both pushed safety thinking toward precursor logic, which is still useful when it reminds leaders that serious outcomes are usually preceded by smaller signals. The mistake is to treat those signals as noise rather than as clues that a control is bending.

A precursor is not an annoyance. It is the last visible chance to understand where the system bent before it broke. If the investigation ignores near misses, repeated overrides, maintenance backlog, or weak handovers, it loses the part of the chain that was most likely to be fixable before the event.

That is also why investigators should not confuse a large list of minor events with understanding. The list matters only if it shows the same control failing again and again. If it does not, the organization is counting events without learning from them.

For method choice, the comparison in five whys, fishbone, and barrier failure review helps teams choose the tool that fits the evidence. The method should follow the mechanism, not the other way around.

Trap 5: control failure becomes a footnote

Many investigation reports describe the event well but treat the broken control as a footnote. That is backwards, because the control failure is often the real object of prevention. If the permit, barrier, guard, verification step, or hold point did not work, then the report needs to explain why that control was trusted in the first place.

When the team asks only what the worker did, it misses what management accepted. A weak control can survive for months because nobody owns its verification. A missing handover check can survive because the system rewards closing the incident record faster than fixing the process. A missing isolation proof can survive because everyone assumes the written rule is enough.

In Um Dia Not To Forget, the deeper lesson is that the incident becomes a governance test, not just an operational one. A serious event should therefore force one hard question: which control failed first, and who was responsible for proving that control before the work began?

What stronger investigations do in the first 24 hours

A stronger first day is not longer. It is stricter. The investigator who has a clear sequence for the first 24 hours usually learns more than the team that holds six meetings and writes one tidy paragraph.

  • Freeze the scene and protect evidence before people start explaining it away.
  • Collect the first account, but do not treat it as the cause.
  • Interview witnesses in a sequence that preserves independent memory, which is why the guide on witness interview errors after incidents matters.
  • Map the controls that were supposed to prevent, detect, or limit the event.
  • Decide whether work can restart only after the control basis is verified.

That sequence respects both the field and the system. It protects memory, which fades fast, and it protects mechanism, which is the only thing that can explain why the event happened at all. If the team cannot name the broken control by the end of the first day, the investigation is still collecting opinions.

What to teach the next investigator

The next investigator needs a better habit, not just another template. Andreza Araujo has spent more than 25 years showing that culture changes when people learn to ask different questions under pressure. In investigation work, that means asking where the control failed, what precursor was ignored, and why the organization accepted the condition long enough for the event to happen.

In Safety Culture: From Theory to Practice, the lesson is that quality of attention shapes quality of outcome. A team that is trained to chase the first story will usually close the investigation too fast. A team that is trained to test evidence, compare sources, and name the broken control will usually find a better answer even when the event is messy.

That does not mean every report becomes perfect. It means the organization stops rewarding a fast conclusion that makes the problem feel smaller than it is. The next investigator should be able to say, with confidence, that the report explains the mechanism, not just the sequence.

What to do next on the same shift

If the event happened today, start with the evidence map, then move to witness sequencing, and only then write the summary. If the event happened on a contractor job, include the contractor supervisor in the control review because the system that failed may cross organizational boundaries.

For leaders who want a deeper companion on the human and cultural side, listen to Headline Podcast and compare this investigation discipline with the leadership routine in investigation method selection. The point is not to collect more paper. The point is to keep the real cause from leaving the room before the organization has tested it.

When the report is finally written, it should answer one clear question: what control failed, why did it fail, and what will the next shift do differently because of that answer?

FAQ

What is the biggest mistake in incident investigation?

The biggest mistake is treating the first report as the cause. The first report is useful, but it is only one fragment of the event. If the team stops there, it usually confuses chronology with mechanism and misses the control failure that really matters.

Why do investigators blame the closest person?

They do it because the closest person is usually the easiest visible explanation. James Reason's work helps show why that shortcut is weak, because the visible action is often the final break in a chain of latent conditions, not the deep source of the event.

What should the first 24 hours focus on?

Protect evidence, sequence witness interviews, map the controls, and decide whether the job can restart. If the investigation does not preserve memory and mechanism early, it often loses the strongest facts before the report starts.

How do I know the report is too shallow?

If the report has only one story, one timeline, and one blamed step, it is probably too shallow. A stronger report can show precursor events, broken barriers, and the reason the control was trusted before it failed.

Which Andreza Araujo book fits this topic best?

Sorte ou Capacidade is the sharpest starting point because it frames incidents as systemic rather than accidental. Um Dia Not To Forget is the next step when you want the governance and memory side of a serious event to be taken seriously.

For readers who want the companion method, the article How to Build an Incident Evidence Map in 48 Hours gives the practical sequence, and Headline Podcast keeps the leadership side of the conversation moving.

Topics incident-investigation incident-evidence witness-interviews latent-failures james-reason evidence-map root-cause-analysis

Frequently asked questions

What is the biggest mistake in incident investigation?
Treating the first report as the cause. The first report is useful, but it is only one fragment of the event. If the team stops there, it usually confuses chronology with mechanism and misses the control failure that really matters.
Why do investigators blame the closest person?
They do it because the closest person is usually the easiest visible explanation. James Reason's work helps show why that shortcut is weak, because the visible action is often the final break in a chain of latent conditions, not the deep source of the event.
What should the first 24 hours focus on?
Protect evidence, sequence witness interviews, map the controls, and decide whether the job can restart. If the investigation does not preserve memory and mechanism early, it often loses the strongest facts before the report starts.
How do I know the report is too shallow?
If the report has only one story, one timeline, and one blamed step, it is probably too shallow. A stronger report can show precursor events, broken barriers, and the reason the control was trusted before it failed.
Which Andreza Araujo book fits this topic best?
Sorte ou Capacidade is the sharpest starting point because it frames incidents as systemic rather than accidental. Um Dia Not To Forget is the next step when you want the governance and memory side of a serious event to be taken seriously.

About the author

Andreza Araújo

Safety Culture Expert | Senior EHS Executive

Andreza Araújo is a safety culture expert and senior EHS executive with more than 25 years of experience in environment, health and safety. She is a Civil Engineer and Occupational Safety Engineer from Unicamp, holds a Master's degree in Environmental Diplomacy from the University of Geneva, and completed sustainability studies at IMD Switzerland. Andreza has served in Global Head of EHS roles in Fortune 500 environments, leading cultural transformation programs across multinational operations. She has represented Brazil as a speaker at the United Nations in Paris and has spoken at the International Labour Organization in Turin. She is the author of more than 16 books on safety culture in Portuguese, Spanish, English and German. Her work has earned more than 10 EHS awards, including two recognitions from Indra Nooyi, former PepsiCo CEO.

  • Civil & Safety Engineer (Unicamp)
  • M.A. Environmental Diplomacy (University of Geneva)
  • Sustainability Cert (IMD Switzerland)
  • People Management & Coaching (Ohio University)
  • UN Paris speaker representative for Brazil
  • ILO Turin speaker
  • LinkedIn Top Voice
  • Indra Nooyi PepsiCo CEO recognition (2x)

Documentaries

Watch Andreza's documentaries

Three productions on safety culture, organizational failure and the human lessons behind major disasters.

Podcasts

Listen to Andreza's podcasts

She hosts three shows on safety leadership, EHS and organizational culture, in English and Portuguese.

Summarize with AI