Safety Indicators and Metrics

Near-Miss Quality vs Stop-Work Use vs Observation Depth: Which Leading Indicator Fits

Compare near-miss quality, stop-work use, and observation depth so EHS managers choose leading indicators that change decisions before harm.

By 8 min read
metrics dashboard representing near miss quality vs stop work use vs observation depth which leading indicator — Near-Miss Qu

Key takeaways

  1. 01Compare leading indicators by exposure relevance, decision speed, worker trust, evidence quality, and resistance to gaming.
  2. 02Score near-miss quality by specificity, credible worst case, failed control, and named action owner rather than report volume.
  3. 03Track stop-work use as intervention capacity, including worker-initiated stops, supervisor response, control decisions, and retaliation signals.
  4. 04Use observation depth to verify critical controls during real work instead of rewarding shallow behavior counts.
  5. 05Separate recognition, intervention, and verification in your dashboard before discussing the next EHS review on Headline Podcast.

BLS reported 2.5 million nonfatal workplace injuries and illnesses in private industry in 2024, yet that lagging number cannot tell an EHS manager which weak signal deserved attention last Tuesday. This comparison helps leaders choose between near-miss quality, stop-work use, and observation depth as leading indicators that can move decisions before harm appears.

A leading indicator is a safety measure that points to changing risk before an injury or illness is recorded. OSHA explains that leading indicators drive change, while lagging indicators measure effectiveness after the fact, which is why the most useful dashboard reads reporting quality, worker intervention, and field evidence together.

Why does this leading-indicator choice matter?

The choice matters because each leading indicator answers a different management question in 2026: whether people recognize weak signals, whether they interrupt unsafe work, and whether field observations show the real condition of controls. A company that counts all three as generic proactive activity may fill a dashboard while still missing fatal-risk exposure.

BLS reports that private industry employers recorded 2.5 million nonfatal workplace injuries and illnesses in 2024, down 3.1 percent from 2023. That trend matters, but it arrives after the work has already succeeded or failed.

On the Headline Podcast, Andreza Araujo and Dr. Megan Tranter often return to the same leadership problem: numbers can look serious while conversations stay shallow. Co-host Andreza's book Far Beyond Zero challenges the belief that a low injury count proves operational control, because absence of reported harm can reflect luck, fear, classification pressure, or weak listening.

The trap is not choosing a leading indicator. The trap is choosing the easiest one to count, then rewarding volume without asking whether the indicator changed a decision, stopped exposure, or corrected a control before the next shift.

Evaluation criteria for leading indicators

A useful leading indicator should be judged by five criteria: exposure relevance, decision speed, worker trust, evidence quality, and resistance to gaming. If the metric cannot change staffing, stop work, redesign a control, or trigger executive escalation, it belongs in an activity report rather than a safety dashboard.

Five criteria separate preventive evidence from busy work, because leading indicators often fail when they measure effort rather than risk. One hundred observations can be weaker than six high-quality reports if the larger number repeats obvious PPE comments while the smaller set identifies deteriorating barriers around hazardous energy.

Across more than 250 cultural transformation projects, Andreza Araujo has identified that measurement changes behavior only when leaders attach it to visible decisions. A plant manager who asks for near-miss count alone teaches volume. A plant manager who asks which near misses revealed a missing engineering control teaches risk recognition.

For Headline's audience, the leadership standard is practical. The indicator should tell an operations leader where to go, whom to involve, what to verify, and which control deserves money or authority this week.

1. Near-miss quality: best for weak-signal learning

Near-miss quality is best when the organization needs to improve the way people recognize, describe, and escalate weak signals before an incident occurs. The strongest version does not count reports as proof of culture; it scores whether the report names the exposure, the failed or missing control, the credible worst outcome, and the action owner.

The value of this indicator is diagnostic depth. A near-miss report that says "almost hit by forklift" gives a manager a topic, while a high-quality report explains aisle design, pedestrian separation, visibility, speed, supervision, and the specific point where the control failed.

As Andreza Araujo argues in The Illusion of Compliance, the real measure of a safety system is what happens when no one is watching. Near-miss quality tests that idea because workers often report what the culture has taught them is safe to say, not everything they noticed.

Use this indicator when underreporting, superficial reports, or repeated low-value classifications are the dominant problem. It becomes weaker when leaders punish bad news, because workers will either stay silent or report only harmless observations that cannot threaten the score.

2. Stop-work use: best for intervention capacity

Stop-work use is best when leaders need to know whether people will interrupt exposure at the point of risk. The indicator should not celebrate every pause as equal; it should distinguish planned pauses, worker-initiated interventions, supervisor-supported stops, repeated causes, and whether the stop led to a control change.

The value of stop-work data is decision speed. A near miss may be documented after the shift, while a stop-work action tests whether the organization can protect the task while production pressure is still alive.

OSHA requires employers to report severe work-related injuries within 24 hours when they involve an amputation, in-patient hospitalization, or loss of an eye. That reporting threshold is severe by design, which is exactly why a stop-work metric should focus upstream on high-energy exposure before a reportable outcome exists.

The risk is performative authority. Many companies say everyone has stop-work authority, although workers learn quickly whether the first person who uses it is thanked, ignored, challenged, or quietly penalized through overtime, assignments, or reputation.

3. Observation depth: best for control verification

Observation depth is best when the company needs field evidence about whether controls are present, understood, and working during normal work. It improves on observation count by scoring the quality of the observation: task specificity, control focus, worker dialogue, follow-up, and whether the observer verified a condition rather than only noticing behavior.

The value of observation depth is control realism. A shallow observation says a worker wore gloves. A deep observation asks whether the energy-isolation verification was performed, whether the worker could explain the critical step, and whether the supervisor had removed schedule pressure from the task.

NIOSH describes the hierarchy of controls as a preferred order, moving from elimination and substitution through engineering controls, administrative controls, and PPE. Observation depth matters because many dashboards overcount visible behavior while underchecking higher-order controls that actually reduce exposure.

In Andreza Araujo's safety-culture work, including Safety Culture: From Theory to Practice, the point of observation is not surveillance. The point is dialogue that reveals beliefs, shortcuts, constraints, and control gaps that a form alone would never show.

4. Which indicator is easiest to game?

Near-miss volume is the easiest to game when the organization rewards count, because workers can submit low-risk items that protect the metric without exposing difficult truths. Stop-work use can also be distorted when leaders count every planned pause as empowerment, while observation depth is harder to fake only when the scoring requires evidence from the actual task.

The defense is triangulation. If near-miss quality improves, stop-work use stays at zero, and observations remain shallow, the organization may have better paperwork but not better intervention capacity. If stop-work actions rise while near-miss quality falls, people may be reacting late because earlier signals were missed or ignored.

Andreza Araujo's experience across 25+ years leading EHS at multinationals points to a practical rule: any metric tied to recognition, bonuses, or public ranking will eventually be managed by the culture. Leaders need audit questions that test whether the measure changed risk, not whether the measure looked active.

The dashboard should therefore show quality thresholds. For example, score a near miss only when it names a credible worst case, score stop-work only when it records the control decision, and score observation depth only when it verifies at least one critical control or task constraint.

5. Decision matrix for EHS managers

The decision matrix should show which indicator fits which management decision, because near-miss quality, stop-work use, and observation depth are not substitutes. They belong together, but each should carry a different weight depending on the maturity problem the site is trying to solve.

CriterionNear-miss qualityStop-work useObservation depth
Best useWeak-signal learning and reporting trustImmediate intervention at the point of exposureVerification of controls during real work
Main weaknessCan become story volume without actionCan be discouraged by production pressureCan become behavioral policing if poorly framed
Best ownerEHS with supervisor reviewOperations with EHS assuranceSupervisors and managers with worker participation
Decision triggerEscalate repeated weak signals and improve classificationStop, redesign, authorize resources, or reset the planRepair controls, coach supervisors, or change the work method
Gaming testAudit report specificity and worst-case credibilityAudit worker-initiated stops and retaliation signalsAudit task specificity and control evidence

For most sites, the correct answer is not one indicator. Near-miss quality tells leaders whether people see and describe risk, stop-work use tells whether they interrupt risk, and observation depth tells whether leaders verify the controls that should make safe work possible.

6. Recommendation by business context

Low-maturity sites should start with near-miss quality because they often need to rebuild reporting trust before dashboards can be believed. A site that receives many reports with vague language, repeated housekeeping items, or no credible worst-case description is not ready to claim strong leading indicators.

High-energy operations such as mining, utilities, construction, logistics, oil and gas, and heavy manufacturing should give more weight to stop-work use and observation depth. In those environments, a worker may face mobile equipment, energized systems, lifting operations, confined spaces, or work at height, where one weak control can become catastrophic before a monthly trend changes.

During the PepsiCo South America period, where Andreza Araujo's safety work contributed to a 50 percent reduction in the accident ratio in six months, the lesson was not that one metric saved the program. The lesson was that visible leadership, field follow-up, and disciplined measurement had to move together.

For multi-site companies, start with a 90-day pilot in one risk family. Define what a high-quality near miss must contain, what counts as a meaningful stop-work action, and what observation depth must verify, then compare the three indicators against overdue corrective actions and serious exposure.

7. What should leaders ask every month?

Leaders should ask whether weak signals are becoming more specific, whether people are stopping work without retaliation, and whether observations are verifying controls rather than counting visible behavior. Those three questions keep the dashboard connected to decisions rather than activity volume.

A useful monthly review has four parts. First, sample ten near-miss reports and score their quality. Second, review every stop-work action for response time, control decision, and worker treatment. Third, audit observation depth against one critical task. Fourth, compare findings with corrective-action closure and serious-incident potential.

The Headline Podcast language of real conversations matters here because safety metrics often fail through polite dashboards. Leaders see green boxes, workers know which controls are weak, and supervisors learn which questions the executive team will never ask.

Each month spent counting proactive activity without testing decision quality allows weak signals to age, while production pressure teaches people which risks are safe to report and which ones are better left unsaid.

Conclusion

Near-miss quality is strongest for weak-signal learning, stop-work use is strongest for intervention capacity, and observation depth is strongest for control verification. The best leading-indicator set uses all three, but it gives the highest weight to the indicator that exposes the current maturity problem.

If your dashboard still treats every proactive count as equal, start by separating recognition, intervention, and verification. For more conversations on safety leadership, evidence, and practical culture change, visit Headline Podcast.

Topics leading-indicators near-miss-quality stop-work-authority behavioral-observation safety-metrics ehs-manager headline-podcast

Frequently asked questions

What is the best leading indicator for safety?
There is no single best leading indicator for every operation. Near-miss quality is strongest when the problem is weak signal recognition, stop-work use is strongest when the problem is intervention at the point of exposure, and observation depth is strongest when leaders need evidence that controls work during real tasks.
Is near-miss reporting a leading indicator?
Near-miss reporting can be a leading indicator when the organization measures report quality, credible worst case, failed or missing controls, and action ownership. It becomes weak when the dashboard rewards only volume, because people can submit harmless reports while serious exposure remains hidden.
How should stop-work authority be measured?
Stop-work authority should be measured by meaningful use, not slogans. Track worker-initiated stops, the exposure involved, response time, control decision, restart criteria, and whether the worker experienced any formal or informal retaliation after raising the concern.
What is observation depth in safety metrics?
Observation depth measures whether a field observation verified the real task and its controls. A deep observation checks task conditions, critical controls, worker dialogue, supervisor constraints, and follow-up, rather than recording only visible behavior such as PPE use.
How do EHS managers prevent leading indicators from being gamed?
EHS managers prevent gaming by auditing quality thresholds. A near miss should name exposure and credible severity, a stop-work action should record the control decision, and an observation should verify a task-specific control. Incentives should reward risk reduction, not raw counts.

About the author

Andreza Araújo

Safety Culture Expert | Senior EHS Executive

Andreza Araújo is a safety culture expert and senior EHS executive with more than 25 years of experience in environment, health and safety. She is a Civil Engineer and Occupational Safety Engineer from Unicamp, holds a Master's degree in Environmental Diplomacy from the University of Geneva, and completed sustainability studies at IMD Switzerland. Andreza has served in Global Head of EHS roles in Fortune 500 environments, leading cultural transformation programs across multinational operations. She has represented Brazil as a speaker at the United Nations in Paris and has spoken at the International Labour Organization in Turin. She is the author of more than 16 books on safety culture in Portuguese, Spanish, English and German. Her work has earned more than 10 EHS awards, including two recognitions from Indra Nooyi, former PepsiCo CEO.

  • Civil & Safety Engineer (Unicamp)
  • M.A. Environmental Diplomacy (University of Geneva)
  • Sustainability Cert (IMD Switzerland)
  • People Management & Coaching (Ohio University)
  • UN Paris speaker representative for Brazil
  • ILO Turin speaker
  • LinkedIn Top Voice
  • Indra Nooyi PepsiCo CEO recognition (2x)

Documentaries

Watch Andreza's documentaries

Three productions on safety culture, organizational failure and the human lessons behind major disasters.

Podcasts

Listen to Andreza's podcasts

She hosts three shows on safety leadership, EHS and organizational culture, in English and Portuguese.

Summarize with AI