Maintenance KPIs: The 15 Metrics That Prove Reliability Is Improving

By Daniel Brooks on May 25, 2026

maintenance-kpis-metrics

If your maintenance team can't tell you the MTBF of your top 10 critical assets in under 60 seconds, you're not running a reliability program — you're running a reactive firefighting operation dressed up in a spreadsheet. Every percentage point of PM compliance you lose costs roughly $80,000 per year in unplanned downtime for a mid-sized plant. Every hour added to MTTR cascades through OEE, throughput and customer commitments. The plants winning in 2026 aren't tracking more metrics — they're tracking the right 15, in real time, with automated drill-down to root cause. Book a free maintenance KPI audit with iFactory engineers →

Executive Summary

Reliability Is a Measurement Discipline — Not a Maintenance Department

U.S. manufacturers losing 5–20% of production capacity to unplanned downtime share one root cause: they measure activity, not outcomes. The 15 KPIs in this guide separate leading from lagging indicators, define world-class benchmarks, and connect each metric to the operational decision it should drive. Tracked together inside a CMMS or EAM platform, they form the closed-loop reliability framework that converts maintenance from a cost center into a margin-protection function.

90%+
World-class PM compliance benchmark
85%
OEE target for top-quartile plants
50%+
Wrench time target for productive technicians
2–4%
Healthy maintenance cost as % of RAV

Why Maintenance KPIs Decide Whether Your Plant Compounds or Decays

Maintenance KPIs are the measurable indicators that translate equipment behavior into business decisions. They answer four questions every plant manager must answer weekly: Is my equipment getting more or less reliable? How fast do we recover when it fails? Are we maintaining proactively or reacting to breakdowns? And what is reliability costing us per unit produced? Without these numbers, every maintenance investment is a guess and every reliability claim is anecdotal. With them, maintenance becomes a forecasting function — predicting failures, scheduling interventions, and protecting throughput before it's lost.

The 2026 distinction between high-performing and average plants is no longer whether they track KPIs — almost everyone claims to. The distinction is whether they track leading indicators alongside lagging ones, whether the data updates in real time from work-order systems, and whether each KPI is tied to a decision threshold that triggers action. Plants that review month-old spreadsheet KPIs in a Tuesday meeting are documenting failure. Plants that surface KPI drift in real-time dashboards are preventing it.

Leading vs. Lagging Indicators: The Distinction That Changes Strategy

Every maintenance KPI falls into one of two categories, and confusing them is the most expensive mistake reliability programs make. Lagging indicators — MTBF, MTTR, downtime hours, total maintenance cost — measure what already happened. They are essential for trend analysis and executive reporting, but you cannot change yesterday's failure by staring at last month's MTBF. Leading indicators — PM compliance, schedule adherence, backlog age, wrench time — predict what will happen next. When leading indicators improve, lagging indicators follow within 60–120 days. A program that tracks only lagging metrics is permanently reactive.

Lagging Indicators
What Already Happened
  • MTBF — equipment reliability outcome
  • MTTR — repair speed outcome
  • Asset Availability — uptime outcome
  • OEE — production efficiency outcome
  • Maintenance Cost % of RAV — financial outcome
  • Unplanned Downtime Hours — disruption outcome
Use for: executive reporting, trend analysis, investment justification
Leading Indicators
What Will Happen Next
  • PM Compliance — proactive discipline
  • Schedule Adherence — planning quality
  • Backlog Week-Supply — workload balance
  • Wrench Time — workflow efficiency
  • Planned vs. Reactive Ratio — strategy mix
  • Work Order Aging — execution velocity
Use for: weekly operational decisions, root-cause action, forecasting

Plants that balance both tiers cut unplanned downtime by 30–45% within two quarters. Book a demo to see how iFactory surfaces leading indicators alongside lagging ones on a single dashboard.

The 15 Maintenance KPIs That Actually Prove Reliability Is Improving

The metrics below are organized into four tiers — reliability, execution, financial, and asset-level — covering the full closed loop from equipment health to financial outcome. Each KPI includes its formula, world-class benchmark, and the decision it should drive when it crosses a threshold. Book a demo to see all 15 surfaced live in the iFactory CMMS dashboard against your own asset data.

The 15 KPIs — Formula, Benchmark, and Decision Trigger
# KPI Formula World-Class Benchmark Decision Trigger
1MTBFOperating hours ÷ failuresAsset-class specific; trend up month-over-monthDeclining 3 months → reassess PM scope
2MTTRTotal repair time ÷ repair eventsUnder 4 hours for most assetsOver 6 hours → audit parts, SOPs, training
3Asset Availability(Uptime ÷ scheduled time) × 10095%+ for critical assetsBelow 90% → critical reliability review
4OEEAvailability × Performance × Quality85%+ world-class; 60% industry avgDrop 5+ points → loss-cause analysis
5PM CompliancePMs completed on time ÷ PMs scheduled90%+ (within 10% interval window)Below 85% → expect MTBF decline in 60–90 days
6Schedule AdherenceScheduled work completed ÷ planned85%+ attainedBelow 70% → planning process is broken
7Planned Maintenance %Planned hours ÷ total maint. hours80%+ planned, under 20% reactiveBelow 70% → firefighting mode
8Wrench TimeActual repair time ÷ technician shift50%+ on toolsBelow 35% → fix kitting, mobile, parts staging
9Backlog Week-SupplyTotal backlog hours ÷ weekly capacity4–6 weeksOver 8 weeks → work not closing; under 2 → no runway
10Emergency Work %Emergency WOs ÷ total WOsUnder 10%Over 20% → reliability program failing
11Maint. Cost % of RAVAnnual maint. cost ÷ replacement asset value2–4% for manufacturingOver 5% → aging fleet or reactive culture
12Maint. Cost / UnitTotal maint. cost ÷ units producedTrend down quarter-over-quarterRising while volume stable → cost leak
13Spare Parts Inventory TurnoverCOGS of parts ÷ avg inventory value2–4 turns/year for MROBelow 1 → obsolete stock; above 6 → stockout risk
14Mean Time To Detect (MTTD)Avg time from anomaly to alertUnder 15 minutes with IoT sensorsHours-long detection → invest in condition monitoring
15Overdue PM AgingDays past scheduled completionZero PMs over 7 days lateAny PM over 14 days → reliability risk
See These 15 KPIs Live on Your Plant Data — Free 30-Minute Dashboard Walkthrough →

The KPI Maturity Curve: Where Your Plant Sits Today

Reliability programs don't jump from spreadsheets to predictive AI overnight. They move through a measurable maturity curve — and knowing your current stage is the single most important diagnostic before investing in new tools. The four-stage progression below maps the typical journey from reactive to prescriptive maintenance, with the KPI capability and business outcome at each level. Book a demo to benchmark your facility against this curve in a 30-minute working session.

Stage 1
Reactive
No formal KPIs. Maintenance happens after failure. Spreadsheets capture work orders weeks late.
Available Data: downtime estimates, parts spend
Typical OEE: 35–55%
Stage 2
Planned
CMMS deployed. PM schedules formalized. Lagging KPIs tracked monthly. Reliable MTBF and MTTR.
Available Data: MTBF, MTTR, PM compliance, downtime
Typical OEE: 55–70%
Stage 3
Condition-Based
IoT sensors feed real-time KPI dashboards. Leading + lagging indicators live. PMs triggered by condition, not calendar.
Available Data: full 15-KPI dashboard, asset-level drill-down
Typical OEE: 70–82%
Stage 4
Predictive & Prescriptive
AI models predict failures 14–21 days ahead. KPIs evolve from "Downtime Hours" to "Downtime Prevented." Prescriptive analytics recommend interventions.
Available Data: RUL forecasts, prescriptive work orders, energy-per-asset
Typical OEE: 82–92%

The 6 Most Common KPI Tracking Mistakes — And How to Avoid Them

Tracking the right KPIs incorrectly is worse than not tracking them at all — it creates false confidence that masks deteriorating reliability. These are the six mistakes that consistently appear in maintenance audits across U.S. manufacturing plants, with the corrective discipline for each. Book a demo to get an audit-ready KPI configuration that prevents every mistake on this list.

01
Counting Late PMs as Compliant
A PM completed 3 weeks past schedule does not prevent the failure it was designed to prevent. Always enforce the 10% interval rule: a monthly PM must be completed within ±3 days to count as compliant.
02
Excluding Parts-Wait from MTTR
If a pump is down 14 hours but the team waited 12 hours for a seal, MTTR is 14 hours — not 2. Excluding logistics time hides the real recovery bottleneck and protects broken spare-parts processes.
03
Reporting Plant-Wide Averages
Plant-wide average MTBF of 500 hours looks acceptable — until you discover one critical conveyor runs at 45 hours while the rest average 700. Always drill down asset-by-asset for any critical equipment class.
04
Mixing PM and Corrective in MTTR
Scheduled PM is not a repair event. Including planned PM duration in MTTR inflates the metric and removes the signal about unplanned-repair performance. Track corrective MTTR separately.
05
Tracking Only Lagging Metrics
Reviewing MTBF and downtime monthly means you're managing failures that already happened. Pair every lagging KPI with the leading indicator that predicts it: PM compliance predicts MTBF, schedule adherence predicts backlog, wrench time predicts cost.
06
No Owner or Threshold per KPI
A KPI on a dashboard without a named owner and a defined action threshold is decoration, not management. Every metric should answer: who acts on it, at what value, within what timeframe. Without these three fields, KPI reviews become passive reporting instead of active reliability work.

How iFactory AI Turns 15 KPIs Into One Decision Loop

Tracking 15 metrics in spreadsheets compounds work; tracking them in iFactory AI compounds insight. The platform's CMMS and EAM modules calculate every KPI in real time from work-order, asset, and IoT sensor data — then surface drill-downs, automated alerts, and role-specific dashboards so every level of the organization sees what they can act on. Plant managers see availability and cost. Maintenance supervisors see backlog and schedule adherence. Technicians see wrench time and overdue PMs on mobile. Book a demo to see your own assets mapped to this framework.

Real-Time KPI Dashboard
  • All 15 KPIs updated from live work-order data
  • Asset-level drill-down on every metric
  • Role-based views for executives, managers, technicians
  • 30 / 90 / 365-day trending with anomaly highlight
Automated Threshold Alerts
  • PM compliance under 85% → supervisor escalation
  • MTTR exceeds 4 hours → root-cause workflow
  • Backlog crosses 8 weeks → capacity planning alert
  • Overdue PM beyond 14 days → reliability risk flag
Predictive KPI Forecasting
  • AI flags KPIs trending toward failure thresholds
  • 14–21 day predictive failure lead time on critical assets
  • "Downtime Prevented" metric for proactive reporting
  • Prescriptive recommendations on PM scope adjustments
Expert Review

"The KPI Itself Is Worthless. The Decision It Triggers Is Everything."

In our work with U.S. manufacturers across automotive, food & beverage, and discrete assembly, the pattern is consistent: plants that fail to improve reliability are almost never short on data. They're short on the discipline of connecting each KPI to a pre-defined decision threshold and an owner. A 92% PM compliance number sitting in a Tuesday review deck doesn't reduce downtime — but the same number triggering an automated workflow when it drops to 84% does. The fastest reliability gains we see come not from new sensors or new software, but from defining the threshold and owner for every metric on the dashboard before adding a single new measurement. The 15 KPIs in this guide are not aspirational. They're the minimum viable scoreboard. The work begins after the dashboard is live.

— iFactory AI Reliability Engineering Practice

Conclusion: Build the Dashboard, Then Build the Discipline

The 15 KPIs in this guide are not a menu — they are a closed-loop system. Leading indicators (PM compliance, schedule adherence, wrench time, backlog) predict the lagging outcomes (MTBF, MTTR, availability, OEE), which in turn drive the financial metrics (cost per unit, maintenance % of RAV). Skipping any tier breaks the loop. Tracking them in spreadsheets breaks the velocity. The plants that will win on margin and reliability in 2026 are the ones treating their maintenance KPI dashboard as the most important operating document in the facility — reviewed weekly at the supervisor level, monthly at the plant-manager level, and quarterly at the executive level, with thresholds, owners, and actions defined for every metric before the report is generated.

iFactory AI provides the integrated CMMS, EAM, and predictive maintenance platform that makes this closed loop possible without a six-month implementation. Every KPI in this guide is calculated automatically, drilled down by asset, and escalated when it crosses a threshold — so your team manages reliability instead of compiling reports about it.

Maintenance KPI Audit — Complimentary for Qualified U.S. Plants
Stop Reporting on Failures. Start Predicting Them.

iFactory engineers will benchmark your current maintenance KPIs against world-class targets, identify the 3 leading indicators most likely to unlock reliability gains in 60–90 days, and map your assets into a live dashboard. Most audits complete in one week and surface $200K+ in recoverable downtime cost.

15
KPIs surfaced from day one
60–90 d
First measurable reliability gain
8–12 hr
Weekly admin time eliminated
$200K+
Avg downtime cost recovered Y1

Frequently Asked Questions

Which 3 maintenance KPIs should a plant start with if it's tracking nothing today?
Start with PM Compliance, MTBF, and MTTR. PM Compliance is the leading indicator that predicts reliability; MTBF measures how reliable your equipment actually is; MTTR measures how fast you recover when it fails. Together these three give the most complete picture with the least data-collection burden. Add Schedule Adherence and Wrench Time once those three are stable.
What's the difference between MTBF and MTTF — and when does each apply?
MTBF (Mean Time Between Failures) measures the time between repairable failures, and is the right metric for assets that get fixed and returned to service — pumps, motors, conveyors, compressors. MTTF (Mean Time To Failure) measures time until total irreparable failure, and applies to non-repairable components like bearings, bulbs, or sensors that are replaced rather than repaired. Most manufacturing dashboards rely on MTBF; MTTF is more common in component-level reliability engineering.
Is 85% OEE realistic for a typical U.S. manufacturing plant?
85% OEE is the world-class benchmark for discrete manufacturing and is achievable, but the industry average sits closer to 60%. What matters more than hitting a specific number is the trend direction — consistent monthly improvement from your own baseline is more valuable than chasing a benchmark. A 5-point OEE gain typically unlocks production capacity equivalent to adding an entire shift, with no capital investment required.
How often should each maintenance KPI be reviewed?
Cadence depends on the audience. Daily: overdue PMs, today's schedule, work-order completion (technician/supervisor level). Weekly: schedule compliance, backlog week-supply, emergency work %, wrench time (maintenance manager). Monthly: MTBF, MTTR, PM compliance, availability (plant manager). Quarterly: cost as % of RAV, cost per unit, OEE (executive strategic review). Consistency matters more than frequency — a KPI reviewed every Monday at 8am becomes a management tool.
Why is tracking KPIs in spreadsheets considered a failure mode?
Three reasons. First, spreadsheet KPIs are typically 15–30 days stale by the time anyone reviews them, so decisions are made on outdated data. Second, manual entry introduces errors that compound — when technicians log start and end times from memory at shift end, MTBF and MTTR accuracy drop by roughly 60% versus real-time mobile capture. Third, spreadsheets can store data but cannot trigger action — they can't escalate a falling PM compliance number or alert when backlog crosses a threshold. A CMMS calculates every KPI in real time from live work-order data and connects each metric to an automated decision workflow.

Share This Story, Choose Your Platform!