Downtime Tracking Checklist for Manufacturing Plants

Downtime tracking is the data discipline that converts unplanned production stops from operational anecdotes into an improvement-driving dataset. The difference between a plant that reduces downtime year-over-year and one that manages the same recurring failures in perpetuity is not the maintenance budget or the equipment age — it is the quality of the downtime data and the rigour of the review process applied to it. This downtime tracking checklist covers every element of a complete downtime management system: event capture with timestamps and reason codes, downtime categorisation and code structure, Pareto analysis, MTBF and MTTR tracking, and the action-ownership process that converts downtime data into reliability improvement.

3,600

Monthly searches for downtime tracking checklists

23%

Average unplanned downtime as % of available time in discrete manufacturing

Pareto

Top 20% of downtime causes typically account for 80% of lost production time

Auto-capture

iFactory captures downtime from machine signals — no manual entry, no gaps in the record

Automated Downtime Tracking

Capture Every Downtime Event Automatically and Link It to OEE

iFactory captures downtime events from machine PLCs or operator input, timestamps every start and stop, applies your reason code structure, and links each event to OEE Availability — no manual entry, no gaps.

Timestamped events with reason codes, linked to OEE Availability automatically

Book a Demo Support Call

Area 1

Downtime Definition — Agree on What You Are Measuring

Before a single downtime event is captured, the organisation must agree on what counts as downtime. The most common source of downtime data disputes is inconsistent definition: one supervisor counts changeovers as downtime, another records them as planned stops. One line operator logs every two-minute jam, another only logs stops that require maintenance intervention. These definition inconsistencies produce a downtime dataset that cannot be used for cross-line comparison, trending, or improvement targeting.

Defn · 01

Unplanned vs. Planned

Unplanned downtime: any stop that was not scheduled in the production plan for that shift. Planned downtime: scheduled maintenance, planned changeovers, breaks, no-order periods. Both must be tracked but in separate categories.

Defn · 02

Minimum Duration Threshold

Stops below the threshold (typically 2–5 minutes) are either excluded from downtime or tracked as minor stoppages in a separate category. Define the threshold before go-live and apply it consistently to every line.

Defn · 03

Changeover Separate from Breakdown

Changeover is a Six Big Losses Setup/Adjustment loss — it has a different improvement methodology (SMED) from equipment breakdown (predictive and preventive maintenance). Mixing them in the same downtime category obscures both.

Defn · 04

Operator-Caused vs. Equipment-Caused

An operator error that stops a machine is downtime. An equipment failure is downtime. Both count against Availability OEE but require different corrective actions. Code them separately from day one.

Defn · 05

Process and Material Stops

Material shortage, quality hold, and process engineering stop are downtime events — not "not downtime" because maintenance did not respond. They count against Availability and must be coded and tracked like any other downtime cause.

Defn · 06

Document the Agreed Definition

The downtime definition is written into the operating procedure for the line. New operators, new supervisors, and new maintenance technicians are trained on the definition before their first shift. Consistent understanding is the foundation of consistent data.

Area 2

Downtime Reason Codes — The Quality of Your Analysis Depends on This

Reason codes are the most important design decision in a downtime tracking system. A poorly designed reason code list — too generic, too long, poorly labelled, or missing key failure modes — produces a downtime dataset where 30% of events are coded "Other" and the Pareto is meaningless. A well-designed reason code list captures every actual failure mode in a two-level hierarchy that enables both high-level trend analysis and specific root cause investigation.

Build the Code List from Actual Data, Not a Generic Template

Before creating the reason code list, analyse three to six months of historical downtime records — whatever exists, even if incomplete. Identify the actual failure modes that occur on your equipment. The code list should include specific names your operators recognise, not generic engineering categories they have never heard.

Two-Level Hierarchy Is the Minimum

Level 1: broad category (Electrical, Mechanical, Tooling, Process, Material, Operator, Changeover). Level 2: specific cause within the category (Electrical → Servo drive fault; Mechanical → Bearing failure; Tooling → Insert wear). One-level codes do not support root cause analysis.

Monitor "Other" Usage Weekly

"Other" reason code usage above 5% means either the code list is missing a common failure mode, operators are not using the code list correctly, or both. Every week, review all "Other" entries and add specific codes for any that appear more than twice. Within 90 days, "Other" should represent less than 2% of events.

Make Code Selection Fast

If selecting a reason code takes more than 15 seconds, operators will use "Other" or skip the field entirely. Mobile-optimised code selection with search-as-you-type or a short list of most-used codes displayed first is critical for data quality at the operator level.

Area 3

Downtime Analysis — Pareto to Improvement Action

A downtime tracking system that captures data without generating weekly analysis and action is a record-keeping system, not a reliability improvement programme. The Pareto principle applies consistently in manufacturing downtime: 20% of failure modes account for 80% of lost production time. Finding and eliminating the top one or two downtime causes — with root cause analysis and permanent corrective action — produces more reliability improvement than addressing 20 minor causes simultaneously.

Daily Downtime Pareto

Generate a daily Pareto of downtime by reason code per line. Post it visibly in the production area or on the shift dashboard. The single top downtime cause for the day should be identifiable by any operator or supervisor at a glance.

MTBF Trending per Equipment Class

Mean Time Between Failures per equipment type reveals whether reliability is improving or degrading. A declining MTBF trend on a specific machine class is an early warning signal for predictive maintenance intervention before the failure frequency escalates.

MTTR Trending per Code Category

Mean Time To Repair reveals whether maintenance response efficiency is improving. High MTTR on a specific failure type indicates: wrong spare parts on hand, technician skill gap, diagnostic process too slow, or repair procedure not documented.

Repeat Failure Escalation

Any downtime cause that appears three or more times in 30 days on the same asset triggers a formal root cause analysis — not another reactive repair. Repeat failures are the most visible evidence of a systemic issue that reactive maintenance will never solve.

Action Ownership and Closure

Every improvement action from the downtime Pareto review has a named owner, a specific action description, and a completion date. Actions reviewed at weekly production meeting. Open actions older than the defined resolution window are escalated.

Downtime Analytics Platform

Automate Downtime Capture, Pareto, and OEE Link in iFactory

iFactory captures downtime from machine signals or operator input, calculates MTBF and MTTR per equipment class, generates daily and weekly Pareto, and links every event to OEE Availability.

Daily Pareto, MTBF, MTTR, and trend dashboards — automatic per shift

Book a Demo Support Call

Checklist

Downtime Tracking Checklist — 30 Items

Use this checklist when implementing or auditing a manufacturing downtime tracking programme. Items cover system setup, event capture, reason code structure, analysis cadence, action tracking, and data quality.

Setup Downtime Tracking Infrastructure 5 items

#	Checklist Item	Type	Priority	Photo	Required	Critical
1	Downtime definition agreed: unplanned stop only, or includes planned maintenance and changeovers	Pass/Fail	High	—	✓	✓
2	Minimum downtime threshold defined — stops below threshold (e.g. 2 min) excluded or tracked separately	Pass/Fail	High	—	✓	✓
3	Capture method selected: automatic machine signal, operator tablet entry, or supervisor log	Pass/Fail	High	—	✓	✓
4	Every machine/line in scope has a unique asset ID in the downtime tracking system	Pass/Fail	High	—	✓	✓
5	Downtime data linked to OEE Availability calculation — same data source, not separate entry	Pass/Fail	High	—	✓	✓

Capture Downtime Event Capture 6 items

#	Checklist Item	Type	Priority	Photo	Required	Critical
6	Start timestamp recorded at moment of stop — not estimated later	Pass/Fail	High	—	✓	✓
7	End timestamp recorded at moment of restart — not estimated at shift end	Pass/Fail	High	—	✓	✓
8	Operator or maintenance technician ID recorded with every downtime event	Pass/Fail	High	—	✓	✓
9	Reason code selected from structured list — "Other" usage tracked and kept below 5%	Pass/Fail	High	—	✓	✓
10	Equipment sub-system identified: electrical, mechanical, tooling, material, operator, process	Selection	High	—	✓	✓
11	Notes field used for first-occurrence descriptions — not relied on for repeat events	Pass/Fail	Med	—	✓	—

Codes Reason Code Structure 5 items

#	Checklist Item	Type	Priority	Photo	Required	Critical
12	Reason code list built from actual historical downtime — not generic template	Pass/Fail	High	—	✓	✓
13	Reason codes organised in 2-level hierarchy: category (equipment failure) → specific cause (spindle motor fault)	Pass/Fail	High	—	✓	✓
14	Maintenance, process, material, and operator-caused stops all in separate code families	Pass/Fail	High	—	✓	✓
15	Changeover and setup coded separately from unplanned downtime	Pass/Fail	High	—	✓	✓
16	Reason code list reviewed and updated quarterly — new failure modes added promptly	Pass/Fail	Med	—	✓	—

Analysis Downtime Analysis & Pareto 5 items

#	Checklist Item	Type	Priority	Photo	Required	Critical
17	Daily downtime Pareto generated per line — top 3 causes visible at shift review	Pass/Fail	High	—	✓	✓
18	Weekly downtime Pareto reviewed in production meeting — top cause actioned	Pass/Fail	High	—	✓	✓
19	MTBF (Mean Time Between Failures) tracked per equipment class	Pass/Fail	High	—	✓	✓
20	MTTR (Mean Time To Repair) tracked per downtime category	Pass/Fail	High	—	✓	✓
21	Repeat downtime events (same asset, same reason code, 3+ times in 30 days) trigger formal RCA	Pass/Fail	High	—	✓	✓

Action Downtime Action Tracking 5 items

#	Checklist Item	Type	Priority	Photo	Required	Critical
22	Every downtime event above threshold has an assigned maintenance response — open or closed	Pass/Fail	High	—	✓	✓
23	Chronic downtime causes (top 3 by frequency) have active improvement projects	Pass/Fail	High	—	✓	✓
24	Downtime reduction targets set per line — not only global OEE targets	Pass/Fail	Med	—	✓	—
25	Improvement actions linked to specific downtime reason codes — not general "improve reliability"	Pass/Fail	High	—	✓	✓

Quality Downtime Data Quality 4 items

#	Checklist Item	Type	Priority	Photo	Required	Critical
26	Monthly data quality audit: manual entries spot-checked against machine signals	Pass/Fail	Med	—	✓	—
27	"Other" reason code usage below 5% — any higher triggers code list review	Pass/Fail	High	—	✓	✓
28	No unresolved gaps in downtime timeline — every shift has complete start/end accounting	Pass/Fail	High	—	✓	✓
29	Downtime data accessible to both maintenance and production teams — not siloed	Pass/Fail	High	—	✓	✓
30	Downtime trend visible 13 weeks rolling — seasonal and campaign effects identifiable	Pass/Fail	Med	—	✓	—

Types: Pass/Fail Numeric Text Selection Priority: High Med Toggles: ✓ Required ✓ Yes — No

FAQ

Frequently Asked Questions

What is downtime tracking in manufacturing?

Downtime tracking is the systematic recording of every production stop — its start time, end time, duration, equipment, and cause — to produce a dataset that enables reliability analysis, OEE calculation, and maintenance prioritisation. Effective downtime tracking requires agreed definitions, structured reason codes, timestamped data capture, and a regular review process that converts downtime data into improvement actions. Without downtime tracking, maintenance operates reactively and OEE Availability cannot be calculated accurately.

What is the difference between planned and unplanned downtime?

Planned downtime includes scheduled stops that are known in advance and built into the production schedule: planned maintenance windows, scheduled changeovers, breaks, and no-order periods. Planned downtime is excluded from the OEE Availability denominator. Unplanned downtime is any stop that was not scheduled — equipment breakdown, material shortage, quality hold, tooling failure, or operator error. Unplanned downtime is the Availability loss in OEE and is the target for downtime reduction programmes.

What are downtime reason codes and why do they matter?

Downtime reason codes are the structured classification system that turns a downtime duration into actionable diagnostic data. Without reason codes, you know how much time was lost but not why. With a well-designed two-level reason code hierarchy, you can Pareto downtime by cause, calculate MTBF per failure mode, identify repeat failures that require root cause analysis, and separate maintenance-driven losses from process and material-driven losses. The quality of the reason code list determines the quality of every downtime analysis your organisation will ever produce. Book a Demo to see iFactory's reason code configuration.

What is MTBF and MTTR in downtime tracking?

MTBF (Mean Time Between Failures) is the average time between unplanned stop events for a specific piece of equipment or equipment class. A declining MTBF trend means failures are becoming more frequent — a signal for predictive or preventive maintenance intervention. MTTR (Mean Time To Repair) is the average time from when a failure occurs to when production restarts. High MTTR indicates maintenance response or repair effectiveness issues — wrong spares, skill gaps, or undocumented repair procedures. Both metrics are calculated automatically by iFactory from timestamped downtime event data.

How does iFactory automate downtime tracking?

iFactory connects to machine PLCs, sensors, or operator input devices to capture downtime events automatically at the moment they occur — no end-of-shift manual entry. Each event is timestamped, assigned to the correct asset, and presented to the operator for reason code selection on a mobile device. The system calculates MTBF and MTTR per equipment class, generates daily and weekly Pareto automatically, and links every downtime event to the OEE Availability calculation. Book a Demo to see the downtime module.

Start Tracking Downtime Correctly

Replace Manual Downtime Logs with Automated iFactory Downtime Tracking

iFactory captures every downtime event with timestamp and reason code, links it to OEE, calculates MTBF and MTTR, and generates daily and weekly Pareto to eliminate chronic losses.

Automatic capture from PLC signals — no manual entry, no gaps

Book a Demo Support Call

Greenfield Industrial Project Execution: Best Practices and Consulting Insights

Greenfield Project Consulting: Strategy, Planning and Value Creation

Greenfield Industrial Consulting Services | Smart Factory Advisory

How Digital Twins Are Revolutionizing Greenfield Factory Design in 2026

Greenfield Factory Layout & Engineering Advisory | Plant Planning Experts

AI-Powered Predictive Maintenance for Greenfield Plants: Complete Implementation Guide

Downtime Tracking Checklist for Manufacturing Plants