Greenfield AI Vision Camera SLA Design Guide | 99.4% Accuracy Target

By Riley Quinn on June 27, 2026

greenfield-ai-vision-camera-sla-design-99-4-accuracy-target-engineering

A vendor quoting "99% accuracy" is quoting a lab benchmark — measured under controlled lighting on a curated image set. When that system reaches your factory floor, accuracy emerges from camera placement, lighting geometry, training quality, FP/FN budget, and the retraining cadence written into the contract. The 99.4% target is an engineered outcome, achievable when the SLA specifies the right metrics, measurement methodology, and remediation obligations before the first camera ships.

See how iFactory engineers 99.4% accuracy into your vision SLA — FP/FN budgets, retraining cadence, and measurement protocol designed for your production line before contract signature.

SLA Engineering Blueprint
99.4% Is a Specification — Not a Claim
Every accuracy target needs four sub-specs. Without them you cannot measure whether the vendor is meeting the SLA.
99.4%
Detection Rate
994 / 1,000 defects caught

True Positives ÷ (TP + False Negatives)

Six defects escape per 1,000. That escape rate determines your warranty exposure and recall probability — not the headline number.

SLA must state: ≥99.4% measured on production samples — not lab images
≤0.5%
False Positive Rate
5 false rejects / 1,000 parts

False Positives ÷ (TN + False Positives)

At 2% FP on a 1,000 parts/hr line: 20 false rejects per hour. Operators start overriding the system. Quality protection disappears.

SLA must state: ≤0.5% at actual production throughput, measured weekly
<200ms
P99 Inference Latency
Camera trigger → reject gate

Worst-case 1% of decisions, not average

At 60 parts/min, 500ms latency misfires the gate onto the wrong part. P99 — not mean — is what causes production errors.

SLA must state: P99 ≤200ms under full production load
99.5%
System Uptime
43.8 hrs uninspected / year

Inspection minutes ÷ Production minutes × 100

Every uninspected minute is either a quality escape or a throughput hold. The SLA must define the fallback protocol.

SLA must state: ≥99.5% per month with defined fallback inspection
A contract specifying only "99% accuracy" with no FP rate, latency ceiling, uptime floor, or measurement methodology is not an SLA — it is a marketing phrase with a percentage attached. Reject it and require all four components before signing.

The FP/FN Budget: Right Trade-Off for Your Production Context

Every AI vision SLA involves a trade-off: higher sensitivity catches more defects but increases false rejects. The correct balance depends on the cost consequence in your specific production context — and that profile is different for an automotive safety part, food packaging, and a consumer electronics enclosure.

Production Context
Cost of 1 False Negative
Cost of 1 False Positive
SLA Budget
Automotive safety Airbag, brake, steering
$10M+ recall / liability
$5–$50 rework
Detection ≥99.99%, FP up to 5%
Pharmaceutical packaging FDA regulated
$50M+ recall / patient harm
$2–$20 repackage
FN rate ≤0.001%, FP up to 3%
Food & beverage Label / seal inspection
$500K–$2M brand recall
$0.50–$2 product loss
iFactory default: ≥99.4% / ≤0.5%
Consumer electronics Enclosure / cosmetic
$50–$200 cosmetic return
$200 high-value part scrapped
FP ≤0.2%, Detection ≥99%
General discrete Surface / dimensional
$100–$500 warranty claim
$50–$200 rework
iFactory default: ≥99.4% / ≤0.5%

Not sure which FP/FN budget fits your line? Book an SLA design session with iFactory — we calculate cost-per-FP and cost-per-FN for your production context and specify the accuracy budget that minimizes total quality cost.

How 99.4% Is Engineered — The Five Layers

Accuracy is not a model property. It is the output of five engineering layers working together. Lab benchmarks of 99.9% routinely drop to 92–95% in the first weeks of production because one of these layers was not specified in the deployment contract.

L1

Camera & Lighting Specification

Resolution, working distance, and lighting geometry must be calculated to achieve minimum pixel size per defect class. A 0.2mm scratch on a 200mm part requires ≤0.1mm/px — constraining lens focal length and sensor size before a camera brand is chosen. Lighting type (dark-field, coaxial, dome) determines whether the defect is even visible to the model.

SLA must specify: Min pixel size per defect class, camera model, working distance, and lighting type
Failure mode: Camera specified without resolution calculation — sub-mm defects invisible regardless of model quality
L2

Training Data Volume & Quality

Production-grade models require 200–2,000 labeled images per defect class covering the full range of production variation. Data augmentation multiplies effective training set. Models trained on under 100 images per class routinely fail to generalize to real production variance — lighting shifts, different material lots, tool wear.

SLA must specify: Minimum images per defect class, augmentation strategy, defect definition per class
Failure mode: Lab-curated images only — accuracy drops 15–30% from benchmark when production lighting varies
L3

Shadow Running & Go-Live Validation

Before the system controls the reject gate, it must run in shadow mode alongside existing inspection for a defined period — typically 1–2 weeks across 10,000+ real parts. Shadow mode produces the only real measure of production accuracy. The go-live decision must be gated on shadow performance, not lab benchmark.

SLA must specify: Minimum shadow duration, minimum sample size (≥10,000 parts), accuracy threshold before handover
Failure mode: Live handover without shadow — first accuracy failure discovered via customer return, not comparison log
L4

Retraining Cadence & Trigger Criteria

Models degrade when conditions change: new material lots, tooling changes, lighting drift, seasonal temperature shifts. The SLA must specify scheduled retraining (monthly in Year 1, quarterly thereafter) and trigger-based retraining when accuracy drops below a defined floor between cycles.

SLA must specify: Scheduled cadence, trigger threshold, and max time from trigger to redeployment
Failure mode: No retraining obligation — model silently degrades for months, FP rate doubles unnoticed
L5

Continuous Measurement & Remediation

An SLA is only enforceable if accuracy is measured independently in production — not self-reported by the vendor. Weekly statistical sampling of AI decisions against gold-standard inspection is the only valid methodology. Remediation obligations must specify maximum response time and consequence when the threshold is missed.

SLA must specify: Weekly measurement methodology, dashboard access, and remediation consequence for breach
Failure mode: Vendor self-reported accuracy — SLA is unverifiable, breach is unprovable

Need all five layers specified for your line? Talk to iFactory's SLA design team — we write the accuracy specification, retraining cadence, and measurement protocol into your vision contract before a camera ships.

99.4% Accuracy Engineered In — With the SLA to Prove It

iFactory's AI vision SLA specifies detection rate, FP budget, inference latency, system uptime, shadow validation protocol, retraining cadence, and remediation obligations — all before deployment. The accuracy you contract is measured weekly, in production, on your parts.

SLA Vendor Scorecard: 10 Requirements Before You Sign

The gap between an accuracy claim and an accuracy commitment is a contract clause. Use this checklist to evaluate any AI vision proposal. Reject vendors that cannot commit to all ten.

Detection rate ≥X% as a contractual number Without a number, there is no measurement and no breach. "High accuracy" is not a commitment.
False positive rate ≤X% specified separately 99% detection with 5% FP is operationally worse than 98% detection with 0.5% FP at most volumes.
P99 inference latency ceiling stated Average latency is irrelevant — the worst-case 1% determines whether the reject gate fires on the right part.
System uptime ≥X% per month Every uninspected minute during downtime is either an escape risk or a throughput cost that must be defined.
Shadow validation protocol before go-live The only way to measure production accuracy before handover. Lab and production accuracy always diverge.
Minimum training images per defect class Models trained on under 100 images per class fail to generalize. The SLA must commit to minimum volumes.
Scheduled retraining cadence Without a schedule, retraining happens reactively after accuracy has already degraded visibly in production.
Trigger-based retraining threshold Specifies the accuracy drop that mandates emergency retraining between scheduled cycles.
Independent accuracy measurement methodology Vendor self-reported accuracy cannot be the basis of an enforceable SLA. Sampling must be independent.
Remediation obligation when accuracy falls below threshold An SLA with no consequence clause for breach is not an SLA — it is a performance aspiration.

Want this scorecard applied to a vendor proposal? Book an SLA review with iFactory — we evaluate any AI vision vendor proposal against these ten requirements and identify gaps before you sign.

Expert Perspective

The most dangerous deployment is not the one that fails at launch — it is the one that hits 97% at go-live, never gets retrained, and silently drifts to 91% over eight months. By then, operators have normalized the false rejects, override rates are part of the job description, and the defect escape rate has doubled. The SLA that prevents this is not the one specifying 99.4% at go-live. It is the one specifying 99.4% continuously, measured weekly, with a retraining trigger and a consequence clause for breach.
— iFactory AI Vision Engineering Team, Greenfield Manufacturing Quality Practice
37%

more critical defects caught by AI vs. expert human inspectors under optimal conditions

3–4 wk

to cross 99%+ accuracy via active learning after shadow validation go-live

1,900%

ROI at major steel plant — accuracy 70%→99.8%, $2M+ saved annually

Your Vision SLA Engineered Before the First Camera Ships

iFactory writes the SLA before deployment — specifying detection rate, FP budget, P99 latency, uptime floor, shadow validation, retraining cadence, and remediation obligations. The accuracy you contract is measured weekly in production on your actual parts — not on a benchmark dataset that never sees your factory floor.

Frequently Asked Questions

What is a realistic AI vision accuracy target for greenfield factories?

Production-grade AI vision achieves 99–99.8% detection with 0.1–0.5% FP rate when camera placement, lighting, and training data are correctly specified. Lab benchmarks of 99.9%+ routinely drop 2–5 points on the production floor before tuning — shadow mode comparison on 10,000+ real parts is the only valid accuracy measurement before handover.

What is the difference between false positive and false negative in vision inspection?

A false negative is a defective part passed as good — it escapes to the customer and drives warranty, recall, and liability cost. A false positive is a good part incorrectly rejected — it is scrapped or reworked. Which matters more depends on your consequence profile: safety-critical parts tolerate high FP rates to drive FN toward zero; high-value cosmetic parts require the opposite trade-off. Both must be contractually specified.

How often does an AI vision model need retraining?

Monthly in Year 1 as the model encounters the full production variation range; quarterly thereafter once a stable baseline is established. Trigger-based retraining fires when weekly measured accuracy drops below the contracted floor — typically 48–72 hours to retrained model deployment. Active learning accelerates this: every human correction enters the pipeline immediately, so models improve continuously between scheduled cycles.

What should a greenfield plant require in an AI vision RFP?

Require vendors to contractually commit to: detection rate as a minimum number per defect class; FP rate maximum; P99 latency ceiling; monthly uptime floor; shadow validation protocol with minimum sample size; minimum training images per defect class; scheduled and trigger-based retraining obligations; independent weekly accuracy measurement; and a remediation consequence clause. Any vendor declining to commit to any of these ten is declining accountability for the SLA they are marketing.

How does active learning improve AI vision accuracy after go-live?

Every human correction — false reject or missed defect — is automatically captured and added to the retraining dataset. Models typically improve from 90–92% at initial deployment to 99%+ within 3–4 weeks as they learn the full range of real production variation. A realistic SLA specifies a ramp: 95%+ at go-live, 97%+ within 2 weeks, 99.4%+ from week 6 onward — then holds that floor continuously with weekly measurement.


Share This Story, Choose Your Platform!