Machine Learning for Traffic Volume Prediction on Highways

Capacity planning for a highway used to be an exercise in historical averages and the engineer's judgment. Pull the average annual daily traffic from the last five years, add a growth assumption, build for the projected demand, and hope the projection holds. It usually didn't. Storm patterns shifted. Land use changed. A nearby corridor closed for a year and reshaped flow for a decade. The result was infrastructure that was either under-capacity for the demand it actually received or over-capacity for demand that never showed up — both expensive failures, both rooted in forecasts that assumed the future would look like the past. Machine learning changes the forecast. Modern models combine sensor telemetry, weather feeds, calendar effects, road network topology, and historical patterns into predictions that operate at three useful horizons: short-term (5 minutes to 1 hour ahead) for live operational control, medium-term (1 hour to 24 hours) for daily planning and incident management, and long-term (days to weeks) for capacity, work zones, and asset planning. Published research now reports accuracy that traditional ARIMA models couldn't approach: MAPE around 14% on 6-minute predictions, R² above 0.99 on daily volume forecasts, and improvements continuing as graph neural networks learn the spatial relationships between sensors. Get In Touchto see how ifactoryai brings the model stack, the data pipeline, and the operational integrations together — designed for transport authorities planning capacity and managing live traffic at the same time.

LSTM · GRU · CNN-LSTM · GCN · Graph Neural Networks · Spatio-Temporal AI

Forecast Traffic Volume the Way the Best Transport Authorities Already Do It.

iFactory operationalizes the deep-learning model stack — turning historical sensor data, weather feeds, and network topology into capacity-planning forecasts that defend a decade of investment.

Get In Touch Book a Demo

The Three Forecast Horizons — and Why You Need All Three

Highway traffic prediction isn't one problem. It's three problems with three different time scales, three different model families, and three different operational uses. A platform that solves one horizon and ignores the others delivers a fraction of the available value.

Horizon 01

5 min – 1 hr

SHORT-TERM

Live Operational Control

Ramp metering, dynamic signal timing, variable message sign content, lane control. The forecast that turns "what's happening on the road right now" into the next 30-minute response.

Typical Model: LSTM / GRU / CNN-LSTM

Horizon 02

1 – 24 hrs

MEDIUM-TERM

Daily Planning & Incident Mgmt

Tomorrow's peak hour shape. Event-day routing. Pre-positioning crews and incident response. The horizon where operations leadership plans against expected demand.

Typical Model: GCN-LSTM / GCGRNN

Horizon 03

Days – Weeks

LONG-TERM

Capacity & Asset Planning

Work-zone scheduling, lane addition justification, pavement maintenance windows, environmental impact projections. The horizon where capital planning gets defended.

Typical Model: FCN-LSTM / Spatio-Temporal Hybrid

The Evolution of Traffic Forecasting Models

The history of traffic volume prediction is also a history of what counts as state-of-the-art in machine learning. Each generation captured something the previous one couldn't. Understanding the lineage matters because the right model for a given problem isn't always the newest — it's the simplest that captures the relationships in the data.

Generation

2000s

Parametric & Time-Series (ARIMA, Kalman)

Strong baseline; still the benchmark in published research. Captures temporal patterns at a single sensor. Limitation: blind to spatial relationships between sensors and to non-linear interactions.

Generation

2010s early

Classical ML (SVM, k-NN, Bayesian, Random Forest)

Non-parametric methods that handled non-linearity better than ARIMA but still treated each sensor mostly independently. Useful baselines, occasionally still the right choice for low-data deployments.

Generation

2015 onward

Recurrent Deep Learning (LSTM, GRU, Stacked Autoencoder)

The temporal-dependency breakthrough. LSTM and its variants consistently outperform random walk, SVM, and feedforward networks on traffic time-series. Still single-sensor or limited spatial awareness.

Generation

2018 onward

Hybrid Spatial-Temporal (CNN-LSTM, FCN-LSTM)

CNN layers capture spatial correlation between adjacent road segments; LSTM/GRU layers model the temporal dimension. The architecture that broke the single-sensor ceiling — published research reports R² above 0.99 on daily volume.

Generation

Current

Graph Neural Networks (GCN, GCGRNN, T-GCRNN, GAT-LSTM)

The state of the art. Graph convolutional networks model the actual road topology as a graph — sensors are nodes, road segments are edges. Captures upstream/downstream dependencies that CNN approximations miss. The model family driving the next leap.

Model Selection · Data Ingestion · Accuracy Validation

See the Right Forecast Model Configured for Your Network's Demand Profile

iFactory selects, trains, and operationalizes the model family that fits your sensor density and data history — and validates accuracy against the same benchmarks published research uses.

Get In Touch Book a Demo

The Six Data Inputs That Determine Forecast Quality

Model architecture matters, but data matters more. The defining difference between a published research model that achieves 99% accuracy and a deployed one that doesn't is almost always the data pipeline. Six input categories drive forecast quality — and any gap in this stack caps the model's potential regardless of architecture.

Input 01

Sensor Volume & Speed History

Inductive loops, radar units, AVI tag readers, video analytics counts. At least 12 months of complete history per sensor. The foundation everything else builds on.

Input 02

Road Network Topology

The graph structure that connects sensors — upstream/downstream relationships, lane counts, on-ramps, off-ramps, intersections. Required for any graph-based model.

Input 03

Weather & Atmospheric Data

Rainfall, temperature, visibility, wind. Volume sensitivities to weather are sharp and non-linear — and the highest-accuracy models include this feature explicitly.

Input 04

Calendar & Event Effects

Day-of-week, school calendar, holidays, special events. The features that capture why a "normal Tuesday" forecast fails on the Tuesday after a long weekend.

Input 05

Live Incident & Construction Feed

Active work zones, current crashes, lane closures. The features that prevent the model from happily predicting yesterday's normal volume on a corridor that's down a lane today.

Input 06

Sensor Reliability Metadata

Which detectors are failing, drifting, or offline. Published research shows that the best models maintain accuracy through 1–20% detector failure when they know which sensors to trust.

Accuracy Benchmarks: What the Best Models Currently Achieve

"Accurate" is meaningless without the benchmark. The four metrics below are the standard reporting language across published research, and these are the values transport authorities should expect from a production deployment — and should require their vendors to demonstrate against held-out data, not on the training set.

Metric	What It Measures	Best Reported Range
MAE (Mean Absolute Error)	Average absolute deviation in vehicles per period	~5 vehicles per 6-minute window (Hong Kong study)
RMSE (Root Mean Sq Error)	Penalizes large prediction errors more than small ones	~7.6 vehicles per 6-minute window
MAPE (Mean Absolute % Error)	Percentage error — directly interpretable	~14% on short-term (research benchmark)
R² (Coefficient of Determination)	How much variance the model explains	Above 0.99 on daily volume (FCN-LSTM hybrid)

“

The conversation I have with every transport authority starts the same way: they think they need a better model. They almost never do. What they need is a cleaner data pipeline, complete sensor coverage on the corridors that matter, and an honest validation framework that tests the model on next month's data instead of last month's. The model improvements from G4 to G5 are real, but they're the last 5% of accuracy gains. The first 80% comes from data quality. Build the pipeline, then pick the model.

— Director of Transportation Analytics, State DOT — 20 Years — ITE Member, TRB Standing Committee on Traffic Flow Theory

From Prediction to Action: Where the Forecasts Actually Land

A forecast nobody uses is a research project, not an operating system. The value of ML traffic prediction sits in the operational systems the forecast feeds — and the best deployments wire the prediction directly into the systems that already drive transport decisions.

Operational Use 01

Capacity Planning & Investment Defense

Long-horizon forecasts feed the corridor business case. Lane additions, interchange upgrades, and ITS deployments get justified against forecasted demand rather than historical assumptions — and the same model validates the projection year after year.

Operational Use 02

Work Zone Scheduling & Pavement Windows

Medium-term forecasts identify the lowest-impact windows for lane closures. The week of the year, the hour of the day, the corridor segment — work crews go in when the model says the traveling public will notice least.

Operational Use 03

Live Ramp Metering & Signal Timing

Short-term forecasts feed ramp meter algorithms and adaptive signal control. The system stops reacting to congestion that has already formed and starts metering for the congestion forming in the next 15 minutes.

Operational Use 04

Incident Response & Resource Pre-positioning

When a forecast says traffic on a corridor will run heavy tomorrow, incident response crews position pre-emptively along the route — converting reactive deployments into ones aligned with predicted demand.

Conclusion

The history of highway capacity planning is the history of forecasts that were wrong because they didn't have the tools to be right. Five generations of machine learning models later, the tools exist. ARIMA gave us the temporal baseline. LSTM and GRU broke the linearity ceiling. CNN-LSTM hybrids added spatial awareness. Graph neural networks finally model the road network the way traffic actually flows through it. Used together with the six data inputs that drive forecast quality, the published accuracy benchmarks — MAPE around 14%, R² above 0.99 — are achievable for real transport authorities, not just for researchers. The platforms that operationalize this model stack turn highway forecasting from an exercise in historical averages into a forward-looking instrument that capacity planning, work-zone scheduling, ramp metering, and incident response all depend on.

iFactory's platform brings the model stack, the data pipeline, the operational integrations, and the validation framework into one production system — designed for transport authorities planning capacity and running traffic at the same time. Book a Demo to see the forecast configured for your network's sensor density and demand profile.

Frequently Asked Questions

How much sensor history do we need to deploy ML traffic prediction?

A minimum of 12 months of sensor-level data is the typical floor — enough to capture seasonal patterns, holiday effects, and one full school-year cycle. 24 to 36 months produces materially better forecasts because the model sees multiple instances of rare patterns (severe weather, major events, atypical school calendars). For corridors with less history, the model still works but the medium and long horizons widen their confidence intervals. iFactory's deployment assesses available history up front and recommends the right model family for the data actually available.

What happens when our detectors fail or go offline?

Detector failure is the central operational challenge for any deployed traffic ML system. Published research demonstrates that well-designed CNN-LSTM models maintain accuracy through 1% to 20% random failure rates by using cluster-based models that group sensors with similar profiles and substitute neighboring-sensor data when a detector goes offline. iFactory's pipeline includes detector reliability monitoring as Input 06 — and the platform tells operators which sensors need maintenance rather than silently degrading the forecast.

Are graph neural networks really worth the added complexity over LSTM?

It depends on the network. For a single corridor with low sensor density, a well-tuned LSTM or CNN-LSTM typically delivers 95% of the value of a GNN at a fraction of the engineering cost. For a network-wide deployment with dense sensors and strong upstream-downstream dependencies, GNNs deliver materially better multi-step, multi-sensor forecasts — and that's why models like GCGRNN, T-GCRNN, and GAT-LSTM consistently win published benchmarks on the PeMS Caltrans dataset. The right answer per deployment is empirical, not ideological.

How does the platform integrate with our existing ATMS, CAD, and modeling tools?

iFactory connects to Advanced Traffic Management Systems (ATMS) via standard NTCIP interfaces, to CAD and incident systems via standard CAD APIs, and to modeling environments (VISUM, Aimsun, EMME, TransModeler) via their native import formats. Forecasts can be exposed as REST endpoints for in-house dashboards, as GTFS-like data feeds for transit integration, and as NTCIP-conformant outputs for downstream signal and ramp meter controllers. The platform sits on top of your existing stack rather than replacing any of the operational systems your team already uses. Book a Demo for an integration map specific to your environment.

Highway capacity built on averages costs more and serves less. Forecasts built on data correct both at once.

iFactory brings the model stack — LSTM through Graph Neural Networks — into one production platform, designed for transport authorities planning capacity, scheduling work, and running live traffic at the same time.

Get In Touch Book a Demo