A new AI framework called EAGLE can predict parcel delivery delays proactively by modelling both the timing of order flows and the relational structure of logistics hubs — a combination that prior systems have largely failed to exploit.
Most existing delay-prediction tools treat logistics data either as rows in a table or as time-series signals, missing the web of dependencies between warehouses, transport lanes, and distribution centres. The paper, posted to ArXiv in April 2025, argues that this fragmentation leaves supply chain operators responding to disruptions rather than anticipating them.
Why Graph Structure Changes the Prediction Problem
Supply chains are, by nature, networks. A delay at one hub ripples through connected nodes in ways that tabular models — which treat each shipment independently — cannot capture. EAGLE addresses this by pairing two complementary components: a Transformer patch encoder that reads sequences of order-flow data at each node, and an Edge-Aware Graph Attention Network (E-GAT) that models how hubs relate to and influence one another.
The edge-aware element is significant. Standard graph neural networks assign importance to nodes; E-GAT extends that attention mechanism to the edges — the transport lanes themselves — weighting connections by their operational characteristics rather than treating all links as equal.
The framework jointly models temporal order-flow dynamics and inter-hub relational dependencies, a combination that prior approaches have treated as separate problems.
The two components are trained together under a multi-task learning objective, meaning the model optimises for both accurate delay classification and stable generalisation simultaneously, rather than tuning one at the expense of the other.
Performance on a Real-World Dataset
The researchers evaluated EAGLE on the DataCo Smart Supply Chain dataset, a publicly available real-world logistics benchmark containing order timestamps, routing records, and shipping manifests. Across this dataset, the framework achieved an F1-score of 0.8762 and an AUC-ROC of 0.9773, according to the authors — both figures representing improvements over the baseline methods tested.
It is worth noting that these benchmarks are self-reported by the paper's authors and have not yet undergone peer review, as is standard for ArXiv preprints.
Stability is a recurring concern in deep learning deployments, where a model that performs well on average but varies wildly across training runs is difficult to trust in production. EAGLE's authors measured performance across four independent random seeds and recorded a cross-seed F1 standard deviation of just 0.0089 — which they describe as a 3.8 times improvement in stability over the best ablated variant of their own model. For logistics operators weighing whether to rely on a predictive system, consistency across runs matters as much as peak accuracy.
What the Hybrid Architecture Solves
The core contribution is architectural. Prior work has divided into two camps: machine learning models that ingest tabular shipment records, and anomaly-detection systems that track time-series signals like transit times or warehouse throughput. Each captures something real, but neither sees the full picture.
Tabular classifiers, for instance, can learn that certain shipping routes are historically slow, but they cannot reason about what happens when a major hub upstream goes offline. Time-series detectors can flag unusual patterns at a single node but lack the graph structure to trace how a disruption propagates downstream.
EAGLE's hybrid design attempts to hold both views at once. The Transformer encoder — described by the authors as lightweight, suggesting a focus on computational efficiency — processes the temporal rhythm of orders moving through each node. The E-GAT layer then aggregates signals across the network, allowing the model to ask not just "is this shipment late?" but "is it late because of something happening elsewhere in the graph?"
Implications for Supply Chain Operations
If the results hold under independent replication, the practical implications are meaningful. Proactive delay prediction — flagging a likely late delivery days before it occurs — allows operators to reroute shipments, notify customers early, or pre-position inventory. Reactive systems, by contrast, alert teams only once a delay has materialised, limiting the available responses.
The logistics sector has faced sustained pressure on delivery reliability, driven by demand volatility, carrier capacity constraints, and structural fragilities exposed by pandemic-era disruptions. Tools that surface risk earlier in the pipeline, rather than confirming it after the fact, align with industry investment in predictive operations.
The framework's reliance on the DataCo dataset, while appropriate for a research benchmark, also raises questions about generalisation. Real logistics networks vary enormously in topology, data quality, and operational cadence. Whether EAGLE's architecture transfers cleanly to proprietary datasets from large carriers or retailers would require further validation.
What This Means
For logistics operators and supply chain technology teams, EAGLE represents a concrete methodological advance — treating delay prediction as a graph problem rather than a tabular one — though independent validation on diverse, real-world networks will be necessary before the approach can be considered production-ready.