Published
Deep Learning Frameworks for Multi-Modal Data Fusion in Retail Supply Chains: Enhancing Forecast Accuracy and Agility
Published in October-December 2024 (Vol. 1, Issue 1, 2024)

Keywords
traffic flow forecastingintelligent transportation systemsspatio-temporal modelingmultimodal deep learningHaMDeepTshort-term traffic predictionattention mechanism1D CNNGRU3D CNN-GRUspatial-temporal correlationtraffic demand predictionmulti-modality traffic dataauxiliary deep learning architecturecorrelation layerrelative operation layertraffic sensor dataweather impact on trafficnetwork architecture optimizationST-Graph limitationtraffic data visualizationminiNYC datasetdeep learning for ITS
Abstract
Traffic flow forecasting is a key problem of intelligent transport systems and represents a challenging task due to the spatial-temporal correlation features and long temporal interdependence of the considered data. Conventional methods deal with this either by spatial forecasting given observed counts at previous times or by temporal forecasting given observed traffic counts in neighbouring locations. In order to fully exploit the spatio-temporal properties observed in the data, a hybrid multimodal deep learning method for short-term traffic flow forecasting called HaMDeepT is proposed. Specifically, the HaMDeepT method can jointly and adaptively learn the spatial-temporal correlation features and long temporal interdependence of multi-modality traffic data through an attention-based auxiliary multimodal deep learning architecture. The base module of this method consists of a 1D CNN and GRU with the attention mechanism. The forecasted spatio-temporal traffic demand (counts of traffic passing through different locations at regular time intervals) is dependent on far more critical spatial factors than other point sensors such as weather stations. This HaMDeepT method, in terms of 3D CNN-GRU, which uses a stack of 3D Convolutional Neural Networks (3D CNN) and Gated Recurrent Units (GRUs) combined with the Correlation and Relative Operation layers to model both the spatial context features and the temporal dependencies of traffic count data at all locations, has a better performance compared to other network architectures. It overcomes the drawback of a fixed and handcrafted graph Laplacian matrix representation of the spatial relationships of the locations used by the ST-Graph. It uses the Correlation layer to estimate the spatial correlation features for each traffic count data point with others, focusing on the stations with major impacts on the target location, and the Relative Operation layer to model the relative distances thereafter. Using these novel methods, the traffic flow forecasting results for the miniNYC dataset are more accurate and more intuitive visualisation of the spatial structure that affects the performance of the predictions.
Authors (2)
Srinivas Kalisetty
Integration and AI leadIntegration and AI leadIntegration and AI leadIntegration and AI lead
View all publications →Phanish Lakkarasu
Senior Site Reliability Engine...Senior Site Reliability EngineerSenior Site Reliability EngineerSenior Site Reliability Engineer
View all publications →Download Article
Best for printing and citation
File size: 0.0 MB
Format: PDF
Article Information
Published in:
October-December 2024 (Vol. 1, Issue 1, 2024)- Article ID:
- jaibdd110023
- Paper ID:
- JAIBDD-01-000023
- Published Date:
- 2026-02-24
Article Impact
Views:4,182
Downloads:1,153
scite_
Smart Citations
0Citing Publications
0Supporting
0Mentioning
0Contrasting
View Citations
See how this article has been cited at scite.ai
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
How to Cite
Kalisetty & Lakkarasu (2026). Deep Learning Frameworks for Multi-Modal Data Fusion in Retail Supply Chains: Enhancing Forecast Accuracy and Agility. Journal of Artificial Intelligence and Big Data Disciplines (JAIBDD), 1(1), xx-xx. DOI:https://doi.org/10.70179/vc6nrq25
Article Actions
More from this Issue
Revolutionizing Patient Outcomes: The Role of Generative AI and Machine Learning in Predictive Analytics for Healthcare
Valiki DileepRead more →
More by These Authors
Resilient Manufacturing in the Era of Industry 4.0: Leveraging AI and Edge Computing for Real-Time Quality Control and Predictive
2024 • Vol. 1, Issue 1
Read more →
