Multimodal Fusion for High-Accuracy Fatigue Driving Detection
DOI:
https://doi.org/10.54097/z0vxv660Keywords:
Multimodal Emotion Recognition, Fatigue Driving, Artificial Intelligence, Smart Driving.Abstract
Driving fatigue is a serious global problem that accounts for an extremely high percentage of road traffic accidents, and therefore advanced detection techniques are urgently needed. The article provides a comprehensive overview of the application of multimodal emotion recognition techniques for fatigue driving detection. The article systematically reviews and analyses various types of existing studies that fused visual cues (e.g., facial expressions, yawning) with physiological signals (e.g., EEG, ECG) to improve detection accuracy and robustness. The analysis shows that the multimodal fusion framework can effectively compensate for the limitations of unimodal approaches (e.g., sensitivity to environmental changes). In addition, the article identifies key challenges that hinder the practical implementation of the technology, including data scarcity, insufficient generalization capabilities, real-time processing limitations, and privacy issues. Finally, the article explores promising future research directions, such as lightweight model design, cross-domain adaptation, and dynamic mode selection. This review aims to provide a valuable reference for those engaged in the research and practice of intelligent driver assistance systems.
Downloads
References
[1] Koks E, Rozenberg J, Zorn C, et al. A global multi-hazard risk analysis of road and railway infrastructure Assets. Nature Communications, 2019, 10 (1).
[2] Islam M A, Dinar Y. Evaluation and Spatial Analysis of Road Accidents in Bangladesh: An Emerging and Alarming Issue. Transportation in Developing Economies, 2021, 7 (1).
[3] Annell S, Gratner A, Svensson L. Probabilistic collision estimation system for autonomous vehicles. 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), 2016: 473 - 478.
[4] Katuwal K, Simkhada P, Gee I, et al. PW 2266 Road traffic injuries in nepal: a study of causes, patterns and control measures. Abstracts, 2018: A188.1 - A188.
[5] Nemcova A, Svozilova V, Bucsuhazy K, et al. Multimodal Features for Detection of Driver Stress and Fatigue: Review. IEEE Transactions on Intelligent Transportation Systems, 2021, 22 (6): 3214 - 3233.
[6] Abe T. PERCLOS-based technologies for detecting drowsiness: Current evidence and future Directions. SLEEP Advances, 2023, 4 (1).
[7] Takahashi K, Yamamoto K, Kuchiba A, et al. Hypothesis testing procedure for binary and multi‐class F1‐scores in the paired Design. Statistics in Medicine, 2023, 42 (23): 4177 - 4192.
[8] Cao S, Feng P, Kang W, et al. Optimized driver fatigue detection method using multimodal neural Networks. Scientific Reports, 2025, 15 (1).
[9] Priyanka S, Shanthi S, Saran Kumar A, et al. Data fusion for driver drowsiness recognition: A multimodal Perspective. Egyptian Informatics Journal, 2024, 27: 100529.
[10] Knapik M, Cyganek B, Balon T. Multimodal Driver Condition Monitoring System Operating in the Far-Infrared Spectrum. Electronics, 2024, 13 (17): 3502.
[11] Chen J, Dey S, Wang L, et al. Attention-Based Multi-Modal Multi-View Fusion Approach for Driver Facial Expression Recognition. IEEE Access, 2024, 12: 137203 - 137221.
[12] He Y, Zhao J. Temporal Convolutional Networks for Anomaly Detection in Time Series. Journal of Physics: Conference Series, 2019, 1213 (4): 042050.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







