谷歌浏览器插件
订阅小程序
在清言上使用

Shifted-Window Hierarchical Vision Transformer for Distracted Driver Detection

2021 IEEE REGION 10 SYMPOSIUM (TENSYMP)(2021)

引用 10|浏览2
暂无评分
摘要
Distracted driving is one of the leading causes of fatal road accidents. Current studies mainly use convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to classify distracted action through spatial and spectral information. Following the success application of transformer in natural language processing (NLP), transformer is introduced to handle computer vision tasks. Vision transformer can mine long-range relationship and less loss of information between layers. Compared to a regular vision transformer, a hierarchical transformer with representation computed with shifted windows could limit the self-attention computation, yielding more computation efficiency. In this work, we conduct a review on shifted-window hierarchical vision transformers, following the exact implementation of Swin Transformer in classifying distracted drivers through the American University in Cairo Distracted Driver Dataset (AUC-DDD). Results show that shifted-window hierarchical transformer can achieve a classification accuracy of 95.72% in distracted driver detection.
更多
查看译文
关键词
Vision Transformer, Image Classification, Safe Driving, Intelligent Transportation System (ITS)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要