Review on autonomous driving technology based on computer vision
Zhang Guiying, Xiang Han, Zhao Yong
Abstract:
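This paper reviews autonomous driving technology from two perspectives: approaches based on traditional features and approaches based on deep learning. It first discusses techniques built on traditional features, such as road and lane-line detection, preceding-vehicle detection, pedestrian detection, and collision-avoidance systems. Because the targets to be recognized are so varied, object detection based on traditional features has reached a bottleneck that is hard to overcome. It then describes deep-learning-based autonomous driving algorithms: convolutional neural networks can directly learn and perceive the road surface and the vehicles on it, which can substantially improve the performance of autonomous driving algorithms. Finally, it summarizes the review and outlines future research directions, namely integrating traditional features with deep-learning features to make deep-learning-based driving more human-like and more practical.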
Keywords: autonomous driving; convolutional neural network; computer vision; deep learning
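Foundation: Shenzhen Basic Research (Discipline Layout) Science and Technology Innovation Fund Project (JCYJ20160506172651253); Joint Fund Project of the Guizhou Provincial Department of Science and Technology (Qian Ke He LH Zi [2014] No. 7597); Zunyi Medical College Master's Start-up Fund Project (No. F-641)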
DOI: 10.13391/j.cnki.issn.1674-7798.2016.06.004