[1] RUSSAKOVSKY O, DENG J, SU H, et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015,115(3):211-252.
[2] REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[C]// Proceedings of the 28th International Conference on Neural Information Processing Systems. 2015:91-99.
[3] LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot multibox detector[C]// European Conference on Computer Vision(ECCV 2016). Springer, 2016:21-37.
[4] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection[C]// Computer Vision and Pattern Recognition (CVPR 2016). IEEE, 2016:779-788.
[5] 〖KG-*3〗REDMON J, FARHADI A. YOLO9000: Better, faster, stronger[C]// Computer Vision and Pattern Recognition(CVPR 2017). IEEE, 2017:6517-6525.
[6] VINYALS O, BLUNDELL C, LILLICRAP T, et al. Matching networks for one shot learning[C]// Proceedings of the 30th International Conference on Neural Information Processing Systems. ACM, 2016:3637-3645.
[7] CHOPRA S, HADSELL R, LECUN Y. Learning a similarity metric discriminatively, with application to face verification[C]// Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2005:539-546.
[8] 〖KG-*4〗KOCH G, ZEMEL R, SALAKHUTDINOV R. Siamese neural networks for one-shot image recognition[C]// Proceedings of the 32nd International Conference on Machine Learning. 2015.
[9] SANTORO A, BARTUNOV S, BOTVINICK M, et al. One-shot learning with memory-augmented neural networks[J]. arXiv preprint arXiv:1605.06065, 2016.
[10]BERTINETTO L, VALMADRE J, HENRIQUES J F, et al. Fully-convolutional siamese networks for object tracking[C]// European Conference on Computer Vision. Springer, 2016:850-865.
[11]LI B, YAN J J, WU W, et al. High performance visual tracking with siamese region proposal network[C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018:8971-8980.
[12]ZHANG T F, ZHANG Y, SUN X, et al. Comparison network for one-shot conditional object detection[J]. Computer Vision and Pattern Recognition, 2019: arXiv:1904.02317.
[13]ZHANG S F, ZHU X Y, LEI Z, et al. S3FD: Single shot scale-invariant face detector[C]// IEEE International Conference on Computer Vision (ICCV). IEEE, 2017:192-201.
[14]LIN T Y, DOLLAR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]// IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017:936-944.
[15]LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE, 1998,86(11):2278-2324.
[16]Darknet. Darknet: Open Source Neural Networks in C[EB/OL]. [2020-02-15]. http://pjreddie.com/darknet/.
[17]VAN ETTEN A. You only look twice: Rapid multi-scale object detection in satellite imagery[J]. Computer Vision and Pattern Recognition, 2018:arXiv:1805.09512.
[18]KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[C]// Proceedings of the 25th International Conference on Neural Information Processing Systems. ACM, 2012:1097-1105.
[19]IOFFE S, SZEGEDY C. Batch normalization: Accelerating deep network training by reducing internal covariate shift[J]. Machine Learning, 2015:arXiv:1502.03167.
[20]TAIGMAN Y, YANG M, RANZATO M, et al. DeepFace: Closing the gap to human-level performance in face verification[C]// IEEE Conference on Computer Vision & Pattern Recognition. 2014:1701-1708.
[21]SCHROFF F, KALENICHENKO D, PHILBIN J. FaceNet: A unified embedding for face recognition and clustering[C]// IEEE Conference on Computer Vision & Pattern Recognition. 2015:815-823.
[22]ZAGORUYKO S, KOMODAKIS N. Learning to compare image patches via convolutional neural networks[C]// Computer Vision and Pattern Recognition. 2015: 4353-4361.
[23]LUO W J, LI Y J, URTASUN R, et al. Understanding the effective receptive field in deep convolutional neural networks[C]// Proceedings of the 30th International Conference on Neural Information Processing Systems. ACM, 2016:4905-4913.
[24]ZHOU B L, KHOSLA A, LAPEDRIZA A, et al. Object detectors emerge in deep scene CNNs[J]. Computer Vision and Pattern Recognition, 2014:arXiv:1412.6856.
[25]KINGMA D P, BA J. Adam: A method for stochastic optimization[J]. Machine Learning, 2014:arXiv:1412.6980.
[26]LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection[C]// 2017 IEEE International Conference on Computer Vision. 2017:2999-3007.
[27]HOWARD A, SANDLER M, CHEN B, et al. Searching for MobileNetV3[C]// IEEE/CVF International Conference on Computer Vision (ICCV). 2019:1314-1324.
|