Data mining and image recognition
Intellectual systems and technologies
Image and signal processing
MACHINE LEARNING
A.E. Lynchenko, A.V.Sheshkus, V.L.Arlazarov Identity document classifiaction algorithm based on similarity metric robust to projective distortions
A.E. Lynchenko, A.V.Sheshkus, V.L.Arlazarov Identity document classifiaction algorithm based on similarity metric robust to projective distortions

Abstract.

This article presents an algorithm for document image recognition robust to projective distortions and with possibility of one-shot learning. This method is based on a similarity metric, which is learned using Siamese architecture. The proposed algorithm achieved recognition quality comparable to classifying convolutional network on an open dataset of identity document images MIDV-500.

Keywords:

pattern recognition, siamese neural network, convolutional neural network, deep learning.

PP. 167-173.

DOI: 10.14357/20790279180519

References

1. V.V. Arlazarov, K. Bulatov, T. Chernov and V.L. Arlazarov. “MIDV-500: A Dataset for Identity Documents”.
2. G. Hinton, L. Deng and D. Yu. “Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups,” IEEE Signal Processing Magazine, pp. 82 – 97, DOI: 10.1109/MSP.2012.2205597, 18 10 2012.
3. R. Collobert and J. Weston. “A unified architecture for natural language processing: deep neural networks with multitask learning,” in Proceedings of the 25th international conference on Machine learning, Helsinki, Finland, 2008, DOI: 10.1145/1390156.1390177.
4. A. Krizhevsky, I. Sutskever and G. Hinton. “ImageNet Classification with Deep Convolutional Neural Networks,” Advances in Neural Information Processing Systems 25, pp. 1097-1105, 2012.
5. K. Bulatov, V.V. Arlazarov and T. Chernov. “Smart IDReader: Document Recognition in Video Stream,” International Conference on Document Analysis and Recognition (ICDAR), 9 11 2017, DOI: 10.1109/ICDAR.2017.347.
6. I. Dubnov, A. Merkov, V.L. Arlazarov and I. Nikolaev. “Evidence Maximization Technique for Training of Elastic Nets,” Journal of Optimization, 2016.
7. J. Goldberger, S. Roweis, G. Hinton and R. Salakhutdinov. “Neighbourhood Components Analysis,” in Neural Information Processing Systems (NIPS), Montreal, Canada, 2014.
8. S. Chopra, R. Hadsell and Y. LeCun. “Learning a Similarity Metric Discriminatively, with Application to Face Verification,” in Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), 2005, DOI: 10.1109/CVPR.2005.202.
9. J. Bromley, I. Guyon, Y. LeCun and R. Shah. “Signature verification using a Siamese time delay neural network,” in Advances in neural information processing systems (NIPS), 1993.
10. O. Vinyals, C. Blundell, T. Lillicrap, K. Kavukcuoglu and D. Wierstra. “Matching Networks for One Shot Learning,” Advances in Neural Information Processing Systems 29, pp. 3630-3638, 2016.
11. D. Polevoy, A.Sheshkus, K.Bulatov, N. Skoryukina, T. Chernov and V. V. Arlazarov. “Key Aspects of Document Recognition Using Small Digital Cameras,” Vestnik RFBR, pp. 97-108, 2016, DOI: 10.22204/2410-4639-2016-092-04-97-108.
12. S. Lawrence, L. Giles and A. Back. “Face recognition: a convolutional neural-network approach,” IEEE Transactions on Neural Networks, pp. 98 – 113, 1 1 1997, DOI: 10.1109/72.554195.
 

2024-74-1
2023-73-4
2023-73-3
2023-73-2

© ФИЦ ИУ РАН 2008-2018. Создание сайта "РосИнтернет технологии".