Журнал «Труды Института системного анализа Российской академии наук» - V.V. Arlazarov, K.B. Bulatov, A.V. Uskov A model of object recognition system in video stream of a mobile device

Просматривается номер 2018-S1

Data mining and image recognition

N.S. Skoryukina, A.N. Milovzorov, D.V. Polevoy, V.V. Arlazarov Paintings recognition in uncontrolled conditions using one-shot learning

A.E. Zhukovsky Methods for interframe integration of document detection results in a video stream of a mobile device

I.A. Kunina, E. I. Panfilova, M.A. Povolotskiy Zebra-crossing detection on road images using dynamic time warping

O.A. Slavin, V.L. Arlazarov Method for classifying recognized pages of administrative documents on the basis of text key points

O.O. Petrova, K.B. Bulatov Methods of machine-readable zone recognition results post-processing

A.E. Marchenko, E.I. Ershov, D.A. Shepelev, D.S. Sidorchuk, V.P. Bozhkova, D.P. Nikolaev Designing of language of description of observable properties of recognized objects in the absence of samples

Intellectual systems and technologies

E.E. Limonova, N.L. Rzhenev, A.V. Uskov, M.I. Neiman-zade Fast implementation of Hamming distance on VLIW-architectures on the example of Elbrus platform

V.V. Arlazarov, K.B. Bulatov, A.V. Uskov A model of object recognition system in video stream of a mobile device

A.A. Ivanova, S.A. Gladilin, A.E. Zhukovsky, E.L. Pliskin Database for the administrative accounting of scientific publications

A.S. Ingacheva, A.V. Sheshkus, T. S. Chernov, E.E. Limonova, V.V. Arlazarov X-ray computed tomography scanner – a new tool in recognition

N.O. Beshaposhnikov, A.G. Kushnirenko, A.A. Levin A method for auto-calibration of the educational robot control parameters using computer vision library OpenCV

Image and signal processing

A.E. Zhukovsky, E.E. Limonova, D.P. Nikolaev Exact implementation of common image processing algorithms using fully convolutional networks

V.E. Prun Reducing the influence of high-absorbing inclusions on CT reconstructions using algebraic reconstruction technique

B.I. Savelyev, I.B. Mamay, D.P. Nikolaev, V.L. Arlazarov, K.B. Bulatov, N.S. Skoryukina A method of projective transformations graph adjustment for panorama stitching problem for images of planar objects

D.V. Tropin, D.P. Nikolaev, D.G. Slugin The method of image alignment based on sharpness maximization

J.A. Shemiakina, A.E. Zhukovsky, I.A. Konovalenko, D.P. Nikolaev Algorithm for automatic framing of digital images under projective transformation

MACHINE LEARNING

A.V. Gayer, A.V. Sheshkus, Y.S. Chernyshova Augmentation on the fly for the neural networks learning

V.V. Arlazarov, D.P. Matalov, S.A. Usilin Localization of the seal on the identity document image using machine learning approach

A.E. Lynchenko, A.V.Sheshkus, V.L.Arlazarov Identity document classifiaction algorithm based on similarity metric robust to projective distortions

V.A. Malykh, V.A. Lyalin On Classification of Noisy Texts

Y.S. Chernyshova, M.A. Aliev, A.V. Sheshkus Optical font recognition of images captured with mobile devices and its application for detecting identity documents forgery

D.A. Ilin Fast words boundaries localization in text fields for low quality document images

D.E. Ivanov, D.V. Polevoy, D.L. Sholomov Selection of informative elements for the training of a lightweight convolutional neural network classifier in the conditions of a strong imbalance of the training sample


	V.V. Arlazarov, K.B. Bulatov, A.V. Uskov A model of object recognition system in video stream of a mobile device
Abstract. This paper describes a problem of automatic objects recognition using video stream as digital object representation. Several variants of video stream system formulation are described, properties of dynamic recognition system model are discussed. Recognition results integration problem and stopping problem are described, which occur in recognition system with time parameters and without natural restriction on the number of input frames. Formal statements of both problems are presented in scope of a general integration model of the recognition system and its user. Keywords: pattern recognition, video stream, mobile devices, recognition systems, OCR. PP. 73-82. DOI: 10.14357/20790279180508 References 1. Bulatov K., Arlazarov V.V., Chernov T., Slavin O., Nikolaev D. “Smart IDReader: Document Recognition in Video Stream” // 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). – 2017. –V. 6, – P. 39-44. 2. Arlazarov V.V., Zhukovsky A., Krivtsov V., Nikolaev D., Polevoy D. “Analysis of using stationary and mobile small-scale digital cameras for document recognition “ // Information technologies and computation systems. – 2014. – № 3. – P. 71-78. 3. Wemhoener D., Yalniz I.Z., Manmatha R. “Creating an Improved Version Using Noisy OCR from Multiple Editions” // 12th IAPR International Conference on Document Analysis and Recognition (ICDAR). – 2013. – P. 160-164. 4. Rokach L. “Ensemble-based classifiers” // Artificial Intelligence Review. – 2010. – Vol. 33, No. 1. – P. 1-39. 5. Kittler et al. “On Combining Classifiers” // IEEE Trans. Pattern Analysis and Machine Intelligence. – 1998. – Vol. 20, No. 3. – P. 226-239. 6. Ting K.M., Witten I.H. “Issues in Stacked Generalization” // Journal of Artificial Intelligence Research. – 1999. – Vol. 10, No. 1. – P. 271-289. 7. Kuncheva L.I., Bezdek J.C., Duin R.P. “Decision templates for multiple classifier fusion: an experimental comparison” // Pattern Recognition. – 2001. – Vol. 34, No. 2. – P. 299-314. 8. Nguyen T.T. et al. “A Novel Combining Classifier Method Based on Variational Inference” // Pattern Recognition. – 2016. – Vol. 49, No. C. – P. 198-212. 9. Petrovsky A.B. “Methods of group classification of multi-feature objects (part 1)” // Artificial intelligence and decision theory. – 2009. – № 3. – P. 3-14. 10. Petrovsky A.B. “Methods of group classification of multi-feature objects (part 2)” // Artificial intelligence and decision theory. – 2009. – № 4. – P. 3-14. 11. LeCun Y. et al. “Gradient-Based Learning Applied to Document Recognition” // Proceedings of the IEEE. – 1998. 12. Krizhevsky A., Sutskever I., Hinton G.E. “ImageNet Classification with Deep Convolutional Neural Networks” // Advances in Neural Information Processing Systems 25 / ed. by F. Pereira [et al.]. – Curran Associates, Inc., 2012. – P. 1097-1105. 13. Taigman Y. et al. “DeepFace: Closing the Gap to Human-Level Performance in Face Verification” // IEEE Conference on Computer Vision and Pattern Recognition. – 2014. – P. 1701-1708. 14. Moosavi-Dezfooli S., Fawzi A., Frossard P. “DeepFool: a simple and accurate method to fool deep neural networks” // CoRR. – 2015. – Vol abs/1511.04599. 15. Papernot N. et al. “The Limitations of Deep Learning in Adversarial Settings” // CoRR. – 2015. – Vol. abs/1511.07528. 16. Su J., Vargas D.V., Sakurai K. “One pixel attack for fooling deep neural networks” // CoRR. – 2017. – Vol. abs/1710.08864. 17. Sung Cheol Park, Min Kyu Park, Moon Gi Kang. “Super-resolution image reconstruction: a technical overview” // IEEE Signal Processing Magazine. – 2003. – V.20. – N. 3. – P. 21-36. 18. Semwal A., Chamoli A., Mukesh C.A., Salman A. “A Survey: The Methods & Techniques of Super- Resolution Image Reconstruction” // International Journal for Scientific Research & Development. – 2017. – V. 4. – I. 12. – P. 243-249. 19. International standard ISO/IEC 14496-12 “Information technology – Coding of audio-visual objects – Part 12: ISO base media file format”. ISO/IEC. – 2005. – 94 p. 20. Arlazarov V.L., Loginov A.S., Slavin O.A. “Characteristics of Optical Text Recognition Programs” // Programming and Computer Software. – 2002. – Vol. 28, No. 3. – P. 148-161. 21. Arlazarov V.V., Kliatsine V.M. “Solving the problem of confidence determination for symbol recognition result in Cognitive Forms system “ // Document processing. Concepts and instruments. Proceedings of ISA RAS. – 2004. – 208 p. 22. Kimura S. et al. “A Man-Machine Cooperating System Based on the Generalized Reject Model” // 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). – 2017. – V.1. – P. 1324-1329.

2024-74-1

2023-73-4

2023-73-3

2023-73-2

Abstract.

Keywords: