Journal "Proceedings of the Institute for Systems Analysis of the Russian Academy of Sciences" - A.E. Zhukovsky Methods for interframe integration of document detection results in a video stream of a mobile device

Viewing issue 2018-S1

Data mining and image recognition

N.S. Skoryukina, A.N. Milovzorov, D.V. Polevoy, V.V. Arlazarov Paintings recognition in uncontrolled conditions using one-shot learning

A.E. Zhukovsky Methods for interframe integration of document detection results in a video stream of a mobile device

I.A. Kunina, E.I. Panfilova, M.A. Povolotskiy Zebra-crossing detection on road images using dynamic time warping

O.A. Slavin, V.L. Arlazarov Method for classifying recognized pages of administrative documents on the basis of text key points

O.O. Petrova, K.B. Bulatov Methods of machine-readable zone recognition results post-processing

A.E. Marchenko, E.I. Ershov, D.A. Shepelev, D.S. Sidorchuk, V.P. Bozhkova, D.P. Nikolaev Designing of language of description of observable properties of recognized objects in the absence of samples

Intellectual systems and technologies

E.E. Limonova, N.L. Rzhenev, A.V. Uskov, M.I. Neiman-zade Fast implementation of Hamming distance on VLIW-architectures on the example of Elbrus platform

V.V. Arlazarov, K.B. Bulatov, A.V. Uskov A model of object recognition system in video stream of a mobile device

A.A. Ivanova, S.A. Gladilin, A.E. Zhukovsky, E.L. Pliskin Database for the administrative accounting of scientific publications

A.S. Ingacheva, A.V. Sheshkus, T.S. Chernov, E.E. Limonova, V.V. Arlazarov X-ray computed tomography scanner – a new tool in recognition

N.O. Beshaposhnikov, A.G. Kushnirenko, A.A. Levin A method for auto-calibration of the educational robot control parameters using computer vision library OpenCV

Image and signal processing

A.E. Zhukovsky, E.E. Limonova, D.P. Nikolaev Exact implementation of common image processing algorithms using fully convolutional networks

V.E. Prun Reducing the influence of high-absorbing inclusions on CT reconstructions using algebraic reconstruction technique

B.I. Savelyev, I.B. Mamay, D.P. Nikolaev, V.L. Arlazarov, K.B. Bulatov, N.S. Skoryukina A method of projective transformations graph adjustment for panorama stitching problem for images of planar objects

D.V. Tropin, D.P. Nikolaev, D.G. Slugin The method of image alignment based on sharpness maximization

J.A. Shemiakina, A.E. Zhukovsky, I.A. Konovalenko, D.P. Nikolaev Algorithm for automatic framing of digital images under projective transformation

Machine learning

A.V. Gayer, A.V. Sheshkus, Y.S. Chernyshova Augmentation on the fly for the neural networks learning

V.V. Arlazarov, D.P. Matalov, S.A. Usilin Localization of the seal on the identity document image using machine learning approach

A.E. Lynchenko, A.V. Sheshkus, V.L. Arlazarov Identity document classification algorithm based on similarity metric robust to projective distortions

V.A. Malykh, V.A. Lyalin On Classification of Noisy Texts

Y.S. Chernyshova, M.A. Aliev, A.V. Sheshkus Optical font recognition of images captured with mobile devices and its application for detecting identity documents forgery

D.A. Ilin Fast words boundaries localization in text fields for low quality document images

D.E. Ivanov, D.V. Polevoy, D.L. Sholomov Selection of informative elements for the training of a lightweight convolutional neural network classifier in the conditions of a strong imbalance of the training sample


	A.E. Zhukovsky Methods for interframe integration of document detection results in a video stream of a mobile device
Abstract. The paper addresses the task of detecting the position of a document in a video stream received from a mobile device. Particular attention is paid to methods for integrating the document positions obtained over a sequence of frames. The paper describes an algorithm based on the Kalman filter that selects the document position from a set of provided alternatives, then integrates and refines the positions across the video stream. The performance of the algorithm is analyzed on the dataset of the ICDAR'15 competition on smartphone document capture (SmartDoc).

Keywords: document detection, video stream, integration, projective transformation, mobile cameras, Kalman filter.

PP. 15-22.

DOI: 10.14357/20790279180502

References

1. V.V. Arlazarov, A.E. Zhukovsky, V.E. Krivtsov, D.P. Nikolaev, D.V. Polevoy. Analiz osobennostey ispol’zovaniya statsionarnykh i mobil’nykh malorazmernykh tsifrovykh video kamer dlya raspoznavaniya dokumentov [Analysis of specific character of usage fixed and mobile smallsize video cameras for document recognition], Informatsionnyye tekhnologii i vychislitel’nyye sistemy [Information Technologies and computing systems], vol. 3, 2014, pp. 71-81
2. J. Liang, D. Doermann, H. Li. Camera-based analysis of text and documents: a survey, Int. J. of Document Analysis and Recognition, vol. 7, issue 2, 2005, pp. 84-104
3. D. Doermann, J. Liang, H. Li. Progress in Camera-Based Document Image Analysis, IEEE Proc. 7th Int. Conf. on Document Analysis and Recognition, vol. 1, 2003, pp. 606-616
4. K. Bulatov, V.V. Arlazarov, T. Chernov, O. Slavin, D. Nikolaev. Smart IDReader: Document Recognition in Video Stream, 2017 14th IAPR Int. Conf. on Document Analysis and Recognition (ICDAR), 2017, pp. 39-44, DOI: 10.1109/ICDAR.2017.347
5. J.C. Burie, J. Chazalon, M. Coustaty, S. Eskenazi, M.M. Luqman, M. Mehri, N. Nayef, J.M. Ogier, S. Prum, M. Rusinol. ICDAR2015 Competition on Smartphone Document Capture and OCR (SmartDoc), 13th Int. Conf. on Document Analysis and Recognition, 2015
6. V.V. Arlazarov, A.E. Zhukovsky, V.E. Krivtsov, V.V. Postnikov. Ispol’zovaniye grafa peresecheniy v zadache obnaruzheniya dokumenta na izobrazhenii, poluchennom so smartfona [Usage of the intersection graph in the task of camera-based document detection], Iskusstvennyy Intellekt i Prinyatiye Resheniy [Artificial Intelligence and Decision Making], vol. 2, 2016, pp. 60-69
7. A. Zhukovsky et al. Segments Graph-Based Approach for Document Capture in a Smartphone Video Stream, 2017 14th IAPR Int. Conf. on Document Analysis and Recognition (ICDAR), 2017, pp. 337-342, DOI: 10.1109/ICDAR.2017.63
8. N. Skoryukina, Y. Shemyakina, V.L. Arlazarov, I. Faradjev. Document localization algorithms based on feature points and straight lines, Proc. SPIE 10696, 10th Int. Conf. on Machine Vision (ICMV 2017), 2018, pp. 1-8, DOI: 10.1117/12.2311478
9. T.H. Cormen, C.E. Leiserson, R.L. Rivest, C. Stein. Introduction to Algorithms (second ed.), MIT Press and McGraw-Hill, 2001, ISBN 978-0-262-53196-2
10. R.E. Kalman. A New Approach to Linear Filtering and Prediction Problems, J. of Basic Engineering 82, 35, 1960
11. R. Hartley, A. Zisserman. Multiple view geometry in computer vision, Cambridge University Press, New York, 2003
12. Y.A. Shemyakina, A.E. Zhukovsky, I.A. Faradjev. Issledovaniye algoritmov vychisleniya proyektivnogo preobrazovaniya v zadache navedeniya na planarnyy ob”yekt po osobym tochkam [Investigation of algorithms for calculating a projective transformation in the problem of targeting to a planar object from feature points], Iskusstvennyy Intellekt i Prinyatiye Resheniy [Artificial Intelligence and Decision Making], vol. 1, 2017, pp. 43-49
13. Y. Shemyakina, A. Zhukovsky, I. Faradjev. The Calculation of a Projective Transformation in the Problem of Planar Object Targeting by Feature Points, Proc. SPIE 10341, 9th Int. Conf. on Machine Vision (ICMV 2016), 2017, pp. 1-6, DOI: 10.1117/12.2268590
14. H. Bay, T. Tuytelaars, L.V. Gool. SURF: Speeded up robust features, European Conf. on Computer Vision (ECCV), 2006, pp. 404-417
15. M. Calonder, V. Lepetit, C. Strecha, P. Fua. BRIEF: Binary Robust Independent Elementary Features, 11th European Conf. on Computer Vision (ECCV), 2010
16. M.A. Fischler, R.C. Bolles. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography, Communications of the ACM, 24(6), 1981, pp. 381-395
17. M. Everingham, L.V. Gool, C. Williams, J. Winn, A. Zisserman. The PASCAL visual object classes (VOC) challenge, IJCV, vol. 88, no. 2, 2010, pp. 303-338
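
The abstract names a Kalman-filter-based scheme for choosing among alternative document positions and integrating them across frames, but gives no details. The sketch below is only an illustration of that general idea, not the paper's algorithm. It assumes a document position is a quadrilateral stored as eight numbers, a random-walk motion model, and independent scalar filters per coordinate. The names `QuadTracker`, `q`, and `r` and the noise values are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Sequence, Tuple

# Hypothetical layout: (x1, y1, x2, y2, x3, y3, x4, y4) in pixels.
Quad = Tuple[float, ...]


@dataclass
class QuadTracker:
    """One scalar Kalman filter per corner coordinate (illustrative only)."""
    q: float = 4.0    # assumed process noise variance, pixels^2
    r: float = 16.0   # assumed measurement noise variance, pixels^2
    state: List[float] = field(default_factory=list)
    var: List[float] = field(default_factory=list)

    def predict(self) -> List[float]:
        # Random-walk model: the estimate is unchanged, uncertainty grows.
        self.var = [p + self.q for p in self.var]
        return list(self.state)

    def select(self, candidates: Sequence[Quad]) -> Quad:
        # Choose the alternative closest to the predicted position.
        pred = self.state
        return min(candidates,
                   key=lambda c: sum((a - b) ** 2 for a, b in zip(c, pred)))

    def update(self, z: Quad) -> List[float]:
        if not self.state:
            # The first observation initialises the filter.
            self.state, self.var = list(z), [self.r] * len(z)
            return list(self.state)
        for i, zi in enumerate(z):
            k = self.var[i] / (self.var[i] + self.r)  # Kalman gain
            self.state[i] += k * (zi - self.state[i])
            self.var[i] *= 1.0 - k
        return list(self.state)

    def step(self, candidates: Sequence[Quad]) -> List[float]:
        # Assumption: on the first frame the first candidate is trusted.
        if not self.state:
            return self.update(candidates[0])
        self.predict()
        return self.update(self.select(candidates))
```

Calling `step` once per frame with that frame's candidate quadrilaterals returns a smoothed position. A fuller treatment would likely track velocity or the projective transformation itself rather than raw corner coordinates.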
