Журнал «Труды Института системного анализа Российской академии наук» - I.M. Shigabeev, James Rodriguez, N.Yu. Chernykh "DogPose

In this work, we present a dataset for a pose classification of dogs as well as a sample pipeline for employing this dataset into an AI-powered application that tracks dog activity throughout the day, giving its user information on whether his dogs sleep all day or it stays active even while the dog owner is not home. This application is essential for dog owners to spot the trends of increasing dog passivity.

Computer vision, Pose Estimation, Image Classification, Internet of things, Semisupervised dataset generation, Data Collection, Artificial Intelligence, Pattern Recognition.

1. Mathis A., Mamidanna P., Cury K.M. et al. DeepLabCut: markerless pose estimation of userdefined body parts with deep learning. // Nat Neurosci 21, pp.1281–1289 (2018).

2. Y. Iwashita, A. Takamine, R. Kurazume and M.S. Ryoo. First-Person Animal Activity Recognition from Egocentric Videos // International Conference on Pattern Recognition (ICPR), 2014, pp.4310-4315.

3. Körner M., Denzler J. JAR-Aibo: A Multi-view Dataset for Evaluation of Model-Free Action Recognition Systems. // New Trends in Image Analysis and Processing – ICIAP 2013, 2013, vol 8158, pp.527-535, https://doi.org/10.1007/978-3-642-41190-8_57.

4. He, Kaiming & Zhang, Xiangyu & Ren, Shaoqing & Sun, Jian. Deep Residual Learning for Image Recognition. // 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.770-778.

5. Vaswani, Ashish & Shazeer, Noam & Parmar, Niki & Uszkoreit, Jakob & Jones, Llion & Gomez, Aidan & Kaiser, Lukasz & Polosukhin Illia. Attention Is All You Need. // Advances in Neural Information Processing Systems 30 (NIPS 2017), 2017, vol.30

6. Radford, Alec & Kim, Jong & Hallacy, Chris & Ramesh, Aditya & Goh, Gabriel & Agarwal, Sandhini & Sastry, Girish & Askell, Amanda & Mishkin, Pamela & Clark, Jack & Krueger, Gretchen & Sutskever, Ilya. Learning Transferable Visual Models From Natural Language Supervision // 2021, arXiv:2103.00020.

7. Kuznetsova, Alina & Rom, Hassan & Alldrin, Neil & Uijlings, Jasper & Krasin, Ivan & Pont-Tuset, Jordi & Kamali, Shahab & Popov, Stefan & Malloci, Matteo & Kolesnikov, Alexander & Duerig, Tom & Ferrari, Vittorio. The Open Images Dataset V4: Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale // International Journal of Computer Vision, 2020. vol 128.

8. Dosovitskiy, Alexey & Beyer, Lucas & Kolesnikov, Alexander & Weissenborn, Dirk & Zhai, Xiaohua & Unterthiner, Thomas & Dehghani, Mostafa & Minderer, Matthias & Heigold, Georg & Gelly, Sylvain & Uszkoreit, Jakob & Houlsby, Neil. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale // International Conference on Learning Representations (ICLR), 2020.