Comparative Analysis of Classification and Segmentation Performance of Different Dog Breeds Based on Mask R-CNN

Comparative Analysis of Classification and Segmentation Performance of Different Dog Breeds Based on Mask R-CNN

Authors

  • Youtian Luo Jincheng College, Sichuan University, Chengdu 611731, Sichuan, China
  • Dan Li Jincheng College, Sichuan University, Chengdu 611731, Sichuan, China

DOI:

https://doi.org/10.53469/wjimt.2024.07(05).09

Keywords:

Mask R-CNN, Classification, Division, Contrast

Abstract

The classification of different image data is a common task in real life, which has been achieved as early as convolutional neural networks. With the emergence of R-CNN (Region CNN), simple classification tasks began to develop towards object detection tasks, followed by various segmentation tasks. After Faster R-CNN [4], Mask R-CNN [3] achieved instance segmentation while detecting objects. This algorithm model needs to distinguish different target objects in the image panel and learn a large number of characteristics to represent the details of each target object. In dog data images, there are differences and similarities in the characteristics of dog species, such as fur color, texture, and body shape. So in this article, the author selected and created two different dog datasets, representing targets with significant differences and similarities in features. Compare and analyze the performance of Mask R-CNN object detection and instance segmentation using these two dog datasets.

References

Aditya Khosla, Nityananda Jayadevaprakash, Bangpeng Yao and Li Fei-Fei. Novel dataset for Fine-Grained Image Categori- zation. First Workshop on Fine-Grained Visual Categorization (FGVC), IEEE Conference on Computer Vision and Pattern Rec- ognition (CVPR), 2011.

T.-Y. Lin, P. Doll,ar, R. Girshick, K. He, B. Hariharan, and S. Belongie. Feature pyramid networks for object detection. In CVPR, 2017.

K. He, G. Gkioxari, P. Doll,ar, and R. Girshick. Mask r-cnn. In ICCV, 2017.

S. Ren, K. He, R. Girshick, and J. Sun. Faster R-CNN: Towards real-time object detection with region proposal networks. In NIPS, 2015.

J. Long, E. Shelhamer, and T. Darrell. Fully convolutional net- works for semantic segmentation. InCVPR,2015.

Yang L, Song Q, Wang Z, et al. Parsing R-CNN for In- stance-Level Human Analysis[J]. 2018.

Ji, H., Xu, X., Su, G., Wang, J., & Wang, Y. (2024). Utilizing Machine Learning for Precise Audience Targeting in Data Science and Targeted Advertising. Academic Journal of Science and Technology, 9(2), 215-220.

Ma, Y., Shen, Z., & Shen, J. (2024). Cloud Computing and Hyperscale Data Centers: A Comparative Study of Usage Patterns. Journal of Theory and Practice of Engineering Science, 4(06), 11-19.

Ren, Z. (2024). VGCN: An Enhanced Graph Convolutional Network Model for Text Classification. Journal of Industrial Engineering and Applied Science, 2(4), 110-115.

Ren, Z. (2024). Enhanced YOLOv8 Infrared Image Object Detection Method with SPD Module. Journal of Theory and Practice in Engineering and Technology, 1(2), 1–7. Retrieved from https://woodyinternational.com/index.php/jtpet/article/view/42

Yuan, B., & Song, T. (2023, November). Structural Resilience and Connectivity of the IPv6 Internet: An AS-level Topology Examination. In Proceedings of the 4th International Conference on Artificial Intelligence and Computer Engineering (pp. 853-856).

Yuan, B., Song, T., & Yao, J. (2024, January). Identification of important nodes in the information propagation network based on the artificial intelligence method. In 2024 4th International Conference on Consumer Electronics and Computer Engineering (ICCECE) (pp. 11-14). IEEE.

Wang, Z. (2024, August). CausalBench: A Comprehensive Benchmark for Evaluating Causal Reasoning Capabilities of Large Language Models. In Proceedings of the 10th SIGHAN Workshop on Chinese Language Processing (SIGHAN-10) (pp. 143-151).

Wu, Z. (2024). Deep Learning with Improved Metaheuristic Optimization for Traffic Flow Prediction. Journal of Computer Science and Technology Studies, 6(4), 47-53.

Lyu, H., Wang, Z., & Babakhani, A. (2020). A UHF/UWB hybrid RFID tag with a 51-m energy-harvesting sensitivity for remote vital-sign monitoring. IEEE transactions on microwave theory and techniques, 68(11), 4886-4895.

Lu, Q., Guo, X., Yang, H., Wu, Z., & Mao, C. (2024). Research on Adaptive Algorithm Recommendation System Based on Parallel Data Mining Platform. Advances in Computer, Signals and Systems, 8(5), 23-33.

Wu, X., Wu, Y., Li, X., Ye, Z., Gu, X., Wu, Z., & Yang, Y. (2024). Application of adaptive machine learning systems in heterogeneous data environments. Global Academic Frontiers, 2(3), 37-50.

Downloads

Published

2024-09-29
Loading...