In recent years, societal changes have led to a growing prominence of pets in people's lives. However, uncontrolled pet reproduction in urban areas has given rise to a significant issue of stray animals, posing serious threats to human health and the environment. Conventional manual methods for counting stray animals face challenges in terms of efficiency and the risk of disease transmission. With technological advancements, image recognition, and sound identification, among other techniques, have emerged as crucial tools to address this issue. Image recognition, leveraging intuitive statistics based on external features, combined with the low-power attributes of sound identification and the health assessment capabilities of thermal imaging, collectively provide comprehensive technological support for stray animal population statistics. In the realm of image algorithms, both traditional target detection algorithms and deep learning methods such as RCNN and Faster RCNN employ convolutional neural networks to accurately identify and locate stray animals. Regarding sound algorithms, traditional Gaussian mixture models and hidden Markov models, as well as deep learning techniques involving convolutional neural networks, have effectively enhanced the accuracy of stray animal sound recognition. The integration of image and audio in a hybrid method significantly enhances stray animal monitoring. Employing advanced techniques in video tracking and sound recognition, this approach offers an efficient and practical solution, crucial for wildlife ecosystem surveillance and conservation. Research indicates that the application of deep learning methods in the domains of image and sound has significantly advanced compared to traditional approaches. In terms of image processing, I utilized the YOLO algorithm to perform grid division, feature extraction, and loss computation steps to achieve stray animal detection, demonstrating outstanding performance. Through the application of the GMM algorithm, we identified the vocal characteristics of stray animals and inferred their recognition effectiveness by employing likelihood functions. Our objective is to employ a combination of image and audio recognition with deep learning techniques to identify the population of stray animals within specific regions.
|