Output list
Preprint
Advances and Trends in the 3D Reconstruction of the Shape and Motion of Animals
Posted to a preprint site 22/08/2025
ArXiv.org
Reconstructing the 3D geometry, pose, and motion of animals is a long-standing problem, which has a wide range of applications, from biology, livestock management, and animal conservation and welfare to content creation in digital entertainment and Virtual/Augmented Reality (VR/AR). Traditionally, 3D models of real animals are obtained using 3D scanners. These, however, are intrusive, often prohibitively expensive, and difficult to deploy in the natural environment of the animals. In recent years, we have seen a significant surge in deep learning-based techniques that enable the 3D reconstruction, in a non-intrusive manner, of the shape and motion of dynamic objects just from their RGB image and/or video observations. Several papers have explored their application and extension to various types of animals. This paper surveys the latest developments in this emerging and growing field of research. It categorizes and discusses the state-of-the-art methods based on their input modalities, the way the 3D geometry and motion of animals are represented, the type of reconstruction techniques they use, and the training mechanisms they adopt. It also analyzes the performance of some key methods, discusses their strengths and limitations, and identifies current challenges and directions for future research.
Journal article
Published 2024
Computers and electronics in agriculture, 218, 108719
Aphids are persistent insect pests that severely impact agricultural productivity. The detection of aphid infestations is critical for mitigating their effects. This paper presents an artificial intelligence approach to detect aphids in crop images captured by consumer-grade RGB imaging cameras. In addition to detecting the presence of aphids, the size of the aphid is an important indicator of infestation severity. To address these, we present a Bayesian multi-task learning model to detect the presence of aphids and estimate their size simultaneously.
Our model employs a joint loss function, combining a classification loss and a customised size loss. The classification component aims to identify images containing aphids, whilst the customised size loss function estimates the size of the aphids. The latter is specifically designed to account for discrepancies between the estimated and actual ground truth sizes, enhancing the accuracy of the size estimation. The model utilizes a ResNet18 backbone, ensuring robustness and adaptability across various conditions.
The proposed model was evaluated using an agricultural pest dataset consisting of images of corn, rape, rice, and wheat crops. It achieved aphid presence detection accuracies of 75.77%, 66.39%, 70.01%, and 59% for corn, rape, rice, and wheat images, respectively. An in-depth evaluation of predictive uncertainties revealed areas of high confidence and potential inaccuracies for both size and presence of aphids in images, offering insight for future model refinement. We also conducted an ablation study to thoroughly analyse the contributions of each component in proposed model.
Our model offers a valuable tool that can be used in pest management strategies for facilitating more sustainable and efficient agricultural practices.