Dynamic View Synthesis of Thin Structures with Short-term Movements from Monocular Videos Using Neural Radiance Fields

Uchitha Rajapaksha; Hamid Laga; Dean Diepeveen; Mohammed Bennamoun; Ferdous Sohel

doi:10.1109/DICTA63115.2024.00015


Conference proceeding

Proceedings - 2024 25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024, pp.9-16

25th International Conference on Digital Image Computing: Techniques and Applications (DICTA 2024) (Perth, WA, 27/11/2024–29/11/2024)

2024

DOI: https://doi.org/10.1109/DICTA63115.2024.00015

Keywords: 3D reconstruction; 3D rendering; Dynamics; Geometry; Image reconstruction; Image sequences; Neural radiance field; Optical flow; Rendering (computer graphics); thin objects; Three-dimensional displays; Tracking; Videos

Abstract

Learning to generate motions of thin structures such as plant leaves in dynamic view synthesis is challenging. This is because thin structures usually undergo small but fast, non-rigid motions as they interact with air and wind. Given a set of RGB images or videos of a scene with moving thin structures, existing methods that map the scene to a corresponding canonical space for rendering novel views fail because the object movements are too subtle compared to the background. Disentangling objects with thin parts from the background scene is also challenging when the parts move rapidly. To address these issues, we propose a Neural Radiance Field (NeRF)-based framework that accurately reconstructs thin structures such as leaves and captures their subtle, fast motions. The framework learns the geometry of a scene by mapping the dynamic images to a canonical scene that remains static. We propose a ray masking network to further decompose the canonical scene into foreground and background, thus enabling the network to focus more on foreground movements. We conducted experiments using a dataset of thin structures such as leaves and petals, which includes image sequences collected by us and one public image sequence. Experiments show superior results compared to existing methods. Video outputs are available at https://dythinobjects.com/.
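
Illustrative sketch (not from the paper): the abstract names two ideas, mapping observed points to a static canonical scene and using a per-ray mask to emphasise the foreground. The toy Python below shows one way these ideas might fit together. Every function name, the sinusoidal deformation, and the loss weighting are assumptions made for illustration only; the abstract does not describe the authors' networks or losses at this level of detail.

```python
import math


def to_canonical(point, t, amplitude=0.02, frequency=5.0):
    """Toy deformation: undo a small sinusoidal sway along x at time t.

    Hypothetical stand-in for a learned deformation network.
    """
    x, y, z = point
    return (x - amplitude * math.sin(frequency * t), y, z)


def blend_ray_color(fg_color, bg_color, mask):
    """Blend foreground and background RGB with a per-ray mask in [0, 1]."""
    m = min(1.0, max(0.0, mask))  # clamp the mask to a valid weight
    return tuple(m * f + (1.0 - m) * b for f, b in zip(fg_color, bg_color))


def masked_photometric_loss(pred, target, mask, fg_weight=4.0):
    """Squared RGB error, up-weighted on rays the mask marks as foreground.

    The weight is 1 for pure background (mask = 0) and fg_weight for
    pure foreground (mask = 1); fg_weight is an assumed hyperparameter.
    """
    err = sum((p - q) ** 2 for p, q in zip(pred, target))
    return (1.0 + (fg_weight - 1.0) * mask) * err
```

In this sketch a mask value near 1 both selects the foreground colour for a ray and increases that ray's contribution to the loss, which is one simple way a model could be made to "focus more on foreground movements".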

Details

Title: Dynamic View Synthesis of Thin Structures with Short-term Movements from Monocular Videos Using Neural Radiance Fields
Authors/Creators: Uchitha Rajapaksha - Murdoch University
Hamid Laga - Murdoch University, Centre for Biosecurity and One Health
Dean Diepeveen - Murdoch University, School of Information Technology
Mohammed Bennamoun - The University of Western Australia
Ferdous Sohel - Murdoch University, Centre for Crop and Food Innovation
Publication Details: Proceedings - 2024 25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024, pp.9-16
Conference: 25th International Conference on Digital Image Computing: Techniques and Applications (DICTA 2024) (Perth, WA, 27/11/2024–29/11/2024)
Publisher: IEEE
Grant note: DP22010219 / Australian Research Council (ARC Discovery) (10.13039/501100000923); Murdoch University's MIPS (10.13039/501100001799)
Identifiers: 991005751327407891
Murdoch Affiliation: School of Information Technology
Language: English
Resource Type: Conference proceeding
