Dynamic View Synthesis of Thin Structures with Short-term Movements from Monocular Videos Using Neural Radiance Fields
Conference proceeding

Uchitha Rajapaksha, Hamid Laga, Dean Diepeveen, Mohammed Bennamoun and Ferdous Sohel
Proceedings - 2024 25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024, pp.9-16
25th International Conference on Digital Image Computing: Techniques and Applications (DICTA 2024) (Perth, WA, 27/11/2024–29/11/2024)
2024

Abstract

Keywords: 3D reconstruction; 3D rendering; Dynamics; Geometry; Image reconstruction; Image sequences; Neural radiance field; Optical flow; Rendering (computer graphics); Thin objects; Three-dimensional displays; Tracking; Videos
Learning to generate motions of thin structures such as plant leaves in dynamic view synthesis is challenging, because thin structures usually undergo small but fast, non-rigid motions as they interact with air and wind. Given a set of RGB images or videos of a scene with moving thin structures as input, existing methods that map the scene to its corresponding canonical space for rendering novel views fail because the object movements are too subtle compared to the background. Disentangling objects with thin parts from the background scene is also challenging when the parts move rapidly. To address these issues, we propose a Neural Radiance Field (NeRF)-based framework that accurately reconstructs thin structures such as leaves and captures their subtle, fast motions. The framework learns the geometry of a scene by mapping the dynamic images to a canonical scene in which the scene remains static. We propose a ray masking network to further decompose the canonical scene into foreground and background, enabling the network to focus more on foreground movements. We conducted experiments on a dataset of thin structures such as leaves and petals, comprising image sequences that we collected and one public image sequence. The experiments show superior results compared to existing methods. Video outputs are available at https://dythinobjects.com/.
