Logo image
Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
Journal article   Open access   Peer reviewed

Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping

P. D. Macintyre, A. van Niekerk, M. P. Dobrowolski, J. L. Tsakalos and L. Mucina
Ecology and evolution, Vol.8(13), pp.6728-6737
2018
PMCID: PMC6053567
PMID: 30038769
pdf
vegetation mapping479.86 kBDownloadView
CC BY V4.0 Open Access

Abstract

functional redundancy machine learning predictive modeling predictive vegetation mapping vegetation patterns vegetation-environment relationship
Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time-consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation-environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation-environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation-environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well-structured vegetation-environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation-environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions.

Details

UN Sustainable Development Goals (SDGs)

This output has contributed to the advancement of the following goals:

#13 Climate Action
#15 Life on Land

Source: InCites

Metrics

6 File views/ downloads
54 Record Views

InCites Highlights

These are selected metrics from InCites Benchmarking & Analytics tool, related to this output

Collaboration types
Domestic collaboration
International collaboration
Citation topics
4 Electrical Engineering, Electronics & Computer Science
4.169 Remote Sensing
4.169.91 Vegetation Mapping
Web Of Science research areas
Ecology
Evolutionary Biology
ESI research areas
Environment/Ecology
Logo image