Primary tabs

University of Idaho

Since 1889, the University of Idaho has provided motivated students with a transformative higher education experience that prepares them to solve real-world problems and achieve success in their lives and careers.

Learn more at

Data Extent

Other Access

The information on this page (the dataset metadata) is also available in these formats.


via the DKAN API

Data from: Modeling tree canopy height using machine learning over mixed vegetation landscapes

Although the random forest algorithm has been widely applied to remotely sensed data to predict characteristics of forests, such as tree canopy height, the effect of spatial non-stationarity in the modeling process is oftentimes neglected. Previous studies have proposed methods to address the spatial variance at local scales, but few have explored the spatial autocorrelation pattern of residuals in modeling tree canopy height or investigated the relationship between canopy height and model performance. By combining Light Detection and Ranging (LiDAR) and Landsat datasets, we used spatially-weighted geographical random forest (GRF) and traditional random forest (TRF) methods to predict tree canopy height in a mixed dry forest woodland in complex mountainous terrain. Comparisons between TRF and GRF models show that the latter can lower predefined extreme residuals, and thus make the model performance relatively stronger. Moreover, the relationship between model performance and degree of variation of true canopy height can vary considerably within different height quantiles. Both models are likely to present underestimates and overestimates when the corresponding tree canopy heights are high (>95% quantile) and low (<median), respectively. This study provides a critical insight into the relationship between tree canopy height and predictive abilities of random forest models when taking account of spatial non-stationarity. Conclusions indicate that a trade-off approach based on the actual need of project should be taken when selecting an optimal model integrating both local and global effects in modeling attributes such as canopy height from remotely sensed data.

Data Use
License: CC-BY-NC-SA 4.0
Recommended Citation: Wang, H., Seaborn, T., & Wang, Z. (2021). Data from: Modeling tree canopy height using machine learning over mixed vegetation landscapes [Data set]. University of Idaho.

Release Date
Spatial / Geographical Coverage Area
POLYGON ((-117.28909945115 44.038494233469, -117.28909945115 44.754636434027, -116.58359214664 44.754636434027, -116.58359214664 44.038494233469))
Spatial / Geographical Coverage Location
Mann Creek Watershed, Idaho, USA
Temporal Coverage
Saturday, September 9, 2017 - 00:00 to Saturday, October 14, 2017 - 00:00
English (United States)
Hui Wang, Travis Seaborn, Zhe Wang
Contact Name
Hui (William) Wang
Contact Email
Public Access Level
Data available on:: 
Wednesday, October 13, 2021