The dataset selected six key plant functional traits that reflect plant resource acquisition strategies and ecosystem functions, including specific leaf area (SLA), leaf dry matter content (LDMC), leaf nitrogen concentration (LNC), leaf phosphorus concentration (LPC), leaf area (LA), and wood density (WD). 34589 in-situ trait measurements of 3447 seed plants were collected from 1430 sampling points in China, and two machine learning models (random forest and lift regression tree) were used to generate spatial plant functional trait maps (∼ 1 km) as well as environmental variables and vegetation indices. To obtain the best estimate, the weighted average algorithm is further applied to merge the prediction results of the two models and obtain the final spatial plant functional trait map.
| collect place | China |
|---|---|
| data size | 737.0 MiB |
| data format | tif,excel |
| Coordinate system |
This study used 21 climate variables, including 19 bioclimatic variables, solar radiation (RAD), and drought index (AI). 19 bioclimatic variables and solar radiation data from WorldClim version 2.1 from 1970 to 2000( https://www.worldclim.org/data/worldclim21.html ).
The artificial impact data is sourced from the Consultative Group on International Agricultural Research (CGIAR-CSI) Spatial Information Consortium from 1970 to 2000 http://www.csi.cgiar.org The spatial resolution of climate data is 1 kilometer.
This study included 12 soil variables representing different aspects of soil properties, namely soil texture, bulk density (BD), pH value, and soil nutrients. All soil variables were extracted from the Chinese soil database for surface modeling http://globalchange.bnu.edu.cn/research/soil2
The terrain variable is elevation. Elevation data based on SRTM V4.1 database( https://www.resdc.cn/data.aspx?DATAID=123 Extract from the Chinese STRM 90m dataset. The spatial resolution is 1 km.
This study includes three types of vegetation indices. Firstly, EVI is extracted from MOD13A3 V006 product( https://lpdaac.usgs.gov/products/mod13a3v006/ ). This product provides monthly averages with a spatial resolution of 1 kilometer, ranging from January 2000 to December 2018. Secondly, MODIS reflectance data was extracted from MOD13A3 V006 product, including MIR reflectance, NIR reflectance, red reflectance, and blue reflectance. Thirdly, the MERIS Land Chlorophyll Index (MTCI) was obtained from the Earth Observation Data Center of the Natural Environment Research Council (NEODC, 2005)( https://data.ceda.ac.uk/ )Extracted from within. From June 2002 to December 2011, MTCI data was provided globally with a spatial resolution and range of 4.63 kilometers. It should be noted that the effective MTCI value should be greater than 1, so our study removed any values less than 1.
1. Field measurement data of six plant functional traits were collected from TRY and Chinese databases, as well as publicly published literature, and the functional traits of plant species were classified based on plant growth morphology, leaf type, and leaf phenology. After avoiding collinearity between climate, soil, terrain, and vegetation indices, multiple grid prediction factors were used.
2. Use random forests and boosting regression trees to train the relationship between plant functional traits and predictive factors for each PFT.
3. Use a land cover map (100 meters) to calculate the spatial abundance of each PFT within a 1-kilometer grid cell. Calculate the community weighted trait values within a 1-kilometer grid cell based on the abundance and predicted trait values of each PFT in step 2. In order to reduce the variability of different single models, we used the set model algorithm. According to the cross validation R2 value of random forest trees and lifting regression trees, we combined their prediction results to obtain the final spatial distribution map of plant functional traits.
This study was compared with research worldwide, and therefore, data from China was extracted from the global trait map. Before quantitatively comparing with previous studies, we carried out two steps to make the data products as comparable as possible and improve consistency between different studies. Firstly, due to the spatial resolution of the global trait map (mainly 0.5 °) being different from our study, we resampled the data products from previous research and our map to a spatial resolution of 0.5 °. In addition to Vallicrosa et al. (2022) generating global maps of LNC and LPC at a spatial resolution of 1 kilometer, we also compared the frequency distribution of Vallicrosa et al. (2022) with our study at a spatial resolution of 1 kilometer. Secondly, our research focuses on natural vegetation, so we filtered out non natural vegetation (such as cultivated land) using a global trait map. We quantitatively compared our map with previous research from two aspects. The comparison between trait maps uses frequency charts and spatial correlations.
This work is licensed under a
Creative
Commons Attribution 4.0 International License.
| # | title | file size |
|---|---|---|
| 1 | Data file used for machine learning models (updated in 20231019).xlsx | 1.2 MiB |
| 2 | _ncdc_meta_.json | 7.3 KiB |
| 3 | ensemble_LA_China.tif | 122.6 MiB |
| 4 | ensemble_LDMC_China.tif | 122.6 MiB |
| 5 | ensemble_LNC_China.tif | 122.6 MiB |
| 6 | ensemble_LPC_China.tif | 122.6 MiB |
| 7 | ensemble_SLA_China.tif | 122.6 MiB |
| 8 | ensemble_WD_China.tif | 122.6 MiB |
Specific leaf area (SLA) leaf dry matter content (LDMC) leaf nitrogen concentration (LNC) leaf phosphorus concentration (LPC) wood density (WD) leaf area LAI
©Copyright 2005-. Northwest Institute of Eco-Environment and Resources, CAS.
Donggang West Road 320, Lanzhou, Gansu, China (730000)

