Post on 06-Apr-2018
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
1/18
Modeling distribution of Amazonian tree species and diversity using remotesensing measurements
Sassan Saatchi a,, Wolfgang Buermann b, Hans ter Steege c, Scott Mori d, Thomas B. Smith e
aJet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109 USAb Center for Tropical Research, Institute of the Environment, University of California, Los Angeles, Los Angeles, CA 90095 USA
c Institute of Environmental Biology, Section Plant Ecology and Biodiversity, Utrecht University, Sorbonnelaan 14, 3584 CA Utrecht, The Netherlandsd New York Botanical Garden, 200th Street and Kazimiroff Blvd., Bronx, NY 10458 USA
e Center for Tropical Research, Institute of the Environment and Department of Ecology and Evolutionary Biology, University of California, Los Angeles,
Los Angeles, CA 90095 USA
Received 26 December 2006; received in revised form 8 January 2008; accepted 12 January 2008
Abstract
The availability of a wide range of satellite measurements of environmental variables at different spatial and temporal resolutions, together withan increasing number of digitized and georeferenced species occurrences, has created the opportunity to model and monitor species geographicdistribution and richness at regional to continental scales. In this paper, we examine the application of recently developed global data productsfrom satellite observations in modeling the potential distribution of tree species and diversity in the Amazon basin. We use data from satellitesensors, including MODIS, QSCAT, SRTM, and TRMM, to develop different environmental variables related to vegetation, landscape, andclimate. These variables are used in a maximum entropy method (Maxent) to model the geographical distribution of five commercial trees and toclassify the patterns of tree alpha-diversity in the Amazon basin. Maxent simulations are analyzed using binomial tests of omission rates and the
area under the receiver operating characteristics (ROC) curves to examine the model performance, the accuracy of geographic distributions, andthe significance of environmental variables for discriminating suitable habitats. To evaluate the importance of satellite data, we used the Maxentjackknife test to quantify the training gains from data layers and to compare the results with model simulations using climate-only data. For allspecies and tree alpha-diversity, modeled distributions are in agreement with historical data and field observations. The results compare withclimate-derived patterns, but provide better spatial resolution and detailed information on the habitat characteristics. Among satellite data products,QSCAT backscatter, representing canopy moisture and roughness, and MODIS leaf area index (LAI) are the most important variables in almost allcases. Model simulations suggest that climate and remote sensing results are complementary and that the best distribution patterns can be achievedwhen the two data sets are combined. 2008 Elsevier Inc. All rights reserved.
Keywords: Species distribution; Remote sensing data; Maxent; Amazon basin; Tree diversity
1. Introduction
Recent efforts to conserve biodiversity are moving beyond preserving only its pattern, such as particular species orpopulations, to include the many complex processes that pro-duce and maintain biodiversity (Cowling and Pressey, 2001;Crandall et al., 2000). The conservation of regional biodiversity
is inextricably linked with the species that occur in a region, thegenes they contain, and the other biotic and abiotic features thatcomprise the ecosystem (Myers et al., 2000). Under pressure tomake informed management decisions rapidly, conservation
practitioners must increasingly rely on predictive models toprovide them with information on species distributions (Ferrier,2002; Loiselle et al., 2003). In addition, using models to predictspecies distributions have become key elements in documenting
biodiversity on the planet and are critical to understanding theeffect of multiple stresses caused by climate and human-inducedchanges (Fjeldsa & Lovett, 1997; Pimm, 1991).
Available online at www.sciencedirect.com
Remote Sensing of Environment 112 (2008) 20002017www.elsevier.com/locate/rse
Corresponding author. Tel.: +1 818 354 1051; fax: +1 818 393 5184. E-mail address: saatchi@congo.jpl.nasa.gov (S. Saatchi).
0034-4257/$ - see front matter 2008 Elsevier Inc. All rights reserved.doi:10.1016/j.rse.2008.01.008
mailto:saatchi@congo.jpl.nasa.govhttp://dx.doi.org/10.1016/j.rse.2008.01.008http://dx.doi.org/10.1016/j.rse.2008.01.008mailto:saatchi@congo.jpl.nasa.gov8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
2/18
The majority of studies in biogeography use species occur-rence or museum collections to map and analyze large-scale
patterns of species distribution and richness (Lovett et al., 2000;Rahbek & Graves, 2001). These studies clearly indicate thatspecies differ in the size of their geographic range. Most species,within the same assemblage, tend to have relatively small
ranges that reflect how they share space (Brown et al., 1996;Gaston, 1998). Range size may depend on a variety ofecological and evolutionary processes and extrinsic factors ofthe physical environment such as soils, nutrients, water, andclimate (Gentry, 1988; Hunter, 2003; Kreft et al., 2006; Smithet al., 2001). Capturing the interplay of these factors is fun-damental to understanding the uneven distribution of diversityon regional and global scales.
In most biogeographic theories, geographic distribution ofspecies and their diversity or richness are conceived in terms ofa multidimensional coordinate system, whose axes are variousresource gradients (e.g. ecological and environmental vari-
ables). This coordinate system defines a hyperspace, and therange of the space that a given species occupies is its niche. Theniche is an abstract characterization of the intra-community
position of the species that depends on time, space, and dif-ferences in resource gradients that cause the species evolution(Whittaker, 1972). Geographic distribution of species and theirdiversity or richness depends on how well their ecological nicheis understood.
It is widely accepted that measurement of environmentalrequirements to quantify the range size and patterns of speciesdistribution and richness is an important step towards thisunderstanding (Woodward, 1987). This generalization is true ata variety of spatial scales, suggesting the importance of mea-
surements of environmental variables at different scales. Forexample, climate variables are of increasing importance as thescale increases from regional to continental to global scales.Currently, there is an increasing urgency among conservation
biologists to quantify the environmental requirements of par-ticular species at finer spatial scales in order to better prioritizeconservation efforts. This has created the need to collect spatialinformation over large regions using remotely sensed measure-ments from airborne or satellite sensors (Turner et al., 2003).In addition, the use of remote sensing data by conservation
biologists has helped frame new and important research ques-tions. Can remote sensing data identify areas of significance to
biodiversity, predict species distributions, and model commu-nity responses to environmental and anthropogenic changes?Answering these questions depends on several assumptions:1) environmental variables, and biophysical properties thatcharacterize species habitat, and drive its distribution are de-tectable by existing remote sensing sensors, 2) there are suf-ficient and spatially representative field observations of species
presence or absence and habitat characteristics, and 3) there aredistribution models capable of extending the field observationsto regional and global scales with the aid of environmentalvariables produced by remote sensing measurements. There has
been an increasing interest in studying these assumptions inrecent years (Turner et al., 2003; Nagendra, 2001; Guisan &Zimmermann, 2000; Peng, 2000).
This paper examines the potential use of recently developedglobal datasets from satellite observations for mapping distribu-tion patterns of tree species and diversity in the Amazon basin.Unlike regions with limited species richness and strong gradientsof climate variables, the Amazon basin has one of the highestspecies diversity and richness in the world, but comparatively
little variations in climatic variables (temperature and rainfall)(Nelson et al., 1990; De Oliveira and Mori, 1999). These regionalcharacteristics limit the use of climate variables to developecological and distribution models. Remote sensing data, on theother hand, provide spatially refined information on landscapeand vegetation heterogeneity over the Amazon basin that can bereadily incorporated in models to predict species distribution anddiversity. These models are either strictly mathematical or basedon certain ecological theories. The detailed discussion or reviewof these models and the ecological theories are beyond the scopeof this paper (Elith et al., 2006; Graham & Hijmans, 2006).
Here, we are interested to model the distribution of five
widespread commercial trees, and tree alpha-diversity (ex-pressed as Fisher's alpha) over the Amazon basin. We use themaximum entropy method (Maxent) (Phillips et al., 2005) thatintegrates remote sensing and geographical point locality data ofspecies in order to model distributions and provides a predictive
probability to assess the contribution of remote sensing datalayers. The paper is organized into three sections: 1) descriptionof species, remote sensing, and climate data,2) description of theMaxent model and simulations used for testing the applicationofremote sensing data, 3) assessment of potential range distribu-tions, and 4) discussion on the contribution and significance ofremote sensing data for characterizing suitable areas of specieshabitat.
2. Species data
2.1. Amazonian tree species
Five widespread and well-documented commercial timbertrees were selected for distribution modeling. The geographicallocations of trees were extracted from the herbarium collectionof the New York Botanical Gardens and included data froma variety of forest types and landscape features in northernSouth America (Fig. 1). The species studied were: Calophyllumbrasiliense (Clusiaceae), Carapa guianensis (Meliaceae), Hura
crepitans (Euphorbiaceae), Manilkara bidentata (Sapotaceae),and Virola surinamensis (Myristicaceae). The data set did notinclude any subspecies with strong distributional or functionalcharacteristics or preferences that might influence the overalldistribution.
C. brasiliense (Clusiaceae), known in Brazil by the commonname of jacareuba, grows as a canopy tree in a variety of soil,slopes, and elevations (up to 1500 m). The tree can reach 45 min height with a straight bole without any buttresses or branchesfor about 2/3 of the height. C. (Clusiaceae) is a tropical genuscomposed of approximately one hundred species. Its naturalgeographical range extends from southern Mexico throughoutCentral America to northern parts of South America (Record& Hess, 1943). It is also found in several Caribbean islands
2001S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
3/18
(Marques & Joly, 2000). We acquired 96 point localities spreadover the Amazon basin for this tree.
C. guianensis (Meliaceae) is a deciduous or semi-evergreenmedium-size species, up to 35 m tall with a straight cylindrical
bole up to 100200 cm in diameter. The species, with the tradename of andiroba, is found in the West Indies, in the Caribbeanislands, throughout Central America and south to the centralAmazon. We acquired 88 point localities for this tree speciesspread mainly along rivers. The tree establishes itself on richsoils along streams, in periodically inundated swamp forests, inupland forests along the rivers of the Amazon basin, and inelevations ranging from shorelines to 1200 m (Guariguata et al.,2002).
H. crepitans (Euphorbiaceae), known as Acacu in Brazil, isanothertalltree,rangingfrom25to50minheightwithclearbolesof 15 to 30 m and with diameters ranging from 100150 cm (at
times to 200 cm). The tree has a native range in tropical America,but has been naturalized in other parts of the world. Its rangeextends from Central America and the Caribbean islands tonorthern South America, with larger concentrations in Colombia,Ecuador, and northern Peru and within the white water varzeafloodplains along the Amazon River. H. crepitans is also foundextensively in coastal Venezuela and the Guyanas on pure sand ormoist sandy loam and is frequently cultivated as a shade treeelsewhere (Freiberg, 1996). However, for this study, we couldonly acquire 45 point localities for this species.
M. bidentata (Sapotaceae) is a large evergreen forest tree foundthroughout the West Indies, ranging from Mexico throughoutPanama to northern South America, and from Venezuela to Peru,including northern Brazil and the Guyanas. The tree is extremely
shade tolerant and grows from coastal sea levels up to fewhundred meters in elevation. There were 140 point localities forthis species that included the two subspecies of bidentata and
surinamensis.
V. surinamensis (Myristicaceae), known as Ucuba in Brazil,has a variety of commercial and medicinal values and is foundin swampy, fertile and periodically inundated riverbanks, inAmazonian varzea forests, and in degraded and secondaryforests. Its geographical range in the Neotropics extends fromCentral America, Costa Rica, and Panama down to the northernAmazon basin and the eastern coastal region in the Guyanas.The tree grows modestly in the open forest gaps and can attain asize of 30 m in height and 100 cm in diameter (Fisher et al.,1991). The tree canopy has seasonal characteristics. In FrenchGuiana, Ucuba flowers twice a year, in March and September,
but near Manaus flowering extends from August to November
and fruiting from January to July (Howe, 1990; Rodriguez,1972). We found 133 point localities for this species.
2.2. Amazonian tree diversity
The tree diversity data were from a total of 633 plots locatedon a variety of forest types, including terra firme, floodplains,and swamps in the Amazon basin and the Guiana Shields(Fig. 2). The data were primarily from published 1 ha plots ofthe ATDN database (ter Steege et al., 2003), however, a numberof smaller plots with sufficient trees (more than 150 individuals)of a diameter at breast-height, dbhN10 cm or larger ones werealso included in the data set (ter Steege et al., 2003). Tree alpha-diversity, expressed as Fisher's alpha (), a measure which
Fig. 1. Geographic locations of tree species inventory data for five commercial tree species, C. brasiliense, C. guianensis, H. crepitans, M. bidentata, andV. surinamensis.
2002 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
4/18
corrects for the sample size, was calculated for each plot usingS a ln 1 N
a
, where S is the number of species, N is the
number of individuals, and is the diversity coefficient (Fisheret al., 1943).
Plots' geographical locations, tree alpha-diversity, and foresttypes were among data sets acquired from the Amazon TreeDiversity Network. Detailed information about the data setsand a partial list of references for the published plot data in theAmazon can be found in ter Steege et al. (2003).
3. Environmental data
3.1. Remote sensing data
We compiled a set of remote sensing data and products fromdifferent earth observing sensors to derive metrics sensitive to
vegetation and landscape variables. The data set included bothoptical and microwave satellite sensors. To quantify spatial andtemporal patterns in canopy structure, we used the monthly 1 kmLAI (Leaf Area Index) data derived from MODIS reflectanceover the five-year period, 20002004 (Myneni et al., 2002). Itis noteworthy to mention that in this study we preferred LAIover NDVI or any other vegetation index because of how itrelates to canopy structure and seasonality and it had under-gone various quality checks before and during LAI algorithmimplementation (Myneni et al., 2002). The MODIS 8-day LAI
products provided the basis for these monthly composites,which improved the data quality by further reducing the impactof clouds and any possible LAI estimation errors. We pro-duced monthly climatological means by averaging values over
5 years (20002004). The climatological composites were thenused to generate five metrics: annual maximum, (Fig. 3a)minimum, mean, standard deviation, and range (difference ofmaximum and minimum). These LAI metrics provide infor-
mation on net primary productivity and vegetation seasonality,both important for characterizing species geographical range.
We also included the MODIS-derived vegetation continuousfield (VCF) product as a measure of the percentage of treecanopy cover within each 1 km pixel resolution (Hansen et al.,2002). The VCF product is generated from the time seriescomposites of MODIS data from year 2001 and is availablefrom the Global Land Cover Facility at the University ofMaryland. The VCF product separates open (e.g., shrub lands,savannas), fragmented, and deforested areas from those of intactold growth forests (Fig. 3b).
As part of the microwave remote sensing measurements, we
included global QSCAT (Quick Scatterometer) data available inthree-day composites at 2.25 km resolution (Long et al., 2001).The three-day data over 5 years (20002004) were used tocreate average monthly composites at 1 km resolution and thenfurther processed to produce four metrics that included annualmean and standard deviation of radar backscatter at both HHand VV polarizations (H: horizontal, V: vertical). QSCAT radarmeasurements are at KU band (12 GHz) and are sensitive tosurface or canopy roughness, moisture, and other seasonalattributes, such as phenological changes. For areas with lowvegetation biomass, such as woodlands and savanna, measure-ments at different polarizations correlate positively with theaboveground biomass (Long et al., 2001; Saatchi et al., 2007).For areas with dense forest, backscatter measurements are
Fig. 2. Geographic locations of tree alpha-diversity (Fisher's alpha) in terra firme and inundated forests. Dots indicate the maximum Fisher's alpha found at onelocation (n =633).
2003S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
5/18
sensitive to canopy roughness and moisture and contribute tomeasuring differences in forest types and canopy structure. Thelong term (5 years) average of MODIS and QSCAT data andthe metrics used in this study are assumed to approximatelyrepresent the climatological mean of the environmental varia-
bles they represent (Buermann et al., 2002). In this study, weused the annual mean (Fig. 3c) and standard deviation ofQSCAT HH backscatter data over 1 year and excluded the VV
backscatter data because of its high correlation with the HHbackscatter over tropical forests.
Finally, we included the SRTM (Shuttle Radar TopographyMission) digital elevation data, aggregated from a 100-meterresolution to 1 km. In addition to the mean elevation (Fig. 3d),the standard deviation was also included to represent surface
ruggedness or roughness. Overall, seven remote sensing datalayers (2 LAI, 2 QSCAT, 1 VCF, 2 SRTM) were included in thisstudy (Table 1). These layers were chosen after performing acorrelation test and removing highly correlated layers (Buer-mann et al., in press).
3.2. Climate data
A series of climate metrics were obtained from WorldClim(WorldClim version 1.4; Hijmans et al., 2005). These climatemetrics are derived from monthly temperature and rainfall valuesand represent biologically meaningful variables for characterizingspecies distribution (Nix, 1986). The WorldClim data layers in-cluded 11 temperature and eight precipitation metrics, expressing
Fig. 3. A selection of the remote sensing data layers used in this study. The panels show (a) MODIS LAI annual maximum, (b) MODIS percentage tree cover,(c) QSCAT annual mean, and (d) mean elevation from SRTM.
2004 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
6/18
spatial variations in annual means, seasonality, and extreme orlimiting climatic factors. The climate metrics were developedusing long time series of a global network of more than 4000weather stations from various sources such as the GlobalHistorical Climatology Network (GHCN), the FAO (the United
Nations Food and Agricultural Organization), the WMO (WorldMeteorological Organization), the International Center forTropical Agriculture (CIAT), R-HYdronet, and additionalcountry-based stations. The station data were interpolated tomonthly climate surfaces at 5 km spatial resolution by using athin-plate smoothing spline algorithm with latitude, longitude,and elevation (SRTM) as independent variables (Hijmans et al.,
2005).In addition to the bioclimatic variables interpolated from the
station data, we used remotely sensed precipitation data fromthe sensors onboard the Tropical Rainfall Mapping Mission(TRMM) (Kummerow et al., 1998). The TRMM products wereobtained from the global rainfall algorithm (3B43), combiningthe estimates from the sensors with the global gridded raingauge data from Climate Assessment and Monitoring System(CAMS), produced by NOAA's Climate Prediction Center and/or global rain gauge product, produced by the Global Pre-cipitation Climatology Center (GPCC). The output is rainfall for0.25 0.25 degree grid boxes for each month. Monthly rainfall
data from TRMM covering the tropical region (20N20S) andextended to (50N50S) over a period of 9 years (19982006)were used to develop climatologically averaged precipitationmetrics such as the total annual, driest quarter, wettest quarter,and seasonality (coefficient of variation). While developing theclimatological metrics, we resampled the TRMM data to 5 kmresolution using a cubic-spline routine in order to be compatiblewith the WorldClim data layers. The TRMM measurements aresuperior to precipitation layers in the WorldClim dataset becauseof direct rainfall measurements from space, calibration accuracy,and coverage over areas in the tropics where no ground sta-tions are available. After removing the correlated climate layers(Buermann et al., in press), we used only nine independentclimate variables for the model runs (Table 2).
4. Methodology
4.1. Maxent model
We used the Maxent algorithm, which has been very recentlyintroduced for modeling of species distributions (Phillips et al.,
2005). Maxent is a general-purpose algorithm that generatespredictions or inferences from an incomplete set of information.The Maxent approach is based on a probabilistic framework. Itrelies on the assumption that the incomplete empirical probabilitydistribution (which is based on the species occurrences) can beapproximated with a probability distribution of maximum en-tropy (the Maxent distribution) subject to certain environmentalconstraints, and that this distribution approximates a species
potential geographic distribution (Phillips et al., 2005). The inputdata includes a set of environmental layers for a geographicalregion and a set of species presence data inside that region. Likemost maximum likelihood estimation approaches, Maxent, a
priori assumes a uniform distribution and performs a numberof iterations in which the weights are adjusted to maximizethe average probability of the point localities (also known asthe average sample likelihood), expressed as the training gain(Phillips, 2005). These weights are then used to compute theMaxent distribution over the entire geographic space. As in thecase of the present study, Maxent can be applied to species
presence-only geographic locations and remote sensing data toproduce distributions expressing suitability of each grid cell as afunction of the environmental variables at that grid cell. A highvalue of the function at a particular grid cell indicates that thegrid cell is predicted to have suitable conditions for that species(Phillips, 2005).
Compared to other existing models, Maxent has a number offeatures that makes it very useful for modeling species distribu-tion (Elith et al., 2006; Phillips et al., 2005). These include adeterministic framework and, hence, stability as well as theability to run with presence-only point occurrences, high per-formance with few point localities, better computing efficiencyenabling the use of large-scale high-resolution data layers,continuous output from least to most suitable conditions, andability to model complex responses to environmental variables.Last but not least, the newest Maxent version (2.3) is equippedwith several features aimed at supporting the interpretation ofthe model results. For example, Maxent has a built-in jackknife
Table 2Bioclimatic variables used in Maxent predictions
Bioclimate layer5 km resolution
Layer description and unit
BIO1 Annual mean temperatureBIO2 Mean diurnal range (mean of monthly (max tempmin temp))BIO3 Temperature seasonality (standard deviation100)BIO4 Max temperature of warmest monthBIO5 Min temperature of coldest monthBIO6 TRMM annual precipitationBIO7 TRMM precipitation seasonality (coefficient of variation)BIO8 TRMM precipitation of wettest quarter
BIO9 TRMM precipitation of driest quarter
Table 1Overview of remote sensing data sets used in the Maxent predictions along withtheir native resolution and ecological interpretation
Data record Instrument Ecological variable Nativeresolution
Leaf area index
(LAI)
MODIS Vegetation phenology,
structure, and net primary productivity
1 km and 8 days
Maximum LAILAI rangePercent tree cover MODIS Forest cover and
heterogeneity1 km
Scatterometer-backscatter
QSCAT Surface (canopy)moistureand roughness(forest structure)
2.25 km and 3 days
Annual mean HHAnnual STD HHDEM SRTM Topography and
ruggedness90 m1 km
Rainfall TRMM Monthly rainfall 0.25 0.25 deg.
2005S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
7/18
option, which allows the estimation of the significance ofindividual environmental data layers in computing the speciesdistributions. It provides statistical measures for model per-formance such as omission rates and the areas under theReceiver Operating Characteristic (ROC) curve (AUC), andresponse curves for each environmental layer showing how the
Maxent prediction depends on a particular environmentalvariable (Phillips, 2005). In this study, we used two approachesto evaluate the model performance: the omission rates and theAUC. In general, a low omission rate of species occurrenceis necessary for potentially predicting the species distributionranges (Anderson et al., 2003).
4.2. Scenarios and quantitative analysis
We developed several experiments to test the contribution ofhigh-resolution satellite data in modeling species distributionand diversity. First, we used Maxent to model the distribution of
five tree species with remote sensing data only at 1 km reso-lution and evaluated the overall performance of the model andthe contribution of each data layer. We compared these resultswith Maxent models derived from climate data only andexamined the spatial details obtained from remote sensing data.
For species diversity, we divided the tree alpha-diversityvalues into different range classes based on the histogram of treealpha-diversity values for available sites, and used Maxentto provide distributions for each class. The selection of thediversity range for each category was based on how the valueswere distributed over the entire 633 sites (Fig. 2). We dividedthe sites into incremental groups to sample the distribution of
point locations and provide ample point locations for modeling.
The initial Maxent run was performed for training data withalpha b20, corresponding to all the low diversity sites in theAmazon. Afterward, the model runs were performed for all thesites with values greater than the Fisher's alpha threshold foreach category (N20, N40, N60, N80, N100, N120, N180). Weused a threshold of 25% for the predictive probabilities obtainedfor each class range and combined the derived distributions in adecision rule approach to develop a classification map of treealpha-diversity for the entire range. The threshold value of 25%allowed the largest predictive area and suitability for each classrange. After evaluating the contribution of remote sensing datalayers to model outputs, we compared the results with a similar
experiment obtained from climate-only data.Utilizing all point localities available for each species pro-duced the final distribution maps of tree species and diversity.Spatial accuracy of the Maxent predictions was tested closelyfollowing the procedures in Phillips et al. (2005). In detail, wecreated 10 random data partitions with 60% of the pointlocalities assigned for training and 40% for testing, and ran eachscenario with each of these 10 data partitions. Model perfor-mance was then tested at fixed thresholds (threshold-dependent)and across all thresholds (threshold-independent). In thethreshold-dependent case, we evaluated extrinsic (test) omis-
sion rates, defined as the fraction of test localities that fallinto pixels outside the predicted area, at the 10% cumulative
probability threshold. The proportional predicted area is also
provided as the fraction of all the pixels predicted as beingsuitable for the species. A one-tailed test, as a measure to assesswhether the omission rate is lower or higher than random, wasused to determine whether the model could significantly predictthe test localities.
In the threshold-independent test, we analyzed the area under
the ROC curve (AUC) for both training and the test datasets andestimated how significantly each model prediction differed fromrandom using a ties-corrected MannWhitney-U test (Phillipset al., 2005). The ROC curve provides a quantitative representa-tion of the tradeoffs between omission (sensitivity) and commis-sion error (1-specificity). The sensitivity represents the absenceof the omission error, and the quantity 1-specificity representsthe commission error (Cantor et al., 1999). The ROC curve isobtained by plotting the sensitivity on the y axis and 1-specificityon the x axis for all possible thresholds (Swets, 1988). The areaunder the ROC curve is an important metric to measure the model
performance. The larger the AUC, the highest is the sensitivity
rate and the lower is the 1-specificity rate. An AUC equal to 1.0represents an ideal diagnostic test because it achieves both 100%sensitivity and100% specificity. If AUC is 0.5, it indicates that thetest has50% sensitivity and50% specificity rates, suggesting highomission and commission errors (Cantor et al., 1999).
Finally, we compared the distributions obtained from remotesensing data with Maxent distributions derived from climatevariables. We repeated the same experiments with nine inde-
pendent bioclimatic variables and compared the final distribu-tions for tree species and tree alpha-diversity to illustrate thesignificance of spatial information in satellite observations andto explain the complementary habitat characteristics obtainedfrom remote sensing data and products.
5. Results
5.1. Distribution of tree species
Maxent models for distribution of five tree species weregenerated using the georeferenced locations and remote sensingdata layers at 1 km spatial resolution (Table 1) excluding theTRMM precipitation metrics. We used two indicators to examinethe performance of the model: extrinsic omission evaluated at afixed threshold and the threshold-independent area under theROC curve (AUC) (Table 3). The indicators were obtained using
40% of the point locality data as test localities with the remainderused for training. For all species and all data partitions, the AUCvalues were significantly better than random (0.5). The AUCsin the training and test cases generally showed only smalldifferences, suggesting little overfitting in the Maxent predictions.At the 10% fixed cumulative probability threshold, the extrinsicomission rates were small, associated with reasonable fractions of
predicted areas, again suggesting meaningful model predictions.The overall performance of the model for all five species werehigh, indicating that the Maxent-derived distributions were aclose approximation of the probability distribution that representsthe reality.
Maxent models for distribution of five tree species weregenerated using the georeferenced locations and remote sensing
2006 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
8/18
data layers at 1 km spatial resolution (Table 1) excludingthe TRMM precipitation metrics. We used two indicators toexamine the performance of the model: the fraction of predictedarea and extrinsic omission rate as threshold-dependent testsand the area under the ROC curve (AUC) as the threshold-independent test (Table 3). The indicators were obtained usingapproximately 40% of the training data as test localities forevaluating the performance statistics. For all species, the AUCvalues were significantly better than random (0.5), and withone-tailed pb0.001. This result was obtained for both thetraining and the test data, with the small difference in AUCvalues suggesting a robust performance of the Maxent algo-
rithm to capture the variations in environmental variables over point localities. All omission tests were calculated at 10%threshold value. At this threshold, the fractional predicted areashows the fraction of all the pixels that are predicted suitable forthe species. For all species, the extrinsic omission rates weresmall, suggesting that only a small fraction of the test locationsfell into pixels not predicted as suitable for species. The overall
performance of the model for all five species was high, im-plying that the Maxent-derived distributions were a close ap-proximation of the probability distribution that represents thereality.
The spatial distributions of tree species in terms of the
predictive probability were segmented in five categories torepresent the ranges of habitat suitability (Fig. 4). As de-scribed earlier, the value assigned to a pixel is the sum ofthe probabilities of that pixel and all other pixels with equalor lower probability multiplied by 100 to give a percentage.Theoretically, any pixel with probability greater than 1% isconsidered suitable for the species habitat. However, here weare mainly interested in areas with higher probability (N20%).
For C. brasiliense (Fig. 4a), the areas in central Amazonhave the highest cumulative probabilities (N20%). Floodplainsof the central Amazon, dominated by close canopy varzeaforests, the tidal varzea of the Amazon estuary in the state ofPara, and southern basins of Tapajos, Itiri, and Xingu rivers, allfall in the N50% probability. C. brasiliense is considered to be
one of the most exploited timber species in the varzea forestsand is on the verge of extinction in these regions due to un-sustainable logging practices (Higuchi et al., 1994). The modelalso predicts areas in fragmented forests along the Atlanticcoast of Brazil in southern Bahia, the northwestern Amazon inColombia, Ecuador, and Peru, and some areas of the Guiana
Shields as suitable habitats for the species (Fisher & DosSantos, 2001). The results of the jackknife test of variableimportance showed the highest gain (N0.3) for the QSCATmean backscatter data, suggesting areas with high moisture in afloodplain and terra firme forests as the suitable habitat forC. brasiliense (Fig. 5a). Other variables, such as the maximumLAI and the percent tree cover with moderate gains (0.10.2),were the next contenders in defining the habitat.
Maxent prediction for C. guianensis pointed to the centralAmazon, the states of Amazonas and Para, the Guiana Shields,the northern coast of Venezuela, and the Atlantic Coastal forestsand varzea floodplains as regions of highest probability for the
geographical range of the species (Fig. 4 b). Remote sensingvariables with the highest gains were QSCAT mean (0.49) backscatter and SRTM elevation (0.6). A close examinationofFig. 5b shows areas delineated by elevation less than 300 mand high moisture along the floodplains and coastal regionswere suitable habitats. In addition, the Colombian Pacific Coastregion (Choc) (Lellinger and Sota, 1978), a stretch of landmainly between the Pacific Ocean and Cordillera Occidental ofthe Andes, from west of the mouth of the Atrato River nearPanama to Mataje River in the south, bordering northwesternEcuador, was also predicted as the suitable range for C.
guianensis (known as tangare in the region) (Gentry, 1982).This result clearly shows the strength of remote sensing data and
the Maxent model for predicting species range, in particular inareas where no training data were available.
H. crepitans had the lowest numbers of point localities amongthe five species and they were scattered mainly in the westernAmazon. Maxent predicted areas outside the central Amazon asthe suitable habitat. Areas with the highest probabilities (N50%)were in the western lowlands of Peru and Ecuador, in southernBolivia, along the Beni river basin, and in the eastern regions ofthe Brazilian Amazon and the coastal regions of Surinam,Guyana, and French Guiana (Fig. 4c). Along the Atlantic coastof Brazil, the model predicts small regions in southern Bahia as
potentially suitable habitat for H. crepitans. Similarly, narrow
regions in varzea forests along the Rio Solimoes and itstributaries are delineated as potential habitat. In general, H. crepitans is considered a semi-evergreen species found inseasonal forests which, along with other emergent trees,undergoes foliage reduction during the dry season (Conditet al., 2000; Schongart et al., 2002). Among remote sensingdata,QSCAT mean and standard deviation, maximum and range ofLAI were selected as variables with high gains for defining thespecies range. Mean QSCAT and maximum LAI both reachedN0.6 gains through the jackknife analysis, suggesting forestswith canopy moisture, roughness, and leaf area as potentialhabitat (Fig.5c). The standard deviation of QSCATand the rangeof LAI had gains N0.3, and both pointed to forests with seasonalcanopy characteristics. Topography, on the other hand, was not
Table 3Results of threshold-dependent omission tests and threshold-independent ROCtests for five tree species, including fractional predicted area, test omission rates,and the area under the ROC curve (AUC)
Species name andnumber of occurrences
Threshold-dependent test Threshold-independenttest
Fractionalpredictedarea
Test omissionrate AUC test (training)
(a) C. brasiliense (96) 0.533 0.106 0.756 (0.821)(b) C. guianensis (88) 0.356 0.153 0.843 (0.914)(c) H. crepitans (45) 0.289 0.212 0.832 (0.921)(d) M. bidentata (140) 0.321 0.095 0.853 (0.907)(e) V. surinamensis (133) 0.377 0.142 0.833 (0.896)
Test omission rates were calculated at the 10% threshold level. Values representaverages from 10 separate random training/test data partitions. For eachpartition, statistical evaluations of test omission rates (one-tailed binomial) andtest AUC (MannWhitney-U) indicated that the predictions were significantlybetter than random (pb0.001; individual p-values not shown).
2007S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
9/18
Fig. 4. Maxent Prediction of potential geographic distribution of five tree species made using all occurrence records and the remote sensing data at 1 km resolution.The predictive probability values ranging from 0 to 100 are depicted by colors. (a) C. brasiliense, (b) C. guianensis, (c) H. crepitans, (d) M. bidentata, and(e) V. surinamensis.
2008 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
10/18
Fig. 5. Results from the Maxent jackknife test of the importance for remote sensing variables used for five tree species. The graphs depict the training gains when avariable is used in isolation, when the variable is excluded, and when all variables are utilized. The gain is a measure of how much better the Maxent probabilitydistribution fits the distribution of occurrence data. A variable has useful information when the gain is high and it is used in isolation and has unique information if itreduces the gain most when it is excluded.
2009S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
11/18
an important indicator for the habitat. Visual inspection of thepredicted areas at 1 km resolution confirmed that the distributioncompared well with the seasonal characteristics ofH. crepitansobserved in field experiments (Freiberg, 1996; Schongart, et al.,2002).
M. bidentata distribution is widespread in the Amazon basinwith the highest predicted probability in the Central regionof Brazil, especially in the state of Amazonas, Para (heavilylogged), Amapa, and Roriama, and along the northeasternAtlantic coast, the Amazon estuary, French Guiana, Surinam,
and Guyana (Fig. 4d). Maxent predicts areas in upland terrafirme and floodplain forests as preferred habitats for M. bidentata. The distribution extends to western SouthAmerica, the state of Acre in Brazil, northern Peru, Ecuador,and Colombia. The range also covered areas in the westernChoc region of Colombia to southern Panama (Faber-Langendoen & Gentry, 1991). Several remote sensing variableshelp define the geographical range ofM. bidentata. The traininggains obtained from the jackknife statistics showed meanQSCAT (gain N0.5), standard deviation (gain N0.3), MODIS
percent tree cover (gain N0.4), SRTM elevation (gain N0.4),and maximum LAI (gain N0.4) are among the important re-
mote sensing variables contributing to the predicted distribution(Fig. 5d). In general, closed canopy forests (high percent treecover), moist (high mean QSCAT), with low seasonality (lowQSCAT standard deviation) and medium LAI with almost noseasonal variations were the best indicators for the habitat. Therange was primarily limited to areas of low elevation and smallvariations in topography.
V. surinamensis is also predicted to be widespread in thecentral Amazon, extending from the eastern Atlantic coast andthe Guiana Shields to the western regions of Peru, Ecuador,Colombia, and areas along the lowland Andes to southernridges in Bolivia (Fig. 4e). Areas with high predictive
probability are predominantly along the Amazon River flood-plains, coastal forests extending to Venezuela, and in the Choc
region of western Colombia to Panama (Fisher et al., 1991;Gentry, 1975; Howe, 1990; Rodriguez, 1972). Areas in thesouthern Amazon region of Brazil along the transitional forestsare also predicted as suitable habitat, but with lower probability(b20%). Analysis of Maxent results showed seasonality as lessimportant in characterizing the distribution. This is mainly due
to the fact that the point localities of the species covered a widerange of landscapes with a wide range of seasonality. However,the canopy moisture from QSCAT backscatter (gain N0.5),maximum LAI from MODIS (gain N0.4), and the elevationfrom SRTM (gain 0.4) contributed significantly in definingthe species range (Fig. 5e). One of the most apparent features inthe distribution is the high probability of prediction (N50%) inthe Amazon floodplains.
5.2. Distribution of tree alpha-diversity
Using the remote sensing data layers, we ran the Maxent
model for nine categories of Fisher's alpha (Fig. 6). Thenumbers of point localities for each category were sufficient forthe model runs without encountering problems associated withover- or under-predictions. In fact, in all cases, the model
performance determined by the area under the ROC curve wassignificantly (pb0.001) better than random (Table 4). Similarly,the training omission rates were small and the fractional
predicted area over the entire environmental space coveringnorthern South America was reasonably large. The results inTable 4 provided confidence in Maxent prediction of spatialdistribution of tree alpha-diversity for each category. The resultsfrom dividing the point localities randomly in training (60% of
points) and testing (40% of points) reduced the AUC values
about 5% on the average, suggesting a reliable model per-formance under more constrained conditions. The threshold-dependent omission tests also provided low omission rates andrelatively large fractional predicted area (Table 4).
After combining the distribution maps derived for all ninescenarios, a classification map of Fisher's alpha was produced
Fig. 6. Results from Maxent predictions of tree alpha-diversity made fromremote sensing data and inventory plots (n =633). Classification map of treealpha-diversity produced from Maxent predictions using 25% probability
threshold for each class range.
Table 4Results from the threshold-dependent omission tests and threshold-independentROC tests for eight range classes of tree alpha-diversity (expressed as Fisher'salpha), including fractional predicted area
Fisher'salpha-
diversityclass
Numberof point
localities
Threshold-dependent test Threshold-independenttest
Fractionalpredicted area
Test omissionrate
AUC test (training)
b20 118 0.345 0.118 0.861(0.897)N20 515 0.278 0.128 0.877(0.909)N40 370 0.266 0.106 0.886(0.915)N60 249 0.226 0.117 0.901(0.932)N80 176 0.221 0.141 0.907(0.940)N100 112 0.207 0.106 0.911(0.933)N120 79 0.184 0.080 0.935(0.951)N180 28 0.144 0.038 0.952(0.970)
Test omission rates were calculated at the 10% threshold level. Values representaverages from 10 separate random training/test data partitions. For eachpartition, statistical evaluations of test omission rates (one-tailed binomial) and
test AUC (Mann
Whitney-U) indicated that the predictions were significantlybetter than random (pb0.001; individual p-values not shown).
2010 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
12/18
at 1 km resolution. The resulting map is a potential distributionof ranges of tree alpha-diversity over the basin (Fig. 6). Giventhe Maxent performance and the distribution of sampling sitesover the basin, any extrapolation of diversity outside the mainrange of the sampled data and over other ecosystems or biomassis not allowed. Therefore, the analysis of the range map is
restricted to the Amazon basin.The largest swath of high tree diversity is in terra firme forest
of the western Amazon, stretching from the foothills of theAndes in southern Colombia, Ecuador, and Peru into the centralAmazon in Brazil. The Fisher's alpha associated with thisregion exceeded 180. The map also revealed many smaller areasof high tree diversity (alpha N180) in areas east of the Rio Negroand north of Manaus, in northeastern Brazil in the state ofAmapa, the southern basins of Tapajos and Xingu rivers in thestate of Para, areas along southern and eastern Guiana Shields,and outside the basin in the Atlantic coastal forests in Bahia.
In contrast, areas in the central Amazon between the Rio
Negro and Solimoes and their tributaries are dominated byforests with lower tree alpha-diversity (alpha b120). This regionis dominated by extensive river systems, varzea and igapofloodplains, and depending on their proximity to rivers and theirsediment load, and topography, the forests contain a large rangeof diversity. Further south, in transitional deciduous and semi-deciduous forests of Brazil and in Chiquitano dry forests ofBolivia, the diversity drops to its minimum (alpha b40). Notethat the separation of the high diversity Amazon basin from thesurrounding woodland and grassland savanna (cerrado) andhigher elevation Andean vegetation types is an artifact resultingfrom Maxent's extrapolation over the environmental space. Asthere were no point localities sampling these biomes in our
database, model predictions for these regions are not warranted.The jackknife training gains for the remote sensing variables
were performed for all individual runs and showed almost thesame results with slight variability in gain values. We showresults for alpha N20 to demonstrate the significance of eachvariable (Fig. 7). The most important variables were vegetationcanopy roughness and moisture from QSCAT, percent treecover from MODIS, and mean and seasonality of LAI fromMODIS. The highest gain was achieved for the QSCAT mean
backscatter (gain N0.7), maximum LAI (gain N0.6), and percenttree cover (gain N0.5). SRTM elevation, although important forthe overall performance of the model, had a low gain compared
to other variables (gain b0.3). LAI range and the standarddeviation of QSCAT both had relatively high gains, suggestingseasonality as important variables for the distribution of areaswith high diversity. An examination of the model responsecurves to input variables suggested that, in general, high mois-ture and low seasonality were associated with areas of high
diversity.To further demonstrate the role of remote sensing data in
separating the tree diversity over the basin, we plotted theFisher's alpha for all 633 plots with respect to the mean andstandard deviation of backscatter from QSCAT and the SRTMstandard deviation (Fig. 8). In each case, we developed anenvelope based on exponential functions to show the relation-ship between the maximum (equivalent to 90th quantile) treealpha-diversity and the remote sensing variable. The envelopesshowed three important trends: 1) QSCAT backscatter were
positively correlated (R2 =0.73, pb0.001) with the maximumalpha-diversity suggesting areas with higher canopy moisture
and roughness associated with higher diversity (Fig. 8a),2) QSCAT standard deviation of backscatter was negativelycorrelated (R2 =0.81, pb0.001) with maximum tree alpha-diversity indicating higher tree diversity was associated withareas of less seasonality in high stability in moisture (Fig. 8b),and 3) maximum tree alpha-diversity was negatively correlated(R2 =0.62, pb0.001) with standard deviation of SRTMsuggesting areas with less variations in elevation, primarilylowlands, were associated with higher tree diversity (Fig. 8c).
5.3. Comparison with climate derived models
We performed the comparison of remote sensing results with
distributions derived from nine bioclimatic layers for all fivetree species and the Fisher's alpha scenarios. Here, we showdistributions from V. surinamensis and the tree alpha-diversityto summarize the results from bioclimatic variables (Fig. 9). Forall tree species, the distributions derived from remote sensingdata were superior to climate data mainly because of the verycoarse spatial resolution of the climate data and the limitedvariations of temperature and precipitation over the lowlandAmazonian forests where species data were collected. Forexample, the V. surinamensis distributions from climate (Fig. 9a)and remote sensing data (Fig. 4e) have similar patterns for com-
parable predictive probabilities. However, there are two distinct
Fig. 7. Example of the Maxent jackknife test gains for the importance of remote sensing variables for tree alpha-diversity N20 (n =515).
2011S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
13/18
differences: 1) finer resolution remote sensing data allows
prediction of range patterns along geomorphological features inthe central Amazon and topographical and vegetation featuresalong the Guiana Shields. These features disappear in climateresults.2) Models fromclimate data underpredict thespeciesrangein southern margins of the Amazon along the transitional forestsand overpredict in savanna regions such as Roraima, the GranSavanna and areas in cerrado Brazil. The jackknife test (Fig. 9b)shows several temperature and rainfall variables such as thetemperature mean diurnal range (BIO2), seasonality (BIO3),temperature of the coldest month (BIO5), total annual precipita-tion (BIO6), and mean precipitation of the driest month (BIO9)as important habitat characteristics. If predicted correctly, thesevariables are complementary to remote sensing data (Fig. 5e)where canopy moisture and seasonality (QSCAT), maximum leaf
area (MODIS), and low elevation (SRTM) are the dominanthabitat characteristics.
The distribution for Fisher's alpha classes derived fromclimate data (Fig. 9c) has general patterns similar to thoseobtainedbyremotesensingdata(Fig.7). Visual comparison of thetwo diversity maps reveals three regions of high diversity within
the Amazon basin: 1) the western Amazon basin, includingwestern and northwestern Brazil, northern Peru, Ecuador, and theeastern Colombian Amazon, 2) the central Amazon basin, in-cluding areas east of the Rio Negro and north of Manaus towestern Para, and 3) areas in northeastern Brazil and part of theGuiana Shields. Except for the western Amazon basin, the pat-terns do not necessarily cover the exact geographical regions. Ingeneral, the distribution from remote sensing data is spatiallyrefined and shows patterns delineated by geomorphological andgeological features of the Amazon basin, whereas, the climate-derived distribution is coarse in resolution and is distinguished
primarily by patterns of precipitation. Similarly, areas of high
diversity (alphaN
180) in the Guiana Shields appear continuousin the remote sensing results, with the patterns following thegeological and vegetation gradients, but patchy and discontinuousin the climate results. The southwestern region of the Amazon,including southern Peru and Bolivia, also appear different in thetwo distributions. The climate results show low diversity withinthe dense old growth forests of lowland Peru caused by tem-
perature and precipitation seasonality. In contrast, the remotesensing results show higher tree alpha-diversity (alpha N120) inthe lowland old growth forests of Peru and northern Bolivia. Ingeneral, the distribution of tree alpha-diversity from the remotesensing data has much smoother variations within the Amazon
basin than the climate results. The Maxent model results from the
climate data underpredict the tree alpha-diversity in most areas ofthe central Amazon and create patterns different from observedtree diversity in the region (ter Steege et al., 2003). The jackknifetest highlights these points by choosing rainfall of the driestquarter (BIO9), total rainfall (BIO6), and temperature range(BIO2), seasonality (BIO3), and minimum temperature of thecoldest month (BIO5) as important variables in predicting areas ofhigh diversity (ter Steege et al., 2006). Given the complementaryinformation in remote sensing and climate layers, it is expectedthat the best distribution for diversity may be produced fromcombined climate and remote sensing data (Prates-Clark et al.,2008).
6. Discussion and conclusion
6.1. Contribution of remote sensing data
For both tree species and tree alpha-diversity, remote sensingdatasets provided meaningful and significant contributions indefining the distribution range and spatial patterns. We sum-marize these contributions in two areas: 1) improving the spatialresolution and, therefore, providing landscape-level details on
potential habitat characteristics, and 2) adding to the pool ofenvironmental variables beyond climate surfaces and henceimproving the definition of habitat properties and ecologicalniche.
Fig. 8. Distribution of tree alpha-diversity of inventory plots as a function ofselected remote sensing variables. Exponential functions are used as envelopesto show the general trends of maximum diversity (equivalent to 90th quantile)with respect to (a) QSCAT annual mean backscatter, (b) QSCAT annual standarddeviation of backscatter, and (c) standard deviation of SRTM data.
2012 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
14/18
A subset of the tree alpha-diversity maps produced from 1 kmremote sensing data (Fig. 6) and 5 km climate data (Fig. 9c) overthe central Amazon in Amazonia and the Para states of Brazil
demonstrated this point (Fig. 10). With the 1 km remote sensingdata (Fig. 10a), landscape features due to forest fragmentation,differences in forest cover (terra firme and inundated), andlandscape geomorphology were readily delineated and reason-able differences in their tree alpha-diversity were observed. Incontrast, the climate-derived map (Fig. 10b), although showinga general pattern, does not provide reasonable prediction anduseful patterns of tree alpha-diversity in this region. In general,climate surfaces interpolated from station data or derived fromcoarse resolution satellite measurements cannot capture land-scape-scale variations in diversity. It is also important to note thatunlike potential species distribution influenced by environmen-tal factors, diversity depends on the size of the area sampled,climate, past history, and local influences, such as soil, geology,
and nutrients. Therefore, diversity is very much a local orregional property of a forest and cannot be readily extrapolatedto other regions. Nevertheless, as demonstrated in this study,
quantification of landscape heterogeneity from high-resolutionsatellite observations can readily improve our understanding ofthe biogeography and biodiversity of the lowland Amazonianrainforests from typical postulated or observed distribution
barriers such as unfavorable past climates, mountains, rivers,and river floodplains (Tuomisto et al., 1995).
Furthermore, remote sensing data provide measurementsdirectly related to forest structure, species composition, gapfraction, and the overall health of the ecosystem that cancollectively improve our understanding of suitable habitats forspecies. Maxent offers response curves for the input environ-mental variables that allow examining how the predictions ofsuitable habitatdepend on each variable. To demonstrate this, weexamined the QSCAT response curves, as one of the most
Fig. 9. Maxent predictions derived from bioclimatic variables at 5 km resolution and corresponding Maxent jackknife test results for the variable importance.(a) V. surinamensis distribution, (b) jackknife test forV. surinamensis. (c) classification of tree alpha-diversity, and (d) jackknife test for alpha N20 class (n =515).
2013S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
15/18
important variables in predicting the distribution ofM. bidentataand V. surinamensis (Fig. 11). The response curves were derivedfrom Maxent runs with QSCAT used in isolation in order toavoid interferences with other variables. QSCAT values overmost of the point localities (shown as diamonds) and plotted aslog of sample frequency, are scattered within a small range of thevariable between 10.0 dB and 6.0 dB. These are typicalQSCAT values measured over tropical forests. Within thisrange,
there is a major difference in the response curves between thetwo species. These differences force Maxent to choose thresh-olds to separate suitable areas and predict different distribution
patterns, Areas outside of this range, although included in theoverall environmental space, will not contribute to defining thespecies range. We expect the combination of climate and remotesensing data and the multiscale analysis may provide the bestdistributions for species range and diversity.
Fig. 11. Maxent response and sample frequency forM. bidentata and V. surinamensis as a function of QSCAT annual mean backscatter. The response curves illustratehow the contribution to the raw Maxent prediction depends on a particular environmental variable (Phillips et al., 2005). The Maxent response curves were derived
from Maxent runs with QSCAT used in isolation to avoid interferences with other variables. The sample frequency (plotted as diamonds) shows the number of pointlocalities that fall in a certain QSCAT interval.
Fig. 10. Comparison of Maxent predictions of tree alpha-diversity classification over the Atlantic Coastal Forests of Bahia from (a) 1 km remote sensing data and(b) 5 km bioclimatic variables.
2014 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
16/18
6.2. Maxent predictions
For those species whose distribution patterns are determinedprimarily by evolutionary processes and gene flow and less byenvironmental variables, especially those detected by climate,landscape, or vegetation, predictive models are not useful. In
general, models are based on algorithms that extrapolate thesensitivity developedby thetraining data from point localities andthe environmental variables to a larger space. A true statistical testfor any model prediction is how significantly and consistently it
performs better than a random prediction. Maxent has internalroutines that provide threshold-dependent and threshold-inde-
pendent tests of its performance by examining the omission rateand the area under the ROC curve (AUC) for a set of randomlyselected test localities. In case of low sensitivity to environmentalvariables for a set of point localities, both tests will show poor
performance of the algorithm. In all cases studied here, consistentand significantly better than random performance were achieved.
However, without absence data (not available in most biologicaldata), there seems to be no source of negative instances to ac-curately measure the predictions over the rest of the environ-mental space. Phillips et al. (2005) developed non-parametrictests using the MannWhitney-U-statistic and a sample of 10,000
pixels drawn randomly from the study region to examine towhat extent the prediction differs from random (AUC of 0.5).Although, this approach is not as rigorous as tests with absencedata, it provides more confidence in model predictions. Fur-thermore, it provides enough confidence in distribution patternsderived from remote sensing data. Following a similar approach,we concluded that in examples used in our study, Maxent
predictions were significantly different than random and its
performance was close to optimal. These tests suggest that modelpredictions derived from remote sensing data provide meaningfuland reasonable distribution patterns. With the predictions relatedto tree alpha-diversity, the results also confirmed that thedistribution is only valid within the biomes sampled by the
presence data and anyextrapolation of diversity to other biomesisnot valid.
6.3. Future work
Spatial, spectral, and temporal diversity of recent satelliteobservations and improvement of algorithms to derive ecolo-
gically important variables has enhanced our capabilities inconservation biology in different areas: 1) to directly map in-dividual species over relatively large and spatially contiguousunits, 2) to map homogeneous associations dominated by fewspecies or diversity indicators, and 3) to develop environmentalrequirements for species range and diversity. Results obtainedfrom this study and similar studies using satellite data suggestthat nature conservation in Amazonia and other tropical re-gions can benefit from recognizing ecological heterogeneity atlandscape scales and the use of methods and datasets that cancarefully distinguish these heterogeneities (Nagendra, 2001;Tuomisto et al., 1995). Interpolation and extrapolation of fieldinventory data on species presence, richness, and diversity on aspatial scale depend on how well the environmental variables
and ecological heterogeneities are characterized on that scale.Analysis of satellite images over Amazonian forests has shownthat in addition to spatial information, the spectral data can beused to separate structurally and floristically distinct biotopeswithin the vegetation types already known (Chambers et al.,2006; Lucas et al., 2004).
These results also call for new areas of research in usingsatellite observations in biogeography and the conservation of
biodiversity. First, the use of remote sensing data in predictivemodels requires new rules and protocols to be developed inorder to improve the assessment of distribution patterns fromregional and continental to landscape scales. Finer-resolutionenvironmental data may not improve model predictions, as theymay be incompatible with the spatial scale of the inventory datafrom natural history museums and herbaria and they mayintroduce unwanted and additional statistics in the input dataand thus impact the performance of predictive models. Furtherresearch is also required to determine to what degree the spatial
resolution of satellite observation can help or limit theidentification of species' environmental requirements andtheir ecological niche.
Research is also needed regarding the utility of spectral dataor remote sensing products to quantify and map ecologicallyimportant features on landscapes. Currently, remote sensing-
based habitat characterization is mainly based on the relation between spectral data and the structure, chemistry, andheterogeneity of vegetation within a pixel resolution. It is notclear whether habitat suitability for most species can be definedin terms of these variables. Currently, specieshabitat relation-ships are defined by eco-region classifications based on climate,geology, and natural barriers. In particular, for mobile taxa like
birds or butterflies, the specieshabitat relationships are notwell defined or quantifiable. New approaches are required toextend these relationships to physical environmental variablesdetectable by satellite observations. In addition, research is alsorequired to explore whether modeling distributions for a numberof species or communities or developing relationships betweenspectral data and species richness and diversity may enhanceand improve the utility of satellite data.
Detection and assessment of changes in landscapes, such asdeforestation and land use change, have been explored exten-sively in environmental sciences using the temporal diversity ofremote sensing data. In addition, availability of time series
satellite observation in the past three decades has also providedinformation on both the stability and the dynamics of ecosystemsand natural habitats. In this study, we used time series data fromMODIS and QSCAT to develop climatological metrics. However,further research is required to improve the utility of these datasetsin predictive models and to understand the impact of land use
patterns and interannual variability present in time series remotesensing data on species distribution.
Acknowledgement
We thank Donat Agosti for his help with acquiring the treespecies data and Ana Paula Giorgi for her help with the GISlayers. The tree species data was provided by the herbarium of
2015S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
8/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
17/18
the New York Botanical Garden and the tree diversity data isfrom the Amazon Tree Diversity Network. However, this workwould not have been possible without decades of dedicatedand methodic fieldwork of numerous researchers across SouthAmerica. We would also like to thank the reviewers who pro-vided us with important suggestions and criticisms of the original
manuscript. This work was performed at the Jet PropulsionLaboratory, California Institute of Technology, and the UCLACenter forTropical Research, Institute of the Environment, under acontract from the National Aeronautics and Space Administration.
References
Anderson, R. P., Lew, D., & Peterson, A. T. (2003). Evaluating predictivemodels of species' distributions: Criteria for selecting optimal models.
Ecological Modelling, 162, 211232.Brown, J. H., Stevens, G. C., & Kaufman, D. M. (1996). The geographic range:
Size, shape, boundaries, and internal structure. Annual Review of Ecologyand Systematics, 27, 597623.
Buermann, B., Saatchi, S., Zutta, B. R., Chaves, J., Mil, B., Graham, C. H.,
Smith, T. B. (in press). Application of remote sensing data in predictivemodels of species' distribution. Journal of Biogeography.
Buermann, W., Wang, Y., Dong, J., Zhou, L., Zeng, X., Dickinson, R. E., Potter,C. S., & Myneni, R. B. (2002). Analysis of a multiyear global vegetationleaf area index data set. Journal of Geophysical Research, 107, 4646.doi:10.1029/2001JD000975
Cantor, S. B., Sun, C. C., Tortolero-Luna, G., Richards-Kortum, R., & Follen, M.(1999). A comparison of C/B ratios from studies using receiver operatingcharacteristic curve analysis. Journal of Clinical Epidemiology, 52, 885892.
Chambers, J. Q., Asner, G. P., Morton, D. C., Anderson, L. O., Saatchi, S. S.,Esprito-Santo, F. D. B., Palace, M., & Souza, C., Jr. (2006). Regionalecosystem structure and function: Ecological insights from remote sensingof tropical forests. Trends in Ecology & Evolution, 22(8), 414423.
Condit, R., Watts, K., Bohlman, S. A., Prez, R., Hubbell, S. P., & Foster, R. B.(2000). Quantifying the deciduousness of tropical forest canopies under
varying climates. Journal of Vegetation Science, 11, 649658.Cowling, R. M., & Pressey, R. L. (2001). Rapid plant diversification: Planning
for an evolutionary future. Proceedings of the National Academy of Sciences, 98(10), 54525457.
Crandall, K. C., Bininda-Emonds, O. R. P., Mace, G. M., & Wayne, R. K.(2000). Cosidering evolutionary processes in conservation biology. Trendsin Ecology and Evolution, 15(7), 290295.
De Oliveira, A. A., & Mori, S. (1999). A central Amazonian terra firme forest. I.High tree species richness on poor soils. Conservation Biology, 8,12191244.
Elith, J., Graham, C. H., Anderson, R. P., Dudk, M., Ferrier, S., Guisan, A., et al.(2006). Novel methods improve prediction of species' distributions fromoccurrence data. Ecography, 29, 129151.
Faber-Langendoen, D., & Gentry, A. H. (1991). The structure and diversity ofrain forests at Bajo Calima, Choco Region, Western Colombia. Biotropica,
23, 211.Ferrier, S. (2002). Mapping spatial pattern in biodiversity for regional conservation
planning: Where to from here? Systematic Biology, 51(2), 331363.Fisher, R. A., Corbet, A. S., & Williams, C. B. (1943). The relation between the
number of species and the number of individuals in a random sample of ananimal population. Journal of Animal Ecology, 12, 4258.
Fisher, E., & Dos Santos, F. A. M. (2001). Demography, phenology and sex ofCalophyllum brasiliense (Clusiaceae) trees in the Atlantic forest. Journal ofTropical Ecology, 17, 903909.
Fisher, B. L., Howe, H. F., & Wright, S. J. (1991). Survival and growth of Virolasurinamensis yearlings: Water augmentation in gap and understory. Oeco-logia, 86, 292297.
Fjeldsa, J., & Lovett, J. C. (1997). Biodiversity and environmental stability. Biodiversity and Conservation, 6, 315323.
Freiberg, M. (1996). Spatial distribution of vascular epiphytes on three emergent
canopy trees in French Guiana. Biotropica, 28(3), 345355.
Gaston, K. J. (1998). Species-range size distributions: Products of speciation,extinction, and transformation. Philosophical Transactions of the RoyalSociety B, 353, 219/230.
Gentry, A. H. (1975). Additional Panamanian Myristicaceae. Annals of the Missouri Botanical Garden, 62(2), 474479.
Gentry, A. H. (1982). Phytogeographic patterns as evidence for a Choc refuge.In G. T. Prance (Ed.), Biological diversification in the tropics (pp. 112136).
New York: Columbia University Press.Gentry, A. H. (1988). Tree species richness of upper Amazonian Forests. Pro-ceeding of National Academy of Sciences , 85, 156169.
Graham, C. H., & Hijmans, R. J. (2006). A comparison of methods of speciesranges and species richness. Global Ecology and Biogeography, 15,578587.
Guariguata, M. R., Claire, H. A., & Jones, G. (2002). Tree seed fate in a loggedand fragmented forest landscape, Northeastern Costa Rica. Biotropica, 34,405415.
Guisan, A., & Zimmermann, N. E. (2000). Predictive habitat distribution modelsin ecology. Ecological Modelling, 135, 147186.
Hansen, M. C., DeFries, R., Townshend, J., Sohlberg, R., Dimiceli, C., &Carroll, M. (2002). Towards an operational MODIS continuous field ofpercent tree cover algorithm: Examples using AVHRR and MODIS data.Remote Sensing of Environment, 83, 303319.
Higuchi, N., Hummel, A. C., Freitas, J. V., Malinowski, J. R. E., & Stokes, R.(1994). Explorao Florestal nas Vrzeas do Estado do Amazonas: Seleode rvore, Derrubada e Transporte. Proceedings of the VII Harvesting andTransportation of Timber Products (pp. 168193). Curitiba, Brazil: IUFRO/UFPR.
Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G., & Jarvis, A. (2005).Very high resolution interpolated climate surfaces for global land areas.
International Journal of Climatology, 25, 19651978.Howe, F. (1990). Survival and growth of juvenile Virola surinamensis in Panama:
Effects of herbivory and canopy closure. Journal of Tropical Ecology, 6,259280.
Hunter, J. T. (2003). Factors affecting range size differences for plant species onrock outcrops in eastern Australia. Diversity and Distributions, 9, 211 /220.
Kreft, H., Sommer, J. H., & Barthlott, W. (2006). The significance of geographicrange size for spatial diversity patterns in Neotropical palms. Ecography,
29, 2130.Lellinger, D. B., & Sota, E. R. de la (1978). The phytogeography of the
pteridophytes of the Departamento del Choc, Colombia. NationalGeographic Society Research Reports 1969 projects (pp. 381387).
Loiselle, B. A., Howell, C. A., Graham, C. H., Goerck, J. M., Brooks, T., Smith,K. G., & Williams, P. H. (2003). Avoiding pitfalls of using species distributionmodels in conservation planning. Conservation Biology, 17, 15911600.
Long, D. G., Drinkwater, M., Holt, B., Saatchi, S., & Bertoia, C. (2001). Globalice and land climate studies using scatterometer image data. EOS,Transaction of American Geophysical Union, 82(43).
Lovett, J. C., Rudd, S., Taplin, J., & Frimodt-Mller, C. (2000). Patterns of plantdiversity in Africa south of the Sahara and their implications forconservation management. Biodiversity and Conservation, 9, 3342.
Lucas, R., Held, A. A., Phinn, S. R., & Saatchi, S. S. (2004). Tropical forests. InS. Ustin (Ed.), Remote Sensing of Earth Sciences: Manual of Remote
Sensing, Vol. 4. John Wiley & Sons.Kummerow, C., Barnes, W., Kozu, T., Shiue, J., & Simpson, J. (1998). The
Tropical Rainfall Measuring Mission (TRMM) sensor package. Journal ofAtmospheric and Oceanic Technology, 15, 809817.
Marques, M. C. M., & Joly, C. A. (2000). Seed germination and growth ofCalophyllum brasiliensie (Clusiaceae), a typical species of flooded forests.Acta Botanica Brasilica, 14(1), 113120.
Myers, N., Mittermeier, R. A., Mittermeier, C. G., da Fonseca, G. A. B., & Kent,J. (2000). Biodiversity hotspots for conservation priorities. Nature (London),403, 853858.
Myneni, R. B., Hoffman, S., Knyazikhin, Y., Privette, J. L., Glassy, J., Tian, Y.,et al. (2002). Global products of vegetation leaf area and fraction absorbedPAR from year one of MODIS data. Remote Sensing of Environment, 83,214231.
Nagendra, H. (2001). Using remote sensing to assess biodiversity. International
Journal of Remote Sensing, 22, 23772400.
2016 S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
http://dx.doi.org/10.1029/2001JD000975http://dx.doi.org/10.1029/2001JD0009758/3/2019 Modelos de Distribucion y Divers Id Ad de Arboles en La Amazonia Usando Sensores Remotos
18/18
Nelson, B. W., Ferreira, C. A. C., da Silva, M. F., & Kawasaki, M. L. (1990).Endemism centres, refugia and botanical collection density in BrazilianAmazonia. Nature, 345, 714716.
Nix, H. A. (1986). A biogeographic analysis of Australian elapid snakes. In:Atlas of Australian ElapidSnakes (pp. 415).Canberra: Bureau Flora Fauna.
Peng, C. (2000). From static biogeographical model to dynamic globalvegetation model: A global perspective on modeling vegetation dynamics.
Ecological Modelling, 135, 3354.Phillips, S. (2005). A brief tutorial on Maxent. AT& T Research (from http://www.cs.princeton.edu/~schapire/maxent/tutorial/tutorial.doc ).
Phillips, S., Anderson, R. P., & Schapire, R. E. (2005). Maximum entropymodelling of species geographic distributions. Ecological Modelling, 190,231259.
Pimm, S. L. (1991). The balance of nature? Ecological issues in conservation ofspecies and communities. Chicago: The University of Chicago Press.
Prates-Clark, C. D. C., Saatchi, S. S., & Agosti, D. (2008). Predictinggeographical distribution models of high-value timber trees in the AmazonBasin using remotely sensed data. Ecological Modelling, 211(3-4),309323.
Rahbek, C., & Graves, G. R. (2001). Multiscale assessment of patterns of avianspecies richness. Proceedings of the National Academy of Sciences of theUnited States of America, 98, 4534 /4539.
Record, S. J., & Hess, R. W. (1943). Timbers of the new world. New Haven(USA): Yale Univ. Press.
Rodriguez, W. A. (1972). A ucuuba de vrzea e suas aplicaes. ActaAmazonica, 2(2), 2947.
Saatchi, S., Houghton, R., Avala, R., Yu, Y., & Soares, J. -V. (2007). Spatialdistribution of live aboveground biomass in Amazon Basin. Global Change
Biology, 13, 816837.Schongart, J., Piedade, M. T. F., Ludwigshausen, S., Horna, V., & Worbes, M.
(2002). Phenology and stem growth periodicity of tree species in Amazonianflood plain forests. Journal of Tropical Ecology, 18, 581597.
Smith, T. B., Schneider, C. J., & Holder, K. (2001). Refugial isolation versus
ecological gradients: Testing alternative mechanisms of evolutionary diver-gence in four rainforest vertebrates. Genetica (Dordrecht), 112113, 383398.Swets, J. A. (1988). Measuring the accuracy of diagnostic systems. Science,
240, 12851293.ter Steege, H., Pitman, N., Philips, O., Chave, J., Sabatier, D., Duque, A., et al.
(2006). Continental-scale patterns of canopy tree composition and functionacross Amazonia. Nature, 443(28), 444447.
ter Steege, H., Pitman, N., Sabatier, D., Castellanos, H., Van Der Hout, P., Daly,D. C., et al. (2003). A spatial model of tree alpha-diversity and tree densityfor the Amazon. Biodiversity and Conservation, 12, 22552277.
Tuomisto, H., Ruokolainen, K., Kalliola, R., Linna, A., Danjoy, W., & Rodriguez,Z. (1995). Dissection Amazonian biodiversity. Science, 269, 6366.
Turner, W., Spector, S., Gardiner, N., Fladeland, M., Sterling, E., & Steininger,M. (2003). Remote sensing for biodiversity science and conservation.Trends in Ecology and Evolution, 18, 306314.
Whittaker, R. H. (1972). Evolution and measurement of species diversity.Taxon, 21, 213251.
Woodward, F. I. (1987). Climate and plant distribution. Cambridge UniversityPress 190 pp.
2017S. Saatchi et al. / Remote Sensing of Environment 112 (2008) 20002017
http://www.cs.princeton.edu/~schapire/maxent/tutorial/tutorial.dochttp://www.cs.princeton.edu/~schapire/maxent/tutorial/tutorial.dochttp://www.cs.princeton.edu/~schapire/maxent/tutorial/tutorial.dochttp://www.cs.princeton.edu/~schapire/maxent/tutorial/tutorial.doc