If you have not already done so, you’ll need to start R and load the
spatstat
package by typing
library(spatstat)
We will study a dataset that records the locations of Ponderosa Pine
trees (Pinus ponderosa) in a study region in the Klamath
National Forest in northern California. The data are included with
spatstat
as the dataset ponderosa
.
assign the data to a shorter name, like X
or
P
;
plot the data;
find out how many trees are recorded;
find the dimensions of the study region;
obtain an estimate of the average intensity of trees (number of trees per unit area).
The Ponderosa data, continued:
When you type plot(ponderosa)
, the command that is
actually executed is plot.ppp
, the plot method for point
patterns. Read the help file for the function plot.ppp
, and
find out which argument to the function can be used to control the main
title for the plot;
plot the Ponderosa data with the title Ponderosa Pine Trees above it;
from your reading of the help file, predict what will happen if we type
plot(ponderosa, chars="X", cols="green")
then check that your guess was correct;
try different values of the argument chars
, for
example, one of the integers 0 to 25, or a letter of the alphabet. (Note
the difference between chars=3
and chars="+"
,
and the difference between chars=4
and
chars="X"
).
The dataset japanesepines
contains the locations of
Japanese Black Pine trees in a study region.
Plot the japanesepines
data.
What is the average intensity (the average number of points per unit area?
Using density.ppp
, compute a kernel estimate of the
spatially-varying intensity function for the Japanese pines data, using
a Gaussian kernel with standard deviation \(\sigma=0.1\) units, and store the estimated
intensity in an object D
say.
Plot a colour image of the kernel estimate
D
.
Most plotting commands will accept the argument
add=TRUE
and interpret it to mean that the plot should be
drawn over the existing display, without clearing the screen beforehand.
Use this to plot a colour image of the kernel estimate D
with the original Japanese Pines data superimposed.
Plot the kernel estimate without the ‘colour ribbon’.
Try the following command
persp(D, theta=70, phi=25, shade=0.4)
and find the documentation for the arguments theta
,
phi
and shade
.
Find the maximum and minimum values of the intensity estimate
over the study region. (Hint: Use summary
or
range
)
The kernel estimate of intensity is defined so that its integral
over the entire study region is equal to the number of points in the
data pattern, ignoring edge effects. Check whether this is approximately
true in this example. (Hint: use integral
)
The dataset hamster
is a multitype pattern representing
the locations of cells of two types, dividing and
pyknotic.
plot the pattern;
plot the pattern again, changing the colours and symbols used to represent the two types of cells;
plot the patterns of pyknotic and dividing cells separately using
plot(split(hamster))
.
use relrisk
to perform cross-validated bandwidth
selection and computation of the relative intensity of pyknotic
cells.
The bei
dataset gives the locations of trees in a survey
area with additional covariate information in a list
bei.extra
.
Assign the elevation covariate to a variable elev
by
typing
elev <- bei.extra$elev
Plot the trees on top of an image of the elevation covariate.
Assume that the intensity of trees is a function \(\lambda(u) = \rho(e(u))\) where \(e(u)\) is the terrain elevation at location u. Compute a nonparametric estimate of the function \(\rho\) and plot it by
rh <- rhohat(bei, elev)
plot(rh)
Compute the predicted intensity based on this estimate of \(\rho\).
Compute a non-parametric estimate of intensity by kernel smoothing, and compare with the predicted intensity above.
Bonus info: To plot the two intensity estimates next to each
other you collect the estimates as a spatial object list
(solist
) and plot the result (the estimates are called
pred
and ker
below):
l <- solist(pred, ker)
plot(l, equal.ribbon = TRUE, main = "",
main.panel = c("rhohat prediction", "kernel smoothing"))