WEBVTT
Kind: captions
Language: en-US

00:00:02.720 --> 00:00:05.040
Okay, good morning, everyone.
Today I’m going to be talking about

00:00:05.040 --> 00:00:07.040
some of the methods we’ve
developed to do earthquake

00:00:07.040 --> 00:00:10.320
forecasting during swarms.
Before we begin, I just wanted to

00:00:10.320 --> 00:00:14.696
thank my collaborators – Andy,
Nicholas, Morgan, Jeanne, and Sara,

00:00:14.720 --> 00:00:18.160
whom I’ve worked with on various
aspects of swarms over the years.

00:00:18.160 --> 00:00:20.160
And a special shout-out
goes to Sara, who is actually

00:00:20.160 --> 00:00:23.118
the one who came up
with this pretty great title.

00:00:26.000 --> 00:00:29.460
So earthquake swarms are sequences
that are thought to be driven primarily

00:00:29.460 --> 00:00:33.281
by some external transient aseismic 
process, such as fluid flow –

00:00:33.281 --> 00:00:38.240
either natural or anthropogenic, 
or slow slip. And while swarms in

00:00:38.240 --> 00:00:40.400
southern California, particularly in the
Salton Trough, have been causing

00:00:40.400 --> 00:00:43.840
more of a fuss recently, swarms
happen in the Bay Area, too,

00:00:43.840 --> 00:00:49.016
particularly in the East Bay,
shown here in this map,

00:00:49.040 --> 00:00:53.496
near the cities of San Ramon,
Danville, and Alamo.

00:00:53.520 --> 00:00:57.200
At least 11 swarms have happened
here since 1970, which you can

00:00:57.200 --> 00:01:03.096
see color-coded on this map by –
from a paper by Xue and others.

00:01:03.120 --> 00:01:07.200
And you can see some of their
characteristics here in this table,

00:01:07.200 --> 00:01:10.400
also from their paper.
So the largest earthquakes in these

00:01:10.400 --> 00:01:16.240
swarms tend to be in
the mid-4s – about 4.2, 4.4.

00:01:16.240 --> 00:01:19.896
And the swarms typically last
from weeks to months.

00:01:19.920 --> 00:01:23.760
These swarms seem to be occurring
in an extensional step-over where slip

00:01:23.760 --> 00:01:28.000
is transferring from the northern
Calaveras Fault to the Mount Diablo

00:01:28.000 --> 00:01:31.250
Fault and then further north
to the Concord Fault.

00:01:33.360 --> 00:01:36.640
Swarms present unique challenges
for earthquake forecasting.

00:01:36.640 --> 00:01:39.840
Unlike aftershocks, which can be
modeled with empirical laws like

00:01:39.840 --> 00:01:43.440
Omori decay, swarms are typically
modeled as time-varying background

00:01:43.440 --> 00:01:46.640
rates, particularly in the framework
of the Epidemic-Type Aftershock

00:01:46.640 --> 00:01:50.320
Sequence, or ETAS, model,
shown here, which is a commonly

00:01:50.320 --> 00:01:54.480
used earthquake forecasting model.
So, in this model, the seismicity rate

00:01:54.480 --> 00:02:00.080
in a region at some time is equal to
the sum of a background rate plus the –

00:02:00.080 --> 00:02:02.880
some of the aftershock rates that are
triggered by all the earthquakes that

00:02:02.880 --> 00:02:10.560
occur in that region prior to time, t.
So the problem is that the lack of

00:02:10.560 --> 00:02:13.920
direct knowledge or observation
of the underlying driving process

00:02:13.920 --> 00:02:16.560
really makes it difficult to
predict how long a swarm,

00:02:16.560 --> 00:02:19.974
or how long this high 
background rate is going to last.

00:02:20.960 --> 00:02:24.776
And earthquake forecasts are
very sensitive to the assumed

00:02:24.800 --> 00:02:28.800
swarm duration. So this figure here
is a comparison of two end member

00:02:28.800 --> 00:02:32.800
forecasts that were made during the
2016 Bombay Beach swarm in the

00:02:32.800 --> 00:02:36.880
Salton Trough in southern California.
These forecasts were made by using

00:02:36.880 --> 00:02:40.320
the ETAS model to simulate
a bunch of catalogs and then

00:02:40.320 --> 00:02:43.656
see how many earthquakes occur
during this forecast interval.

00:02:43.680 --> 00:02:47.016
So these plots are showing cumulative
number of earthquakes over time.

00:02:47.040 --> 00:02:52.456
And the forecast interval is
shaded by the – by the gray.

00:02:52.480 --> 00:02:54.800
So the top panel shows the
forecasted number of events,

00:02:54.800 --> 00:02:57.784
assuming that the high background
rate of the swarm continues

00:02:57.784 --> 00:03:01.339
throughout the
one-week forecast period.

00:03:03.120 --> 00:03:06.520
And the bottom shows a forecast using
an end member model that assumes

00:03:06.520 --> 00:03:11.176
that the high background rate stops
as soon as the forecast was made.

00:03:11.200 --> 00:03:14.160
And this ultimately leads to almost
an order of magnitude difference

00:03:14.160 --> 00:03:18.160
in the number of earthquakes
that each of the models forecast.

00:03:18.160 --> 00:03:20.960
So, in this talk, I’m going to go
over some of the basic building blocks

00:03:20.960 --> 00:03:23.200
that we’ve developed and started
to implement for earthquake

00:03:23.200 --> 00:03:27.336
forecasting during swarms.
These include a model for

00:03:27.360 --> 00:03:32.240
swarm duration based on
actuarial statistics,

00:03:32.240 --> 00:03:36.378
region-specific swarm models,
and ensemble forecasts.

00:03:38.080 --> 00:03:44.720
And the – much of what I’m going
to be covering in this talk come from

00:03:44.720 --> 00:03:47.280
two papers that we published
in 2019 in BSSA.

00:03:47.280 --> 00:03:50.960
So if you’d like further details,
I can refer you to those papers.

00:03:53.600 --> 00:03:56.480
So, to start off with, to build
a simple swarm duration model,

00:03:56.480 --> 00:04:00.240
we took a page from actuarial statistics.
And this is the kind of math that’s used

00:04:00.240 --> 00:04:04.000
to compute things like life expectancy
tables for a given population

00:04:04.000 --> 00:04:07.576
by agencies like the Social
Security Administration.

00:04:07.600 --> 00:04:10.880
Regions like the Salton Trough,
which has a high level of swarm

00:04:10.880 --> 00:04:15.920
activity, we can consider the observed
swarms there as a population

00:04:15.920 --> 00:04:19.896
and then put together what’s essentially
a swarm life expectancy table.

00:04:19.920 --> 00:04:24.720
And so that’s what shown here.
So, using a data set of about 20

00:04:24.720 --> 00:04:28.216
Salton Trough swarms that were
identified by Chen and Shearer,

00:04:28.240 --> 00:04:33.976
we can calculate the number of swarms
that last to a particular age or duration,

00:04:34.000 --> 00:04:36.560
and then, from there,
compute the probability

00:04:36.560 --> 00:04:39.840
that a swarm that’s lasted that
particular age ends in the next

00:04:39.840 --> 00:04:44.351
24 hours, and then the expected
time remaining in the swarm.

00:04:47.120 --> 00:04:50.000
So, as it turns out, the swarm
durations in the Salton Trough

00:04:50.000 --> 00:04:54.080
are exponentially distributed.
This figure shows, in the black

00:04:54.080 --> 00:04:56.800
solid line, the probability that
a swarm that has lasted to

00:04:56.800 --> 00:05:00.960
a particular duration has ended.
And the gray solid line –

00:05:00.960 --> 00:05:04.296
the probability that it will
end in the next 24 hours.

00:05:04.320 --> 00:05:08.560
And, again, these – so these solid
lines are computed directly from

00:05:08.560 --> 00:05:12.560
the observed swarms in this table.
And the dashed lines show the fit

00:05:12.560 --> 00:05:16.936
from a Poisson model – basically
a Poisson model of termination,

00:05:16.960 --> 00:05:21.896
which applies an exponential
model for a duration.

00:05:21.920 --> 00:05:24.720
The bottom plot here shows
the average time remaining –

00:05:24.720 --> 00:05:29.840
again, observed in black.
And then, once a correction

00:05:29.840 --> 00:05:32.902
has been applied to account for
the  relatively small sample size,

00:05:32.902 --> 00:05:35.840
that’s shown here by the dotted line.
And you can see that it’s again

00:05:35.840 --> 00:05:39.659
well fit by an exponential duration
or Poisson termination model.

00:05:41.680 --> 00:05:46.160
So, once we have a duration model,
we can now go back to the 2016

00:05:46.160 --> 00:05:51.656
Bombay Beach forecasts and see how
its implementation affects the forecasts.

00:05:51.680 --> 00:05:55.360
So we again make forecasts by
using the ETAS model to simulate

00:05:55.360 --> 00:05:58.240
a large number of catalogs.
But now, for each synthetic catalog,

00:05:58.240 --> 00:06:02.616
we can randomly draw a duration over
which to apply a high background rate.

00:06:02.640 --> 00:06:05.760
So this figure shows the results
from the different models.

00:06:05.760 --> 00:06:11.336
On the left are one-week forecasts.
On the right are one-month forecasts.

00:06:11.360 --> 00:06:15.576
The top row are forecasts that are made
using this exponential duration model.

00:06:15.600 --> 00:06:18.480
The middle row are forecasts that
are made using the end member

00:06:18.480 --> 00:06:22.080
model where we assume that the
swarm continues throughout

00:06:22.080 --> 00:06:25.040
the entire forecast interval.
And then the bottom is where

00:06:25.040 --> 00:06:28.036
we assume that the
swarm has ended.

00:06:28.060 --> 00:06:31.360
And on each panel, the black line
shows the forecast number of

00:06:31.360 --> 00:06:38.136
earthquakes with the shading
indicating 95% confidence interval.

00:06:38.160 --> 00:06:42.456
And the red is the observed number
in each of the forecast windows.

00:06:42.480 --> 00:06:46.960
So you can see that the duration model
does indeed improve the forecasts.

00:06:46.960 --> 00:06:50.240
And perhaps the improvement
is only slight here in the case of

00:06:50.240 --> 00:06:53.576
the one-week forecasts.
And this is likely because the

00:06:53.600 --> 00:06:58.376
average number of – average length
of a swarm in the Salton Trough

00:06:58.400 --> 00:07:02.696
is about a week, which is on the same
order as the forecast interval itself.

00:07:02.720 --> 00:07:07.520
But you can see that the improvement
is much more significant for the

00:07:07.520 --> 00:07:12.160
one-month forecasts. So our results
suggest that a duration model is

00:07:12.160 --> 00:07:17.016
definitely a useful improvement to
implement for swarm forecasting.

00:07:17.040 --> 00:07:20.320
However, this approach still assumes
that the background rate is constant

00:07:20.320 --> 00:07:24.080
throughout the entire swarm.
But, because swarms are often driven

00:07:24.080 --> 00:07:28.000
by some process that varies over time,
we might also expect that the

00:07:28.000 --> 00:07:31.976
background rates during
a swarm would vary as well.

00:07:32.000 --> 00:07:34.960
So, to explore the best way to estimate
and update the background rate

00:07:34.960 --> 00:07:37.760
for a swarm forecast,
we turn to northern California,

00:07:37.760 --> 00:07:41.520
specifically the East Bay, which,
as I’ve shown – showed in your

00:07:41.520 --> 00:07:45.494
introduction, has seen a number of
swarms in the last few decades.

00:07:46.800 --> 00:07:49.256
So there again,
shown here in this map,

00:07:49.280 --> 00:07:54.856
here’s a cumulative number of
earthquakes over time for the region.

00:07:54.880 --> 00:07:57.360
So, unlike the Salton Trough,
where the average duration of

00:07:57.360 --> 00:08:02.216
the swarms was less than a week,
the average swarm here lasts

00:08:02.240 --> 00:08:06.320
well over a month, which you
can also see in this table,

00:08:06.320 --> 00:08:09.184
again from the paper
by Xue and others.

00:08:09.184 --> 00:08:11.920
A particularly significant
swarm occurred in 2015

00:08:11.920 --> 00:08:18.160
near San Ramon over here.
And it lasted over a month.

00:08:18.160 --> 00:08:23.736
Here’s the cumulative number over
time and the magnitude over time.

00:08:23.760 --> 00:08:27.760
And it produced almost
100 magnitude 2 to 3-1/2 earthquakes.

00:08:27.760 --> 00:08:31.705
And so this is the swarm that
we’ll be using as our test case.

00:08:32.880 --> 00:08:36.320
So, to determine what background rate
produces the best forecasts during the

00:08:36.320 --> 00:08:40.000
swarm, we ran a suite of different
forecast models, where we estimated

00:08:40.000 --> 00:08:44.320
the background rates from either
all the previous seismicity in the

00:08:44.320 --> 00:08:49.440
San Ramon area, only the previous
swarms, or only the current swarm –

00:08:49.440 --> 00:08:54.000
the 2015 swarm using different
look-back windows of 2, 5, or 10 days

00:08:54.000 --> 00:08:57.235
over which to estimate
the background rate.

00:08:58.800 --> 00:09:02.000
So this figure here shows how
the different – the different

00:09:02.000 --> 00:09:06.880
background rates compare.
In the first two models, the Fixed/All –

00:09:06.880 --> 00:09:12.375
all previous seismicity model
and the only previous swarms model,

00:09:12.400 --> 00:09:14.880
you can see the background rates
are fixed throughout the

00:09:14.880 --> 00:09:17.256
entire swarm
at the same rate.

00:09:17.280 --> 00:09:18.960
For the other models,
the background rate is

00:09:18.960 --> 00:09:22.997
updated at the start
of each forecast window.

00:09:23.840 --> 00:09:26.080
So, at the start of each
three-day forecast window,

00:09:26.080 --> 00:09:31.600
we simulate 100,000 catalogs using the
different models to forecast the number

00:09:31.600 --> 00:09:36.616
of earthquakes that are likely to
occur over the next three days.

00:09:36.640 --> 00:09:39.120
And here are the results.
So this plot shows the number of

00:09:39.120 --> 00:09:43.920
earthquakes in each three-day
time window during the course

00:09:43.920 --> 00:09:46.880
of the swarm, predicted by the
different models, which are shown

00:09:46.880 --> 00:09:52.776
by the different symbols. The observed
number is shown by the squares.

00:09:52.800 --> 00:09:56.560
And the legend also contains the
log-likelihood score of each of the

00:09:56.560 --> 00:10:00.555
different models, with the higher
scores being the better models.

00:10:02.480 --> 00:10:05.040
So, of the two fixed models –
the Fixed/All model and the

00:10:05.040 --> 00:10:08.800
Fixed/Swarm model,
the Fixed/All model almost always

00:10:08.800 --> 00:10:12.320
underestimates the rates because
it has the lowest background rate.

00:10:12.320 --> 00:10:15.096
So you can see that
by the green triangles here.

00:10:15.120 --> 00:10:18.696
And the Fixed/Swarm model,
with its higher background rate,

00:10:18.720 --> 00:10:22.640
does a better job, but it tends to
start over-predicting towards the

00:10:22.640 --> 00:10:27.602
end of the swarm here because it
doesn’t know that the swarm is ending.

00:10:29.600 --> 00:10:35.576
The updated models all generally
perform better than the fixed models,

00:10:35.600 --> 00:10:38.480
but, as we look – as the swarm
progresses, you can see that

00:10:38.480 --> 00:10:42.160
no single model clearly provides
the best forecast throughout

00:10:42.160 --> 00:10:46.240
the entire swarm. So how do we
decide which model to use,

00:10:46.240 --> 00:10:49.600
especially at the beginning
of the swarm, before we know –

00:10:49.600 --> 00:10:51.999
we have very
much information?

00:10:53.200 --> 00:10:55.600
To avoid having to make that
arbitrary choice, we turn to

00:10:55.600 --> 00:10:58.400
ensemble earthquake forecast
methods – in particular,

00:10:58.400 --> 00:11:01.496
those developed by Marzocchi
and others in 2012.

00:11:01.520 --> 00:11:04.560
Ensembles are made by weighting
and combining individual models

00:11:04.560 --> 00:11:07.576
according to their
past performance.

00:11:07.600 --> 00:11:10.856
We tested two weighting schemes –
a Bayesian model averaging scheme,

00:11:10.880 --> 00:11:14.240
where the weight assigned to a model
is equal to the exponential of its

00:11:14.240 --> 00:11:18.160
likelihood, and then a score model
averaging scheme, where the weight

00:11:18.160 --> 00:11:22.320
is the inverse of its likelihood.
So these figures here show the relative

00:11:22.320 --> 00:11:26.720
weighting of each of the models get –
each of the individual models get

00:11:26.720 --> 00:11:30.456
within the ensemble
as the swarm progresses.

00:11:30.480 --> 00:11:35.096
So, for BMA weighting, you can see
that, because of the exponent here,

00:11:35.120 --> 00:11:38.160
BMA is a strong weighting scheme,
so the best-fitting model –

00:11:38.160 --> 00:11:40.880
the best-fitting individual model
tends to dominant the ensemble,

00:11:40.880 --> 00:11:43.200
which has definitely
happened here.

00:11:43.200 --> 00:11:48.562
In this case, the best-fitting model is
the LB2 – the Look-Back 2 model.

00:11:50.640 --> 00:11:56.397
The individual models get more
representation in the SMA ensemble.

00:11:57.600 --> 00:12:00.800
So here are the resulting
forecasts using the ensembles.

00:12:00.800 --> 00:12:04.400
This is the same plot that I showed
earlier with the observed and forecast

00:12:04.400 --> 00:12:07.200
number of earthquakes during the
swarm for the different models.

00:12:07.200 --> 00:12:12.936
And now I’ve added the BMA forecast
in gold and the SMA forecast in blue.

00:12:12.960 --> 00:12:15.760
And here on the right, I’m showing
the cumulative likelihood of

00:12:15.760 --> 00:12:19.760
each of the models over time
throughout – so you can see

00:12:19.760 --> 00:12:23.440
how it evolves during the swarm.
And you can see that the two ensemble

00:12:23.440 --> 00:12:29.096
models have the highest likelihood
throughout the entire swarm.

00:12:29.120 --> 00:12:32.880
Both perform about equally
well within uncertainty.

00:12:32.880 --> 00:12:35.120
We just have a preference
for the SMA ensemble

00:12:35.120 --> 00:12:38.876
because it’s less dominated
by a single model.

00:12:43.760 --> 00:12:47.760
So we also did the same analysis for
some of the older East Bay swarms,

00:12:47.760 --> 00:12:50.720
and the results are shown here.
And these include the 1976 Danville

00:12:50.720 --> 00:12:56.444
swarm, the 1990 Alamo swarm, and
then also the 2018 Danville swarm.

00:12:58.000 --> 00:13:02.240
In general, the Fixed/Swarm model
performs worse for the earlier swarms

00:13:02.240 --> 00:13:06.616
because there are fewer data to
constrain the swarm parameters.

00:13:06.640 --> 00:13:11.360
But the ensemble forecasts still perform
best overall, which you can see

00:13:11.360 --> 00:13:15.176
in the cumulative likelihood
plots here on the right.

00:13:15.200 --> 00:13:17.760
So this suggests that this method
can work, even in regions

00:13:17.760 --> 00:13:22.056
that don’t have a very
extensive swarm history.

00:13:22.080 --> 00:13:25.920
So, all in all, our results show
that ensemble forecasts can

00:13:25.920 --> 00:13:30.893
be a valuable tool in developing
swarm earthquake forecasting.

00:13:32.320 --> 00:13:35.760
So, to summarize, earthquake swarms
present challenges to current forecast

00:13:35.760 --> 00:13:39.576
models, which are really geared
more towards aftershock sequences.

00:13:39.600 --> 00:13:41.920
But we’ve developed some building
blocks for swarm earthquake

00:13:41.920 --> 00:13:45.680
forecasting, and these include
a swarm duration model,

00:13:45.680 --> 00:13:49.760
which is actually now being
implemented in USGS swarm forecasts.

00:13:49.760 --> 00:13:54.160
So a couple of screenshot examples are
shown below from the forecasts we put

00:13:54.160 --> 00:13:59.736
out for the Salton Sea and Westmorland
swarms from last summer and fall.

00:13:59.760 --> 00:14:03.040
We’ve also shown the usefulness
of having a regional swarm model

00:14:03.040 --> 00:14:06.160
tuned to prior swarm seismicity.
And we’ve shown that including

00:14:06.160 --> 00:14:08.320
time-varying background rates
and ensemble methods

00:14:08.320 --> 00:14:11.496
can also be used to improve
forecasts during swarms.

00:14:11.520 --> 00:14:15.756
Nevertheless, there’s still
a lot more work to be done.

00:14:16.800 --> 00:14:20.160
So, for my last slide, I just wanted
to end with a few of what I see

00:14:20.160 --> 00:14:22.160
are some of the next steps
towards a comprehensive

00:14:22.160 --> 00:14:25.096
swarm operational
earthquake forecast model.

00:14:25.120 --> 00:14:28.640
These include folding the actuarial
duration model into the ensemble

00:14:28.640 --> 00:14:32.729
approach and also investigating how
widely applicable the exponential

00:14:32.729 --> 00:14:38.288
swarm duration model is across
different regional and tectonic settings.

00:14:39.360 --> 00:14:42.400
We also need to figure out how to
detect when a swarm is happening

00:14:42.400 --> 00:14:44.800
and when a swarm forecast
model would be more appropriate

00:14:44.800 --> 00:14:48.685
to use than an aftershock
model, or vice versa.

00:14:49.600 --> 00:14:52.400
We also need to consider how
might messaging need to differ

00:14:52.400 --> 00:14:56.536
for swarm-specific forecasts
versus aftershock forecasts.

00:14:56.560 --> 00:14:59.200
And finally, how might a better
understanding of the physics of

00:14:59.200 --> 00:15:02.000
swarms, which our next speaker,
Alicia Hotovec-Ellis, is going to

00:15:02.000 --> 00:15:06.560
discuss, and how might external
information about the physical

00:15:06.560 --> 00:15:09.840
driving process, such as
fluid flow, strain, or GPS,

00:15:09.840 --> 00:15:12.936
be used to improve our
forecasts in the future.

00:15:12.960 --> 00:15:16.560
So, with that, I’ll end there,
and thank you for your attention.