WEBVTT Kind: captions Language: en-US 00:00:02.720 --> 00:00:05.040 Okay, good morning, everyone. Today I’m going to be talking about 00:00:05.040 --> 00:00:07.040 some of the methods we’ve developed to do earthquake 00:00:07.040 --> 00:00:10.320 forecasting during swarms. Before we begin, I just wanted to 00:00:10.320 --> 00:00:14.696 thank my collaborators – Andy, Nicholas, Morgan, Jeanne, and Sara, 00:00:14.720 --> 00:00:18.160 whom I’ve worked with on various aspects of swarms over the years. 00:00:18.160 --> 00:00:20.160 And a special shout-out goes to Sara, who is actually 00:00:20.160 --> 00:00:23.118 the one who came up with this pretty great title. 00:00:26.000 --> 00:00:29.460 So earthquake swarms are sequences that are thought to be driven primarily 00:00:29.460 --> 00:00:33.281 by some external transient aseismic process, such as fluid flow – 00:00:33.281 --> 00:00:38.240 either natural or anthropogenic, or slow slip. And while swarms in 00:00:38.240 --> 00:00:40.400 southern California, particularly in the Salton Trough, have been causing 00:00:40.400 --> 00:00:43.840 more of a fuss recently, swarms happen in the Bay Area, too, 00:00:43.840 --> 00:00:49.016 particularly in the East Bay, shown here in this map, 00:00:49.040 --> 00:00:53.496 near the cities of San Ramon, Danville, and Alamo. 00:00:53.520 --> 00:00:57.200 At least 11 swarms have happened here since 1970, which you can 00:00:57.200 --> 00:01:03.096 see color-coded on this map by – from a paper by Xue and others. 00:01:03.120 --> 00:01:07.200 And you can see some of their characteristics here in this table, 00:01:07.200 --> 00:01:10.400 also from their paper. So the largest earthquakes in these 00:01:10.400 --> 00:01:16.240 swarms tend to be in the mid-4s – about 4.2, 4.4. 00:01:16.240 --> 00:01:19.896 And the swarms typically last from weeks to months. 00:01:19.920 --> 00:01:23.760 These swarms seem to be occurring in an extensional step-over where slip 00:01:23.760 --> 00:01:28.000 is transferring from the northern Calaveras Fault to the Mount Diablo 00:01:28.000 --> 00:01:31.250 Fault and then further north to the Concord Fault. 00:01:33.360 --> 00:01:36.640 Swarms present unique challenges for earthquake forecasting. 00:01:36.640 --> 00:01:39.840 Unlike aftershocks, which can be modeled with empirical laws like 00:01:39.840 --> 00:01:43.440 Omori decay, swarms are typically modeled as time-varying background 00:01:43.440 --> 00:01:46.640 rates, particularly in the framework of the Epidemic-Type Aftershock 00:01:46.640 --> 00:01:50.320 Sequence, or ETAS, model, shown here, which is a commonly 00:01:50.320 --> 00:01:54.480 used earthquake forecasting model. So, in this model, the seismicity rate 00:01:54.480 --> 00:02:00.080 in a region at some time is equal to the sum of a background rate plus the – 00:02:00.080 --> 00:02:02.880 some of the aftershock rates that are triggered by all the earthquakes that 00:02:02.880 --> 00:02:10.560 occur in that region prior to time, t. So the problem is that the lack of 00:02:10.560 --> 00:02:13.920 direct knowledge or observation of the underlying driving process 00:02:13.920 --> 00:02:16.560 really makes it difficult to predict how long a swarm, 00:02:16.560 --> 00:02:19.974 or how long this high background rate is going to last. 00:02:20.960 --> 00:02:24.776 And earthquake forecasts are very sensitive to the assumed 00:02:24.800 --> 00:02:28.800 swarm duration. So this figure here is a comparison of two end member 00:02:28.800 --> 00:02:32.800 forecasts that were made during the 2016 Bombay Beach swarm in the 00:02:32.800 --> 00:02:36.880 Salton Trough in southern California. These forecasts were made by using 00:02:36.880 --> 00:02:40.320 the ETAS model to simulate a bunch of catalogs and then 00:02:40.320 --> 00:02:43.656 see how many earthquakes occur during this forecast interval. 00:02:43.680 --> 00:02:47.016 So these plots are showing cumulative number of earthquakes over time. 00:02:47.040 --> 00:02:52.456 And the forecast interval is shaded by the – by the gray. 00:02:52.480 --> 00:02:54.800 So the top panel shows the forecasted number of events, 00:02:54.800 --> 00:02:57.784 assuming that the high background rate of the swarm continues 00:02:57.784 --> 00:03:01.339 throughout the one-week forecast period. 00:03:03.120 --> 00:03:06.520 And the bottom shows a forecast using an end member model that assumes 00:03:06.520 --> 00:03:11.176 that the high background rate stops as soon as the forecast was made. 00:03:11.200 --> 00:03:14.160 And this ultimately leads to almost an order of magnitude difference 00:03:14.160 --> 00:03:18.160 in the number of earthquakes that each of the models forecast. 00:03:18.160 --> 00:03:20.960 So, in this talk, I’m going to go over some of the basic building blocks 00:03:20.960 --> 00:03:23.200 that we’ve developed and started to implement for earthquake 00:03:23.200 --> 00:03:27.336 forecasting during swarms. These include a model for 00:03:27.360 --> 00:03:32.240 swarm duration based on actuarial statistics, 00:03:32.240 --> 00:03:36.378 region-specific swarm models, and ensemble forecasts. 00:03:38.080 --> 00:03:44.720 And the – much of what I’m going to be covering in this talk come from 00:03:44.720 --> 00:03:47.280 two papers that we published in 2019 in BSSA. 00:03:47.280 --> 00:03:50.960 So if you’d like further details, I can refer you to those papers. 00:03:53.600 --> 00:03:56.480 So, to start off with, to build a simple swarm duration model, 00:03:56.480 --> 00:04:00.240 we took a page from actuarial statistics. And this is the kind of math that’s used 00:04:00.240 --> 00:04:04.000 to compute things like life expectancy tables for a given population 00:04:04.000 --> 00:04:07.576 by agencies like the Social Security Administration. 00:04:07.600 --> 00:04:10.880 Regions like the Salton Trough, which has a high level of swarm 00:04:10.880 --> 00:04:15.920 activity, we can consider the observed swarms there as a population 00:04:15.920 --> 00:04:19.896 and then put together what’s essentially a swarm life expectancy table. 00:04:19.920 --> 00:04:24.720 And so that’s what shown here. So, using a data set of about 20 00:04:24.720 --> 00:04:28.216 Salton Trough swarms that were identified by Chen and Shearer, 00:04:28.240 --> 00:04:33.976 we can calculate the number of swarms that last to a particular age or duration, 00:04:34.000 --> 00:04:36.560 and then, from there, compute the probability 00:04:36.560 --> 00:04:39.840 that a swarm that’s lasted that particular age ends in the next 00:04:39.840 --> 00:04:44.351 24 hours, and then the expected time remaining in the swarm. 00:04:47.120 --> 00:04:50.000 So, as it turns out, the swarm durations in the Salton Trough 00:04:50.000 --> 00:04:54.080 are exponentially distributed. This figure shows, in the black 00:04:54.080 --> 00:04:56.800 solid line, the probability that a swarm that has lasted to 00:04:56.800 --> 00:05:00.960 a particular duration has ended. And the gray solid line – 00:05:00.960 --> 00:05:04.296 the probability that it will end in the next 24 hours. 00:05:04.320 --> 00:05:08.560 And, again, these – so these solid lines are computed directly from 00:05:08.560 --> 00:05:12.560 the observed swarms in this table. And the dashed lines show the fit 00:05:12.560 --> 00:05:16.936 from a Poisson model – basically a Poisson model of termination, 00:05:16.960 --> 00:05:21.896 which applies an exponential model for a duration. 00:05:21.920 --> 00:05:24.720 The bottom plot here shows the average time remaining – 00:05:24.720 --> 00:05:29.840 again, observed in black. And then, once a correction 00:05:29.840 --> 00:05:32.902 has been applied to account for the relatively small sample size, 00:05:32.902 --> 00:05:35.840 that’s shown here by the dotted line. And you can see that it’s again 00:05:35.840 --> 00:05:39.659 well fit by an exponential duration or Poisson termination model. 00:05:41.680 --> 00:05:46.160 So, once we have a duration model, we can now go back to the 2016 00:05:46.160 --> 00:05:51.656 Bombay Beach forecasts and see how its implementation affects the forecasts. 00:05:51.680 --> 00:05:55.360 So we again make forecasts by using the ETAS model to simulate 00:05:55.360 --> 00:05:58.240 a large number of catalogs. But now, for each synthetic catalog, 00:05:58.240 --> 00:06:02.616 we can randomly draw a duration over which to apply a high background rate. 00:06:02.640 --> 00:06:05.760 So this figure shows the results from the different models. 00:06:05.760 --> 00:06:11.336 On the left are one-week forecasts. On the right are one-month forecasts. 00:06:11.360 --> 00:06:15.576 The top row are forecasts that are made using this exponential duration model. 00:06:15.600 --> 00:06:18.480 The middle row are forecasts that are made using the end member 00:06:18.480 --> 00:06:22.080 model where we assume that the swarm continues throughout 00:06:22.080 --> 00:06:25.040 the entire forecast interval. And then the bottom is where 00:06:25.040 --> 00:06:28.036 we assume that the swarm has ended. 00:06:28.060 --> 00:06:31.360 And on each panel, the black line shows the forecast number of 00:06:31.360 --> 00:06:38.136 earthquakes with the shading indicating 95% confidence interval. 00:06:38.160 --> 00:06:42.456 And the red is the observed number in each of the forecast windows. 00:06:42.480 --> 00:06:46.960 So you can see that the duration model does indeed improve the forecasts. 00:06:46.960 --> 00:06:50.240 And perhaps the improvement is only slight here in the case of 00:06:50.240 --> 00:06:53.576 the one-week forecasts. And this is likely because the 00:06:53.600 --> 00:06:58.376 average number of – average length of a swarm in the Salton Trough 00:06:58.400 --> 00:07:02.696 is about a week, which is on the same order as the forecast interval itself. 00:07:02.720 --> 00:07:07.520 But you can see that the improvement is much more significant for the 00:07:07.520 --> 00:07:12.160 one-month forecasts. So our results suggest that a duration model is 00:07:12.160 --> 00:07:17.016 definitely a useful improvement to implement for swarm forecasting. 00:07:17.040 --> 00:07:20.320 However, this approach still assumes that the background rate is constant 00:07:20.320 --> 00:07:24.080 throughout the entire swarm. But, because swarms are often driven 00:07:24.080 --> 00:07:28.000 by some process that varies over time, we might also expect that the 00:07:28.000 --> 00:07:31.976 background rates during a swarm would vary as well. 00:07:32.000 --> 00:07:34.960 So, to explore the best way to estimate and update the background rate 00:07:34.960 --> 00:07:37.760 for a swarm forecast, we turn to northern California, 00:07:37.760 --> 00:07:41.520 specifically the East Bay, which, as I’ve shown – showed in your 00:07:41.520 --> 00:07:45.494 introduction, has seen a number of swarms in the last few decades. 00:07:46.800 --> 00:07:49.256 So there again, shown here in this map, 00:07:49.280 --> 00:07:54.856 here’s a cumulative number of earthquakes over time for the region. 00:07:54.880 --> 00:07:57.360 So, unlike the Salton Trough, where the average duration of 00:07:57.360 --> 00:08:02.216 the swarms was less than a week, the average swarm here lasts 00:08:02.240 --> 00:08:06.320 well over a month, which you can also see in this table, 00:08:06.320 --> 00:08:09.184 again from the paper by Xue and others. 00:08:09.184 --> 00:08:11.920 A particularly significant swarm occurred in 2015 00:08:11.920 --> 00:08:18.160 near San Ramon over here. And it lasted over a month. 00:08:18.160 --> 00:08:23.736 Here’s the cumulative number over time and the magnitude over time. 00:08:23.760 --> 00:08:27.760 And it produced almost 100 magnitude 2 to 3-1/2 earthquakes. 00:08:27.760 --> 00:08:31.705 And so this is the swarm that we’ll be using as our test case. 00:08:32.880 --> 00:08:36.320 So, to determine what background rate produces the best forecasts during the 00:08:36.320 --> 00:08:40.000 swarm, we ran a suite of different forecast models, where we estimated 00:08:40.000 --> 00:08:44.320 the background rates from either all the previous seismicity in the 00:08:44.320 --> 00:08:49.440 San Ramon area, only the previous swarms, or only the current swarm – 00:08:49.440 --> 00:08:54.000 the 2015 swarm using different look-back windows of 2, 5, or 10 days 00:08:54.000 --> 00:08:57.235 over which to estimate the background rate. 00:08:58.800 --> 00:09:02.000 So this figure here shows how the different – the different 00:09:02.000 --> 00:09:06.880 background rates compare. In the first two models, the Fixed/All – 00:09:06.880 --> 00:09:12.375 all previous seismicity model and the only previous swarms model, 00:09:12.400 --> 00:09:14.880 you can see the background rates are fixed throughout the 00:09:14.880 --> 00:09:17.256 entire swarm at the same rate. 00:09:17.280 --> 00:09:18.960 For the other models, the background rate is 00:09:18.960 --> 00:09:22.997 updated at the start of each forecast window. 00:09:23.840 --> 00:09:26.080 So, at the start of each three-day forecast window, 00:09:26.080 --> 00:09:31.600 we simulate 100,000 catalogs using the different models to forecast the number 00:09:31.600 --> 00:09:36.616 of earthquakes that are likely to occur over the next three days. 00:09:36.640 --> 00:09:39.120 And here are the results. So this plot shows the number of 00:09:39.120 --> 00:09:43.920 earthquakes in each three-day time window during the course 00:09:43.920 --> 00:09:46.880 of the swarm, predicted by the different models, which are shown 00:09:46.880 --> 00:09:52.776 by the different symbols. The observed number is shown by the squares. 00:09:52.800 --> 00:09:56.560 And the legend also contains the log-likelihood score of each of the 00:09:56.560 --> 00:10:00.555 different models, with the higher scores being the better models. 00:10:02.480 --> 00:10:05.040 So, of the two fixed models – the Fixed/All model and the 00:10:05.040 --> 00:10:08.800 Fixed/Swarm model, the Fixed/All model almost always 00:10:08.800 --> 00:10:12.320 underestimates the rates because it has the lowest background rate. 00:10:12.320 --> 00:10:15.096 So you can see that by the green triangles here. 00:10:15.120 --> 00:10:18.696 And the Fixed/Swarm model, with its higher background rate, 00:10:18.720 --> 00:10:22.640 does a better job, but it tends to start over-predicting towards the 00:10:22.640 --> 00:10:27.602 end of the swarm here because it doesn’t know that the swarm is ending. 00:10:29.600 --> 00:10:35.576 The updated models all generally perform better than the fixed models, 00:10:35.600 --> 00:10:38.480 but, as we look – as the swarm progresses, you can see that 00:10:38.480 --> 00:10:42.160 no single model clearly provides the best forecast throughout 00:10:42.160 --> 00:10:46.240 the entire swarm. So how do we decide which model to use, 00:10:46.240 --> 00:10:49.600 especially at the beginning of the swarm, before we know – 00:10:49.600 --> 00:10:51.999 we have very much information? 00:10:53.200 --> 00:10:55.600 To avoid having to make that arbitrary choice, we turn to 00:10:55.600 --> 00:10:58.400 ensemble earthquake forecast methods – in particular, 00:10:58.400 --> 00:11:01.496 those developed by Marzocchi and others in 2012. 00:11:01.520 --> 00:11:04.560 Ensembles are made by weighting and combining individual models 00:11:04.560 --> 00:11:07.576 according to their past performance. 00:11:07.600 --> 00:11:10.856 We tested two weighting schemes – a Bayesian model averaging scheme, 00:11:10.880 --> 00:11:14.240 where the weight assigned to a model is equal to the exponential of its 00:11:14.240 --> 00:11:18.160 likelihood, and then a score model averaging scheme, where the weight 00:11:18.160 --> 00:11:22.320 is the inverse of its likelihood. So these figures here show the relative 00:11:22.320 --> 00:11:26.720 weighting of each of the models get – each of the individual models get 00:11:26.720 --> 00:11:30.456 within the ensemble as the swarm progresses. 00:11:30.480 --> 00:11:35.096 So, for BMA weighting, you can see that, because of the exponent here, 00:11:35.120 --> 00:11:38.160 BMA is a strong weighting scheme, so the best-fitting model – 00:11:38.160 --> 00:11:40.880 the best-fitting individual model tends to dominant the ensemble, 00:11:40.880 --> 00:11:43.200 which has definitely happened here. 00:11:43.200 --> 00:11:48.562 In this case, the best-fitting model is the LB2 – the Look-Back 2 model. 00:11:50.640 --> 00:11:56.397 The individual models get more representation in the SMA ensemble. 00:11:57.600 --> 00:12:00.800 So here are the resulting forecasts using the ensembles. 00:12:00.800 --> 00:12:04.400 This is the same plot that I showed earlier with the observed and forecast 00:12:04.400 --> 00:12:07.200 number of earthquakes during the swarm for the different models. 00:12:07.200 --> 00:12:12.936 And now I’ve added the BMA forecast in gold and the SMA forecast in blue. 00:12:12.960 --> 00:12:15.760 And here on the right, I’m showing the cumulative likelihood of 00:12:15.760 --> 00:12:19.760 each of the models over time throughout – so you can see 00:12:19.760 --> 00:12:23.440 how it evolves during the swarm. And you can see that the two ensemble 00:12:23.440 --> 00:12:29.096 models have the highest likelihood throughout the entire swarm. 00:12:29.120 --> 00:12:32.880 Both perform about equally well within uncertainty. 00:12:32.880 --> 00:12:35.120 We just have a preference for the SMA ensemble 00:12:35.120 --> 00:12:38.876 because it’s less dominated by a single model. 00:12:43.760 --> 00:12:47.760 So we also did the same analysis for some of the older East Bay swarms, 00:12:47.760 --> 00:12:50.720 and the results are shown here. And these include the 1976 Danville 00:12:50.720 --> 00:12:56.444 swarm, the 1990 Alamo swarm, and then also the 2018 Danville swarm. 00:12:58.000 --> 00:13:02.240 In general, the Fixed/Swarm model performs worse for the earlier swarms 00:13:02.240 --> 00:13:06.616 because there are fewer data to constrain the swarm parameters. 00:13:06.640 --> 00:13:11.360 But the ensemble forecasts still perform best overall, which you can see 00:13:11.360 --> 00:13:15.176 in the cumulative likelihood plots here on the right. 00:13:15.200 --> 00:13:17.760 So this suggests that this method can work, even in regions 00:13:17.760 --> 00:13:22.056 that don’t have a very extensive swarm history. 00:13:22.080 --> 00:13:25.920 So, all in all, our results show that ensemble forecasts can 00:13:25.920 --> 00:13:30.893 be a valuable tool in developing swarm earthquake forecasting. 00:13:32.320 --> 00:13:35.760 So, to summarize, earthquake swarms present challenges to current forecast 00:13:35.760 --> 00:13:39.576 models, which are really geared more towards aftershock sequences. 00:13:39.600 --> 00:13:41.920 But we’ve developed some building blocks for swarm earthquake 00:13:41.920 --> 00:13:45.680 forecasting, and these include a swarm duration model, 00:13:45.680 --> 00:13:49.760 which is actually now being implemented in USGS swarm forecasts. 00:13:49.760 --> 00:13:54.160 So a couple of screenshot examples are shown below from the forecasts we put 00:13:54.160 --> 00:13:59.736 out for the Salton Sea and Westmorland swarms from last summer and fall. 00:13:59.760 --> 00:14:03.040 We’ve also shown the usefulness of having a regional swarm model 00:14:03.040 --> 00:14:06.160 tuned to prior swarm seismicity. And we’ve shown that including 00:14:06.160 --> 00:14:08.320 time-varying background rates and ensemble methods 00:14:08.320 --> 00:14:11.496 can also be used to improve forecasts during swarms. 00:14:11.520 --> 00:14:15.756 Nevertheless, there’s still a lot more work to be done. 00:14:16.800 --> 00:14:20.160 So, for my last slide, I just wanted to end with a few of what I see 00:14:20.160 --> 00:14:22.160 are some of the next steps towards a comprehensive 00:14:22.160 --> 00:14:25.096 swarm operational earthquake forecast model. 00:14:25.120 --> 00:14:28.640 These include folding the actuarial duration model into the ensemble 00:14:28.640 --> 00:14:32.729 approach and also investigating how widely applicable the exponential 00:14:32.729 --> 00:14:38.288 swarm duration model is across different regional and tectonic settings. 00:14:39.360 --> 00:14:42.400 We also need to figure out how to detect when a swarm is happening 00:14:42.400 --> 00:14:44.800 and when a swarm forecast model would be more appropriate 00:14:44.800 --> 00:14:48.685 to use than an aftershock model, or vice versa. 00:14:49.600 --> 00:14:52.400 We also need to consider how might messaging need to differ 00:14:52.400 --> 00:14:56.536 for swarm-specific forecasts versus aftershock forecasts. 00:14:56.560 --> 00:14:59.200 And finally, how might a better understanding of the physics of 00:14:59.200 --> 00:15:02.000 swarms, which our next speaker, Alicia Hotovec-Ellis, is going to 00:15:02.000 --> 00:15:06.560 discuss, and how might external information about the physical 00:15:06.560 --> 00:15:09.840 driving process, such as fluid flow, strain, or GPS, 00:15:09.840 --> 00:15:12.936 be used to improve our forecasts in the future. 00:15:12.960 --> 00:15:16.560 So, with that, I’ll end there, and thank you for your attention.