--- title: "Fitting Custom Meta-Analytic Ensembles" author: "František Bartoš" date: "`r Sys.Date()`" output: rmarkdown::html_vignette: self_contained: yes bibliography: ../inst/REFERENCES.bib csl: ../inst/apa.csl vignette: > %\VignetteIndexEntry{Fitting Custom Meta-Analytic Ensembles} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} %\VignetteEngine{knitr::rmarkdown_notangle} --- ```{r setup, include = FALSE} is_check <- ("CheckExEnv" %in% search()) || any(c("_R_CHECK_TIMINGS_", "_R_CHECK_LICENSE_") %in% names(Sys.getenv())) || !file.exists("../models/CustomEnsembles/Bem_update1.RDS") knitr::opts_chunk$set( collapse = TRUE, comment = "#>", eval = !is_check, dev = "png" ) if(.Platform$OS.type == "windows"){ knitr::opts_chunk$set(dev.args = list(type = "cairo")) } ``` ```{r include = FALSE, eval = FALSE} # R package version updating library(RoBMA) data("Bem2011", package = "RoBMA") fit <- RoBMA(d = Bem2011$d, se = Bem2011$se, study_names = Bem2011$study, priors_effect = NULL, priors_heterogeneity = NULL, priors_bias = NULL, priors_effect_null = prior("spike", parameters = list(location = 0)), priors_heterogeneity_null = prior("spike", parameters = list(location = 0)), priors_bias_null = prior_none(), seed = 1) fit <- update(fit, prior_effect = prior("normal", parameters = list(mean = .15, sd = .10), truncation = list(lower = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias_null = prior_none()) saveRDS(fit, file = "../models/CustomEnsembles/Bem_update1.RDS") fit <- update(fit, prior_effect = prior("normal", parameters = list(mean = .15, sd = .10), truncation = list(lower = 0)), prior_heterogeneity = prior("invgamma", parameters = list(shape = 1, scale = .15)), prior_bias_null = prior_none()) fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_weightfunction("one.sided", parameters = list(alpha = c(1, 1), steps = c(0.05)))) fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_weightfunction("one.sided", parameters = list(alpha = c(1, 1, 1), steps = c(0.05, 0.10)))) fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_PET("Cauchy", parameters = list(0, 1), truncation = list(lower = 0))) fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_PEESE("Cauchy", parameters = list(0, 5), truncation = list(lower = 0))) saveRDS(fit, file = "../models/CustomEnsembles/Bem_update2.RDS") ``` By default, the `RoBMA()` function specifies models as a combination of all supplied prior distributions (across null and alternative specification), with their prior model weights being equal to the product of prior distributions' weights. This results in the 36 meta-analytic models using the default settings [@bartos2021no]$^1$. In [another vignette](ReproducingBMA.html), we illustrated that RoBMA can be also utilized for reproducing Bayesian Model-Averaged Meta-Analysis (BMA) [@gronau2017bayesian; @gronau2020primer; @bartos2021bayesian]. However, the package was built as a framework for estimating highly customized meta-analytic model ensembles. Here, we are going to illustrate how to do exactly that (see @bartos2020adjusting for a tutorial paper on customizing the model ensemble with JASP). Please keep in mind that all models should be justified by theory. Furthermore, the models should be tested to make sure that the ensemble can perform as intended a priori to drawing inference from it. The following sections are only for illustrating the functionality of the package. We provide a completely discussion with the relevant sources in the Example section of @bartos2021no. ### The Dataset To illustrate the custom model building procedure, we use data from the infamous @bem2011feeling "Feeling the future" pre-cognition study. We use coding of the results as summarized by Bem in one of his later replies [@bem2011must]. ```{r} library(RoBMA) data("Bem2011", package = "RoBMA") Bem2011 ``` ### The Custom Ensemble We consider the following scenarios as plausible explanations for the data, and decide to include only those models into the meta-analytic ensemble: 1) there is absolutely no precognition effect - a fixed effects model assuming the effect size to be zero ($H_{0}^f$), 2) the experiments measured the same underlying precognition effect - a fixed effects model ($H_{1}^f$), 3) each of the experiments measured a slightly different precognition effect - a random effects model ($H_{1}^r$), 4) there is absolutely no precognition effect and the results can be explained away with publication bias, modeled with one of the following publication bias adjustments: - 4.1) one-sided selection operating on significant *p*-values ($H_{1,\text{pb1}}^f$), - 4.2) one-sided selection operating on significant and marginally significant *p*-values ($H_{1,\text{pb2}}^f$), - 4.3) PET correction for publication bias which adjusts for the relationship between effect sizes and standard errors ($H_{1,\text{pb3}}^f$), - 4.4) PEESE correction for publication bias which adjusts for the relationship between effect sizes and standard errors squared ($H_{1,\text{pb4}}^f$). If we were to fit the ensemble using the `RoBMA()` function and specifying all of the priors, we would have ended with 2 (effect or no effect) * 2 (heterogeneity or no heterogeneity) * 5 (no publication bias or 4 ways of adjusting for publication bias) = 20 models. That is 13 models more than requested. Furthermore, we could not specify different parameters for the prior distributions for each model, which the following process allows (but we do not utilize it). We start with fitting only the first model using the `RoBMA()` function and we will continuously update the fitted object to include all of the models. #### Model 1 We initiate the model ensemble by specifying only the first model with the `RoBMA()` function. We explicitly specify prior distributions for all components and set the prior distributions to correspond to the null hypotheses and set seed to ensure reproducibility of the results. ```{r} fit <- RoBMA(d = Bem2011$d, se = Bem2011$se, study_names = Bem2011$study, priors_effect = NULL, priors_heterogeneity = NULL, priors_bias = NULL, priors_effect_null = prior("spike", parameters = list(location = 0)), priors_heterogeneity_null = prior("spike", parameters = list(location = 0)), priors_bias_null = prior_none(), seed = 1) ``` We verify that the ensemble contains only the single specified model with the `summary()` function by setting `type = "models"`. ```{r} summary(fit, type = "models") ``` #### Model 2 Before we add the second model to the ensemble, we need to decide on the prior distribution for the mean parameter. If pre-cognition were to exist, the effect would be small since all casinos would be bankrupted otherwise. The effect would also be positive, since any deviation from randomness could be characterized as an effect. Therefore, we decide to use a normal distribution with mean = 0.15, standard deviation 0.10, and truncated to the positive range. This sets the prior density around small effect sizes. To get a better grasp of the prior distribution, we visualize it using the `plot())` function (the figure can be also created using the ggplot2 package by adding `plot_type == "ggplot"` argument). ```{r fig_mu_prior, dpi = 300, fig.width = 4, fig.height = 3, out.width = "50%", fig.align = "center"} plot(prior("normal", parameters = list(mean = .15, sd = .10), truncation = list(lower = 0))) ``` We add the second model to the ensemble using the `update.RoBMA()` function. The function can also be used to many other purposes - updating settings, prior model weights, and refitting failed models. Here, we supply the fitted ensemble object and add an argument specifying the prior distributions of each components for the additional model. Since we want to add Model 2 - we set the prior for the $\mu$ parameter to be treated as a prior belonging to the alternative hypothesis of the effect size component and the remaining priors treated as belonging to the alternative hypotheses. If we wanted, we could also specify `prior_weights` argument, to change the prior probability of the fitted model but we do not utilize this option here and keep the default value, which sets the prior weights for the new model to `1`. (Note that the arguments for specifying prior distributions in `update.RoBMA()` function are `prior_X` - in singular, in comparison to `RoBMA()` function that uses `priors_X` in plural.) ```{r include = FALSE} # these fits are relatively fast, but we reduce the knitting time considerably fit <- readRDS(file = "../models/CustomEnsembles/Bem_update1.RDS") ``` ```r fit <- update(fit, prior_effect = prior("normal", parameters = list(mean = .15, sd = .10), truncation = list(lower = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias_null = prior_none()) ``` We can again inspect the updated ensemble to verify that it contains both models. We see that Model 2 notably outperformed the first model and attained all of the posterior model probability. ```{r} summary(fit, type = "models") ``` #### Models 3-4.4 Before we add the remaining models to the ensemble using the `update()` function, we need to decide on the remaining prior distributions. Specifically, on the prior distribution for the heterogeneity parameter $\tau$, and the publication bias adjustment parameters $\omega$ (for the selection models' weightfunctions) and PET and PEESE for the PET and PEESE adjustment. For Model 3, we use the usual inverse-gamma(1, .15) prior distribution based on empirical heterogeneity estimates [@erp2017estimates] for the heterogeneity parameter $\tau$. For Models 4.1-4.4 we use the default settings for the publication bias adjustments as outlined the Appendix B of [@bartos2021no]. Now, we just need to add the remaining models to the ensemble using the `update()` function as already illustrated. ```r ### adding Model 3 fit <- update(fit, prior_effect = prior("normal", parameters = list(mean = .15, sd = .10), truncation = list(lower = 0)), prior_heterogeneity = prior("invgamma", parameters = list(shape = 1, scale = .15)), prior_bias_null = prior_none()) ### adding Model 4.1 fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_weightfunction("one.sided", parameters = list(alpha = c(1, 1), steps = c(0.05)))) ### adding Model 4.2 fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_weightfunction("one.sided", parameters = list(alpha = c(1, 1, 1), steps = c(0.05, 0.10)))) ### adding Model 4.3 fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_PET("Cauchy", parameters = list(0, 1), truncation = list(lower = 0))) ### adding Model 4.4 fit <- update(fit, prior_effect_null = prior("spike", parameters = list(location = 0)), prior_heterogeneity_null = prior("spike", parameters = list(location = 0)), prior_bias = prior_PEESE("Cauchy", parameters = list(0, 5), truncation = list(lower = 0))) ``` We again verify that all of the requested models are included in the ensemble using the `summary())` function with `type = "models"` argument. ```{r include = FALSE} fit <- readRDS(file = "../models/CustomEnsembles/Bem_update2.RDS") ``` ```{r} summary(fit, type = "models") ``` ### Using the Fitted Ensemble Finally, we use the `summary()` function to inspect the model results. The results from our custom ensemble indicate weak evidence for the absence of the pre-cognition effect, $\text{BF}_{10} = 0.584$ -> $\text{BF}_{01} = 1.71$, moderate evidence for the absence of heterogeneity, $\text{BF}_{\text{rf}} = 0.132$ -> $\text{BF}_{\text{fr}} = 7.58$, and moderate evidence for the presence of the publication bias, $\text{BF}_{\text{pb}} = 3.21$. ```{r} summary(fit) ``` The finalized ensemble can be treated as any other `RoBMA` ensemble using the `summary()`, `plot()`, `plot_models()`, `forest()`, and `diagnostics()` functions. For example, we can use the `plot.RoBMA()` with the `parameter = "mu", prior = TRUE` arguments to plot the prior (grey) and posterior distribution (black) for the effect size. The function visualizes the model-averaged estimates across all models by default. The arrows stand for the probability of a spike, here, at the value 0. The secondary y-axis (right) shows the probability of the value 0, increasing from 0.71, to 0.81. ```{r fig_mu_posterior, dpi = 300, fig.width = 4, fig.height = 3.5, out.width = "50%", fig.align = "center"} plot(fit, parameter = "mu", prior = TRUE) ``` We can also inspect the posterior distributions of the publication bias adjustments. To visualize the model-averaged weightfunction, we change set `parameter = weightfunction` argument, with the prior distribution in light gray and the posterior distribution in the dark gray, ```{r fig_weightfunction_posterior, dpi = 300, fig.width = 5, fig.height = 4, out.width = "75%", fig.align = "center"} plot(fit, parameter = "weightfunction", prior = TRUE) ``` and the posterior estimate of the regression relationship between the standard errors and effect sizes by setting `parameter = "PET-PEESE"`. ```{r fig_PETPEESE_posterior, dpi = 300, fig.width = 5, fig.height = 4, out.width = "75%", fig.align = "center"} plot(fit, parameter = "PET-PEESE", prior = TRUE) ``` ### Footnotes $^1$ - The default setting used to produce 12 models in RoBMA versions < 2, which corresponded to earlier an article by @maier2020robust in which we applied Bayesian model-averaging only across selection models. ### References