Chapter 5 Analysis and Visualization of Publication Bias

Publication bias is a significant concern in meta-analysis, as it can distort the overall effect estimate and lead to misleading conclusions. In this section, we will discuss various methods for detecting and visualizing publication bias, including funnel plots, the trim and fill method, the Egger test, and Robust Bayesian Meta-Analysis.

5.0.1 Funnel Plots

A funnel plot is a scatter plot that helps visualize the relationship between the effect size and the precision of individual studies in a meta-analysis. It is an essential tool for identifying potential publication bias.

# Load necessary libraries
library(metafor)
library(ggplot2)

# Create a data frame with example data
data <- data.frame(
  study = c("Study1", "Study2", "Study3", "Study4", "Study5"),
  smd = c(0.5, 0.8, 0.3, 1.0, 0.9),
  var_smd = c(0.1, 0.15, 0.2, 0.05, 0.12)
)

# Conduct a random-effects meta-analysis
res <- rma(yi = smd, vi = var_smd, data = data)

estimate <- res$b  # Replace with your meta-analytic estimate if needed
se <- res$se        # Replace with your standard error if needed

# Define a sequence for standard errors (SE) to create CI regions
se.seq <- seq(0, max(data$var_smd), 0.001)

# Calculate 95% and 99% confidence intervals for the region
ll95 <- estimate - (1.96 * se.seq)
ul95 <- estimate + (1.96 * se.seq)
ll99 <- estimate - (3.29 * se.seq)
ul99 <- estimate + (3.29 * se.seq)

# Compute confidence intervals for the mean meta-analytic estimate
meanll95 <- estimate - (1.96 * se)
meanul95 <- estimate + (1.96 * se)

# Store all calculated values in a single data frame for easy plotting
dfCI <- data.frame(ll95, ul95, ll99, ul99, se.seq, estimate, meanll95, meanul95)


# Draw the enhanced funnel plot with shaded CI regions and lines
funnel_plot <- ggplot(aes(x = var_smd, y = smd), data = data)+
  # Add shaded areas for 95% and 99% CI regions
  geom_point(shape = 1) +
  xlab('Standard Error') + ylab('smd')+
  geom_line(aes(x = se.seq, y = ll95), linetype = 'dotted', data = dfCI) +
  geom_line(aes(x = se.seq, y = ul95), linetype = 'dotted', data = dfCI) +
  geom_line(aes(x = se.seq, y = ll99), linetype = 'dashed', data = dfCI) +
  geom_line(aes(x = se.seq, y = ul99), linetype = 'dashed', data = dfCI) +
  geom_segment(aes(x = min(se.seq), y = meanll95, xend = max(se.seq), yend = meanll95), linetype='dotted', data=dfCI) +
  geom_segment(aes(x = min(se.seq), y = meanul95, xend = max(se.seq), yend = meanul95), linetype='dotted', data=dfCI) +
  scale_x_reverse()+
  scale_y_continuous(breaks=seq(-1.25,2,0.25))+
  coord_flip()+
  theme_bw()

# Display the enhanced funnel plot
funnel_plot

5.0.1.1 Understanding Funnel Plots

Structure: In an ideal scenario, a funnel plot resembles an inverted funnel or a pyramid. This shape reflects that larger studies tend to be more precise (higher up on the plot), while smaller studies have more variability (scattered at the bottom).
Axes:
- The x-axis represents the estimated effect size for each study (e.g., risk ratios, odds ratios plotted on a logarithmic scale, or mean differences).
- The y-axis indicates the study precision, often measured by the standard error. Larger studies with greater precision are displayed at the top.

In a bias-free scenario, 95% of studies would be expected to lie within the dashed lines representing the 95% confidence interval.

Interpreting Funnel Plots and Reasons for Asymmetry

When analyzing a funnel plot, look for symmetry. A symmetrical plot suggests that publication bias is unlikely, while an asymmetrical plot may indicate bias.

Asymmetry in a funnel plot can arise from several factors:

Non-reporting Bias: Studies with non-significant results may be less likely to be published.
Methodological Quality: Studies with poor design may report exaggerated effect sizes.
True Heterogeneity: Variability in patient populations can lead to biased estimates, especially in smaller studies.
Artefactual Correlation: Correlations between effect estimates and their standard errors can create false asymmetry.
Chance: Random variation is more likely in analyses with fewer studies.

Methods such as the Precision Effect Test (PET) and Precision Effect Estimation with Standard Error (PEESE) have been developed to help researchers better assess and adjust for publication bias.

PET evaluates the relationship between effect sizes and their standard errors, positing that smaller studies (often unpublished) will display larger effect sizes due to the likelihood of publication. By fitting a regression model where effect sizes are regressed on their standard errors, researchers can determine whether a systematic bias exists. If the regression shows a significant slope, it suggests that publication bias is present, as smaller studies are showing disproportionately large effects.
PEESE builds on the insights from PET by providing a more nuanced approach to estimate the true effect size. It uses both the effect sizes and their standard errors, allowing for a more accurate adjustment of the effect size that accounts for the potential bias introduced by smaller studies. PEESE calculates an adjusted effect size by modeling the observed effect sizes and their variances, yielding a refined estimate that is less susceptible to the biases highlighted in PET.

5.0.2 Trim and Fill Method

The trim and fill method adjusts for publication bias by estimating the number of studies that would need to be added to achieve symmetry. This method imputes missing studies to provide a more accurate effect size.

5.0.2.1 Example of Using Trim and Fill

# Perform trim and fill analysis
trimmed_res <- trimfill(res)

# Summary of the results
summary(trimmed_res)

## 
## Estimated number of missing studies on the right side: 1 (SE = 1.7009)
## 
## Random-Effects Model (k = 6; tau^2 estimator: REML)
## 
##   logLik  deviance       AIC       BIC      AICc   
##  -1.7515    3.5031    7.5031    6.7219   13.5031   
## 
## tau^2 (estimated amount of total heterogeneity): 0.0000 (SE = 0.0673)
## tau (square root of estimated tau^2 value):      0.0022
## I^2 (total heterogeneity / total variability):   0.00%
## H^2 (total variability / sampling variability):  1.00
## 
## Test for Heterogeneity:
## Q(df = 5) = 4.6326, p-val = 0.4623
## 
## Model Results:
## 
## estimate      se    zval    pval   ci.lb   ci.ub      
##   0.8407  0.1348  6.2349  <.0001  0.5764  1.1050  *** 
## 
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

# Create a funnel plot with imputed studies
funnel(trimmed_res, main = "Funnel Plot with Trim and Fill")

In this example, the trimfill function from the metafor package adjusts the meta-analysis results for publication bias.

5.0.3 Egger Test

The Egger test statistically assesses funnel plot asymmetry. A significant result indicates potential publication bias.

5.0.3.1 Conducting the Egger Test

# Conduct the Egger test
egger_test <- regtest(res)

# Display the test results
print(egger_test)

## 
## Regression Test for Funnel Plot Asymmetry
## 
## Model:     mixed-effects meta-regression model
## Predictor: standard error
## 
## Test for Funnel Plot Asymmetry: z = -1.2521, p = 0.2105
## Limit Estimate (as sei -> 0):   b =  1.4949 (CI: 0.3521, 2.6376)

5.0.4 Rosenthal’s Fail-Safe N

Rosenthal’s Fail-Safe N calculates the number of unpublished studies with null results needed to invalidate the overall effect. A high Fail-Safe N suggests that publication bias is unlikely to significantly impact the findings.

5.0.4.1 Example of Calculating Fail-Safe N

# Calculate the Fail-Safe N
failsafe_n <- fsn(res)
print(failsafe_n)

## 
## Fail-safe N Calculation Using the General Approach
## 
## Average Effect Size:         0.7867 (with file drawer: 0.2950)
## Amount of Heterogeneity:     0.0000 (with file drawer: 0.1641)
## Observed Significance Level: <.0001 (with file drawer: 0.0493)
## Target Significance Level:   0.05
## 
## Fail-safe N: 8

5.0.5 Robust Bayesian Meta-Analysis (RoBMA)

Robust Bayesian Meta-Analysis (RoBMA) is an advanced method designed to address the limitations of traditional meta-analysis by incorporating information on publication bias, heterogeneity, and effect sizes in a single model. It relies on Bayesian principles to estimate the probability of the presence of a real effect while considering the potential for biases and study variability. RoBMA extends the traditional meta-analytic framework by integrating multiple models that account for different sources of uncertainty. It not only provides point estimates of effect sizes but also outputs the probability that publication bias and heterogeneity are influencing the results. This allows for a more transparent and reliable evaluation of the evidence base. Specifically, RoBMA offers:

Probabilistic conclusions: Instead of simple yes/no results, RoBMA provides probabilities for the presence of effects, heterogeneity, and bias.
Robust model averaging: It combines estimates across different models, reducing sensitivity to outliers and small study effects.
Enhanced diagnostics: RoBMA provides visual tools to detect and correct for bias, making it easier to understand and communicate findings.

5.0.5.1 Setting Up and Running a RoBMA Model in R

The implementation of RoBMA in R is straightforward using the RoBMA package. Below is a basic example using simulated effect sizes (d) and standard errors (se):

# Load the necessary libraries
library(RoBMA)

# Ensure your dataset 'dat' is structured correctly
# Use the yi and vi columns from 'dat'
# Fit the Bayesian meta-analysis model
fit <- RoBMA(
  d = dat$yi,                     # Effect sizes
  se = sqrt(dat$vi),              # Standard errors from variances
  study_names = as.character(dat$paper),        # Unique study identifiers
  seed = 1,                        # Seed for reproducibility
  chains = 2,
  sample = 500,
  burnin = 200,parallel = TRUE)

# Summarize the results
summary(fit)

## Call:
## RoBMA(d = dat$yi, se = sqrt(dat$vi), study_names = as.character(dat$paper), 
##     chains = 2, sample = 500, burnin = 200, parallel = TRUE, 
##     seed = 1)
## 
## Robust Bayesian meta-analysis
## Components summary:
##               Models Prior prob. Post. prob. Inclusion BF
## Effect         18/36       0.500       0.171        0.206
## Heterogeneity  18/36       0.500       1.000          Inf
## Bias           32/36       0.500       0.359        0.560
## 
## Model-averaged estimates:
##                    Mean Median 0.025 0.975
## mu                0.018  0.000 0.000 0.204
## tau               0.479  0.472 0.373 0.628
## omega[0,0.025]    1.000  1.000 1.000 1.000
## omega[0.025,0.05] 0.992  1.000 0.873 1.000
## omega[0.05,0.5]   0.969  1.000 0.572 1.000
## omega[0.5,0.95]   0.965  1.000 0.529 1.000
## omega[0.95,0.975] 0.967  1.000 0.541 1.000
## omega[0.975,1]    0.969  1.000 0.541 1.000
## PET               0.099  0.000 0.000 1.183
## PEESE             0.500  0.000 0.000 6.212
## The estimates are summarized on the Cohen's d scale (priors were specified on the Cohen's d scale).
## (Estimated publication weights omega correspond to one-sided p-values.)

# Plot the results for the parameter of interest
plot(fit, parameter = "mu", xlim = c(-0.5, 0.5))  # Adjust xlim as necessary

5.0.5.2 Key Outputs and Interpretation

Model Components and Inclusion Probabilities:

RoBMA assesses three primary components: Effect, Heterogeneity, and Bias. The Inclusion Probability indicates how likely each component is part of the model:
- Effect: Probability that a true non-zero effect exists.
- Heterogeneity: Probability that effect sizes vary significantly between studies.
- Bias: Probability that results are influenced by publication bias.
For example, if Bias = 98%, this suggests a very high likelihood that publication bias is present and impacting the results.
Model-Averaged Estimates:

RoBMA reports model-averaged estimates for the average effect size (mu), between-study variance (tau), and other model parameters. This averaging reduces the impact of individual extreme results by incorporating information from all candidate models.
- mu: The central tendency of the effect size (e.g., Cohen’s d).
- tau: The estimated heterogeneity, reflecting how much the effect sizes vary between studies.
- Bias Weights (omega): These weights correspond to the probability of study inclusion based on their p-values, directly indicating the severity of publication bias.
Posterior Model Probabilities:

Each model in RoBMA has a posterior probability, which tells you how well that specific combination of effect, heterogeneity, and bias explains the observed data. This helps in identifying the most plausible scenarios.

5.0.5.3 Visualizing Publication Bias with RoBMA

RoBMA provides tools to visually inspect and diagnose publication bias using enhanced funnel plots. To generate a funnel plot that shows bias-corrected effects, you can use:

# Create an enhanced funnel plot with bias adjustment
plot(fit, type = "funnel", show_legend = TRUE)

The enhanced funnel plot compares the original effect sizes with those adjusted for potential biases. Asymmetry or clustering patterns in the plot indicate possible biases, while the adjusted estimates provide a more accurate picture of the underlying effect.

5.0.5.4 Advantages of Using RoBMA

Comprehensive Bias Assessment: RoBMA integrates several models to check for the presence and impact of publication bias.
Flexible Prior Specification: Allows for the use of both informative and non-informative priors, adapting to various research scenarios.
Model-Averaging for Stability: By combining different models, RoBMA reduces the chance of overfitting and provides more robust conclusions.

5.0.5.5 Limitations to Consider

Computational Complexity: Running RoBMA with large datasets or complex prior structures can be resource-intensive.
Advanced Interpretation: Understanding and explaining RoBMA results may require familiarity with Bayesian concepts and model averaging.