1. Introduction
Statistical arbitrage is a class of trading strategies that exploit temporary deviations from equilibrium relationships between correlated assets. The simplest and most widely practiced variant operates on the spread — the price difference (or a linear combination) of two cointegrated instruments — and bets on its reversion to a historical mean.
This article examines a Bollinger Band–based mean reversion strategy applied to the spread. We develop the statistical foundations, formalize the trading logic, analyze parameter selection, and discuss the critical risk of non-reversion.
2. Statistical Foundations
2.1 Cointegration
Two price series and , each individually non-stationary (integrated of order one, ), are said to be cointegrated if there exists a coefficient such that the linear combination
is stationary, i.e., . The vector is called the cointegrating vector (Engle & Granger, 1987).
Stationarity of implies that it has constant mean , finite variance, and tends to revert to after deviations. This mean-reverting property is the engine of statistical arbitrage.
2.2 Stationarity Testing
Before constructing a spread-based strategy, one must verify that is indeed stationary. Two standard procedures are:
Augmented Dickey-Fuller (ADF) Test. The null hypothesis is that contains a unit root. Rejection (low -value) supports stationarity:
Under , the series has a unit root.
Johansen Test. When working with more than two assets, the Johansen procedure simultaneously estimates the cointegrating rank (number of independent cointegrating relationships) and the cointegrating vectors using a vector error correction model (VECM):
where , and the rank of equals the cointegrating rank .
2.3 Spread Modeling
For a pair of cointegrated assets with estimated coefficient , the spread at time is:
Under the null of cointegration, can be modeled as an Ornstein-Uhlenbeck (OU) process:
where is the speed of mean reversion, is the long-run mean, is the volatility, and is a Wiener process. The half-life of reversion is:
A shorter half-life implies faster reversion and, ceteris paribus, more frequent trading opportunities.
3. Bollinger Band Construction
3.1 Definition
The Bollinger Band at time is defined by three lines computed over a rolling window of length :
- Middle band:
- Upper band:
- Lower band:
where the rolling standard deviation is:
and is the number of standard deviation multipliers (typically ).
3.2 Parameter Selection
Window length . The window should be chosen in relation to the half-life of the spread’s OU process. A common heuristic is to , so that the moving average captures the equilibrium level without excessive lag. Windows that are too short produce noisy bands; windows that are too long adapt too slowly to regime changes.
Multiplier . The choice of controls the trade-off between signal frequency and confidence:
| Probability outside band (Gaussian) | Interpretation | |
|---|---|---|
| 1.0 | 31.7% | Frequent signals, many false positives |
| 2.0 | 4.6% | Balanced; standard choice |
| 2.5 | 1.2% | Rare signals, higher confidence |
For spread processes with heavier tails than Gaussian (common in practice), may still produce more signals than the theoretical 4.6%. Robust estimation of (e.g., using median absolute deviation) can mitigate this.
3.3 Entry and Exit Logic
The trading rules are:
Entry.
- If : short the spread (sell leg A, buy leg B). Expectation: will revert downward.
- If : long the spread (buy leg A, sell leg B). Expectation: will revert upward.
Exit.
- If holding a short spread and : close position.
- If holding a long spread and : close position.
In pseudocode:
| |
The payup parameter shifts the limit price in the favorable direction to accelerate execution. For statistical arbitrage, execution speed is critical: the edge decays as the spread reverts.
4. Risk of Non-Reversion
The central risk in spread mean reversion is cointegration breakdown: the equilibrium relationship itself changes, and the spread does not revert. This can occur due to:
Structural breaks. Regulatory changes, commodity substitution, or supply-side shocks can permanently alter the relationship between two assets. For example, a change in refining margins can shift the crack spread between crude oil and gasoline to a new regime.
Regime shifts in volatility. During market stress, correlations converge toward one, and the spread may widen explosively rather than revert. The OU model assumption of constant breaks down.
Non-synchronous trading. If one leg is illiquid or has trading halts, the observed spread may deviate from the true economic spread, generating false signals.
4.1 Detecting Cointegration Breakdown
Practitioners monitor several indicators:
- Rolling ADF statistic. Compute the ADF test on a rolling window. A rising -value signals weakening cointegration.
- Spread z-score magnitude. If , the deviation is extreme and may indicate a structural break rather than a temporary dislocation.
- Hurst exponent. A Hurst exponent shifting from (mean-reverting) toward (random walk or trending) signals loss of mean reversion.
4.2 Mitigation
Common defenses include:
- Stop-loss. Close the spread position if the unrealized loss exceeds a threshold (e.g., from entry).
- Position sizing inversely proportional to spread volatility. Reduce exposure when expands.
- Periodic re-estimation of . Recalibrate the cointegrating coefficient on a rolling basis rather than relying on a fixed .
5. Summary
Bollinger Band mean reversion on the spread is one of the simplest and most intuitive statistical arbitrage strategies. Its strength lies in the clear economic logic: cointegrated assets should revert to equilibrium, and the Bollinger Band provides a statistically principled way to identify dislocations.
However, the strategy’s viability rests entirely on the persistence of the cointegrating relationship. When the structural relationship breaks, the mean-reversion assumption fails catastrophically. Robust implementation therefore requires continuous monitoring of cointegration, strict risk controls, and the humility to recognize when the regime has changed.
References
Engle, R. F., & Granger, C. W. J. (1987). Co-integration and error correction: Representation, estimation, and testing. Econometrica, 55(2), 251–276.
Gatev, E., Goetzmann, W. N., & Rouwenhorst, K. G. (2006). Pairs trading: Performance of a relative-value arbitrage rule. The Review of Financial Studies, 19(3), 797–827.
Johansen, S. (1991). Estimation and hypothesis testing of cointegration vectors in Gaussian vector autoregressive models. Econometrica, 59(6), 1551–1580.
Avellaneda, M., & Lee, J.-H. (2010). Statistical arbitrage in the U.S. equities market. Quantitative Finance, 10(7), 761–782.
Bollinger, J. (2001). Bollinger on Bollinger Bands. McGraw-Hill.
Vidyamurthy, G. (2004). Pairs Trading: Quantitative Methods and Analysis. Wiley.