1. Introduction
Single-indicator trading strategies are easy to construct but fragile in practice: each indicator has regimes where it fails, and a strategy that relies on one signal alone will suffer extended drawdowns during those regimes. The natural remedy is to combine multiple signals, hoping that their errors are at least partially independent. This is the ensemble approach, borrowed from statistical learning, applied to CTA signal construction.
This article develops the mathematical framework for multi-signal combination, focusing on two canonical mechanisms—linear weighting and majority voting—and analyzes the conditions under which adding signals improves rather than degrades performance.
2. Mathematical Framework
2.1 Signal Definition
Let be a set of signals. Each signal produces a position recommendation at time :
where denotes short, neutral, and long. The choice of a discrete output space reflects the typical CTA setup where position sizing is standardized rather than continuous.
2.2 Linear Weighted Combination
The most straightforward combination rule is a linear weighted sum:
where is the weight assigned to signal . In the unweighted case, for all , and the target position is simply the sum of votes.
The resulting target position can take integer values in , which naturally provides a conviction scale: a target of indicates unanimous agreement, while a target near zero indicates disagreement among signals.
| |
2.3 Majority Voting
An alternative is majority voting, which produces a discrete output:
where maps to . This discards conviction information but produces a simpler, more interpretable signal.
The choice between linear weighting and voting involves a trade-off:
| Property | Linear Weighting | Majority Voting |
|---|---|---|
| Output space | Continuous / integer | Discrete |
| Conviction preservation | Yes | No |
| Sensitivity to outlier signals | Higher | Lower |
| Position sizing | Variable | Fixed |
3. Three Canonical Signals
Consider a concrete ensemble of three signals: RSI, CCI, and Moving Average crossover.
3.1 RSI Signal
The Relative Strength Index over a window of periods is:
where and .
The signal rule:
Note: In a trend-following context, extreme RSI values confirm trend strength rather than signal reversal.
3.2 CCI Signal
The Commodity Channel Index measures deviation from the statistical mean:
where is the typical price, and MAD is the mean absolute deviation.
The signal rule:
3.3 MA Crossover Signal
Given a fast moving average and a slow moving average (with ):
Unlike RSI and CCI, the MA signal is always active (never zero), which means it always contributes a vote.
3.4 Combined Signal
With equal weights , the target position is:
The following table illustrates the combinatorial logic:
| RSI | CCI | MA | Interpretation | |
|---|---|---|---|---|
| +1 | +1 | +1 | +3 | Strong long consensus |
| +1 | 0 | +1 | +2 | Moderate long |
| +1 | 0 | -1 | 0 | Conflicting signals, neutral |
| -1 | -1 | +1 | -1 | Weak short |
| -1 | -1 | -1 | -3 | Strong short consensus |
4. Inter-Signal Correlation
The benefit of ensemble combination depends critically on the correlation structure among signals. If all signals are highly correlated, the ensemble provides no diversification benefit.
4.1 Pairwise Correlation
Define the pairwise correlation between signals and as:
For the three-signal system, the correlation matrix is:
Since all three signals are derived from the same price series, they will tend to be positively correlated. The ensemble benefit is maximized when is low—in the extreme case of , the variance of the combined signal is reduced by a factor of .
4.2 Variance Reduction
For unweighted linear combination with signals having equal individual variance and uniform pairwise correlation , the variance of the combined signal is:
The variance reduction ratio relative to a single signal is:
When , the ratio is (full diversification). When , the ratio is (no diversification—adding signals merely amplifies). For typical CTA signals derived from the same instrument, often ranges from to , meaning the diversification benefit is real but moderate.
4.3 Signal Selection Principles
Given the correlation constraint, signal selection should follow:
- Complementarity: Prefer signals that capture different market features (e.g., trend strength vs. mean reversion) rather than different parameterizations of the same feature.
- Minimum redundancy: Among candidate signals, remove those with with any existing signal in the ensemble.
- Incremental value: Each added signal should reduce the combined Sharpe ratio’s variance by a meaningful amount, which can be tested via bootstrap.
5. Overfitting Risk
Adding more signals introduces more parameters, and more parameters increase overfitting risk. This section quantifies the trade-off.
5.1 Parameter Count
For a three-signal ensemble, the free parameters include:
| Component | Parameters |
|---|---|
| RSI | Window , thresholds |
| CCI | Window , thresholds |
| MA | Fast window , slow window |
| Weights |
This yields approximately 9–10 free parameters. With signals, the parameter count grows as , and the risk of finding a parameter combination that fits noise increases.
5.2 The Curse of In-Sample Optimization
The in-sample Sharpe ratio is an upwardly biased estimator of the true Sharpe ratio . The bias grows with the number of parameters and shrinks with the sample size :
where is the standard deviation of the Sharpe estimator. For the three-signal ensemble with and a backtest of trading days, the inflation is approximately Sharpe points—not negligible.
5.3 Mitigation Strategies
- Walk-forward validation: Reserve out-of-sample data that is never touched during parameter selection.
- Parameter stability check: Small perturbations to parameters should not dramatically change results; if they do, the solution is likely an artifact of noise.
- Information criterion: Prefer parsimonious models. Adding a signal is only justified if it reduces AIC/BIC, not merely because it increases the in-sample Sharpe ratio.
- Economic rationale: Every signal in the ensemble should have a clear economic rationale for why it captures a distinct market inefficiency.
6. Summary
Multi-signal ensemble strategies offer a principled way to combine weak signals into a stronger composite signal. The key insights are:
- Linear combination preserves conviction information and allows flexible position sizing; majority voting is more robust to outlier signals but discards intensity.
- The diversification benefit depends on inter-signal correlation; for typical CTA signals derived from the same instrument, the benefit is moderate.
- Overfitting risk grows with the number of signals and their parameters; rigorous out-of-sample validation is essential.
- Signal selection should prioritize complementarity and economic rationale over mere in-sample performance.
The ensemble approach does not guarantee profitability, but it does provide a framework for constructing more robust and interpretable trading signals than any single indicator alone.
References
- Brillouin, L. (1962). Science and Information Theory, 2nd ed. Academic Press.
- Dietterich, T. G. (2000). “Ensemble Methods in Machine Learning.” Multiple Classifier Systems, LNCS 1857, pp. 1–15.
- Harvey, C. R. and Liu, Y. (2015). “Backtesting.” Journal of Portfolio Management, 42(1), pp. 13–28.
- Bailey, D. H., Borwein, J. M., Lopez de Prado, M., and Zhu, Q. J. (2017). “The Probability of Backtest Overfitting.” Journal of Computational Finance, 20(4), pp. 39–69.
- Aronson, D. R. (2007). Evidence-Based Technical Analysis. Wiley.