PCA evaluation of Futures returns for enjoyable and revenue, half deux

This web page was created programmatically, to learn the article in its authentic location you may go to the hyperlink bellow:
https://qoppac.blogspot.com/2025/09/pca-analysis-of-futures-returns-for-fun.html
and if you wish to take away this text from our web site please contact us

In my earlier publish I mentioned what would occur when you did the loopy factor of doing a PCA on the entire universe of futures throughout belongings, relatively than simply inside US equities or bonds like The Man would need you to. In this publish I discover how we might do one thing helpful with them. There is a few messy code here, to run all of it you may want psystemtrade, however you may exploit massive chunks with your personal information even when you do not.

The massive drawback: signal flipping

Before hitting some p&l producing exercise, first nevertheless we have to take care of an impressive difficulty from the earlier publish.

TLDR, more often than not issue one is ‘threat on /equities are go’ and issue two is international rates of interest; though not at all times. Factor signal flipping was an issue nevertheless (due to individuals beneath the road for that perception). So typically issue one was lengthy equities, typically quick equities. Sometimes it was one thing else solely.

As an instance, bear in mind this plot from half one? It’s the issue publicity of the S&P 500 over time for components 1(0), 2 (1) and three(2).

Note there are ‘blips’ when we’ve a brief publicity to issue 1, principally within the interval since 2008 once we’re usually lengthy issue 1. That’s clearly a short lived signal flip. We most likely wish to eliminate these. But there may be additionally the lengthy interval within the early 2000’s once we’re persistently quick issue 1. That could be a ‘signal flip’; however it is also that issue one on this interval was one thing extra fascinating than simply ‘lengthy fairness threat’.

A few concepts spring to thoughts right here. One is simply smoothing the issue weights. That would simply resolve the blips; and the sleek want solely be just a few weeks to eliminate them. But an extended easy, of the size wanted to eliminate the opposite intervals, would cut back the details about the components; in notably we would be lacking out on fascinating instances when one thing apart from boring outdated threat on and off is driving the market.

Another shiny concept I had was to reverse the signal on weights when the most important absolute worth weight was damaging. My expectation was that typically the most important weight on issue 1 (principally threat on) would often be equities, and when that issue flipped signal we would flip it again once more. However that did not produce the anticipated outcomes. If I have been much less lazy (and wanting to get again to writing e book #5), I’d most likely perform some research; eg I’m fairly positive the reply is someplace in Gappy’s new book however I have never received there but.

In the top I made a decision to loosen up and ignore the signal flipping; I can do that due to the 4 concepts I outlined:

1- personal the components

2- commerce the components

3- purchase belongings with persistent alpha (+ve residual)

4- imply revert the cumulative residual

…. it is solely actually 1 and a pair of which might be affected by signal flipping. And I really feel I have already got issues in my armoury for 1 and a pair of. For instance my combination momentum sign (blogged about right here, and in addition in my most up-to-date e book AFTS) is mainly like 2, and on belongings with a protracted bias that will even give us a bit of 1 as effectively.

Arguably my relative momentum and long run imply reversion are additionally a bit like 3 and 4. Yet one other concept is to construct ‘asset courses’ utilizing clustering as I did right here, after which use these for the needs of 1,2 and presumably 3 and 4.

So we’ve three alternative ways of forming ‘components’: exogenously decide asset courses, PCA, and clustering; and 4 alternative ways of buying and selling every of them. Those will not give radically completely different outcomes since clusters principally observe asset courses, however they may very well be just a little completely different.

<Sidebar * see earlier word>

But, I hear you cry, why are you able to flippantly ignore signal flipping when buying and selling solely the residuals? Well it is fairly easy; think about a typical APT sort equation with a single PCA ok and market i:

r_i_t = a_i + (b_i,ok * r_k,t) +e_i,t

If we now do an indication flip, then the beta (b) may have a minus one in entrance of, however the market or PCA return r_m will even have a minus one in entrance of it. These cancel, and estimation of each the persistent bias (alpha, a_i) and the momentary error (epsilon, e_m) will likely be unaffected.

Trading the alpha

So we’ve two fundamental concepts; we generate our PCA after which run regressions that seem like this:

r_i_t = a_i + (b_i,ok * r_k,t) + … +e_i,t

Where there are a number of PCA ok…. And then we eithier purchase constructive a_i and promote damaging; or we promote issues with current cumulative constructive e_i.

There are nonetheless many design inquiries to resolve right here. How many PCA can we embrace? Too few, and we’ll most likely find yourself lacking one thing fascinating. Too many and there’s a threat we’ll find yourself with out clear indicators. Over what interval ought to we estimate betas and alphas? Basically how persistent are they prone to be. Over what interval ought to we cumulate epsilon? Are there intervals by which episilon will likely be trending relatively than imply reverting; eg belongings which have outperformed their issue adjusted return will proceed to take action (which is able to look an terrible lot like shopping for constructive alpha)?

For the PCA I’m going to maintain it easy and initially use three PCA, which occurs to be probably the most I can plot and get my head round it. I’m additionally going to stay to estimating my alphas and betas over a 12 month interval, which is the arbitrary interval I used earlier than to estimate the PCA themselves (appears bizarre to make use of a unique interval). For the query of epsilon decay I’ll threat the wrath of the overfitting gods and do a time sensitivity evaluation.

To summarise then: At the beginning of every month we have a look at the 12 months normalised returns, do a PCA, after which regress every instrument on the returns of every element. We then have an alpha intercept coefficient, and a few betas (at most three, as soon as for every PCA). We can see how predictable the alpha is of returns within the following month(s). Then for the next month we are able to additionally calculate the residual of efficiency vs the fitted mannequin. We can cumulate up these residuals and see how they forecast efficiency.

Alpha

Let’s begin with the alphas. Here be a large scatter plot:

Each level is the alpha calculated at first of a given month for an instrument, and the normalised ex-post return for the next month. It appears to be like like there could be a weak constructive relationship there, so let’s do some stats.

OLS Regression Results

==============================================================================

Dep. Variable: ex_post_return R-squared: 0.006

Model: OLS Adj. R-squared: 0.006

Method: Least Squares F-statistic: 169.0

Date: Mon, 08 Sep 2025 Prob (F-statistic): 1.62e-38

Time: 11:46:09 Log-Likelihood: 1910.6

No. Observations: 26311 AIC: -3817.

Df Residuals: 26309 BIC: -3801.

Df Model: 1

Covariance Type: nonrobust

==============================================================================

coef std err t P>|t| [0.025 0.975]

——————————————————————————

Intercept 0.0052 0.001 3.735 0.000 0.002 0.008

alpha 0.3144 0.024 12.999 0.000 0.267 0.362

==============================================================================

There: we’ve the basic undergrad stats train query “Why is my t-stat big but my R squared is low?”. Answer: there’s something right here, however it’s weak. This typically occurs when you use a big dataset (26,000 observations right here).

To present this in another way, the conditional ex-post common each day return over the next month with a constructive alpha is 0.02, and with a damaging alpha is -0.01 (each conditional subsets are roughly half the dataset total). The t-statistic evaluating these is a hefty 10, comparable to a p_value of the order of 10^-26. So once more, alpha positively has an impact, however is that distinction actually that massive? Hard to inform.

But we all know that low R squared are fairly frequent in finance, so is that this an issue? To take a look at this I attempted utilizing the alphas as a forecast, after which calculated the Sharpe Ratio of every forecast. The median throughout all devices is a SR of 0.1. Remember that pattern following provides us round 0.3 to 0.4 for every instrument, so this is not particularly fascinating.

It could be that I might get higher outcomes from a unique lookback to calculate the alphas (bear in mind we use one 12 months). Everything from a 1 month to a lookback of 10 years. What about utilizing fewer, or extra principal parts? Remember we’re going with three. It seems {that a} one 12 months lookback is fairly optimum in comparison with shorter or longer; however utilizing one PC is best than utilizing two or extra. Still the easiest we are able to do is a one 12 months lookback with one PC, and that offers us a SR of 0.12 which is hardly in pockets busting territory and in addition not considerably completely different from the end result wih three components.

Trading the residual

Let’s flip then to buying and selling the residuals. We’re going to cumulate up residuals over varied intervals and see how effectively that predicts future returns. To keep away from a ahead trying forecast, the residuals are calculated on the out of pattern month following the purpose at which the mannequin is fitted. Otherwise the regression coefficients can be ahead trying, and therefore so would the forecast.

Note that as a result of the mannequin modifications barely every month, the coefficients used to calculate residuals will even change barely. Such is life. But we’ll hold stacking up the residuals month by month despite the fact that they’re utilizing completely different fashions.

We now have 3 knobs to twiddle on our overfitting machine; lookback and variety of PCs as earlier than, but in addition the variety of days we sum up residuals. To hold issues comparatively easy I’ll initially sum them up utilizing 22 days (a few month of enterprise days). So our base case is:

One 12 months lookback to do PCA and calculate coefficients
3 principal parts
22 days summing up of residuals; our forecast is minus the summed residuals

And leaping straight to Sharpe Ratio calculation like an impatient toddler, we get a SR of -0.04. The signal is incorrect, displaying that constructive residuals result in extra constructive efficiency, and the impact can be v.v.v. weak.

Does growing the residual summing interval work; eg imply reversion works over longer time intervals? Nope. Anything as much as a 12 months is definitely worse. Going right down to every week (which might be v.v.v. expensive to commerce) does no less than push the SR into constructive territory, however solely simply.

Dropping to at least one PC (which was marginally higher than for alpha above), altering the lookback on the PCA, …. nothing produces helpful outcomes. This concept is an already lifeless donkey that has been subsequently thrown off a cliff after which burned**.

** no precise donkeys have been harmed within the creation of this weblog publish

Back to alpha

So we had a not too promising particular person SR utilizing alpha on an instrument degree, however how does that look on a portfolio degree? Surprisingly, fairly good. Here are the mixed outcomes for 100 devices:

That unhealthy boy has a SR of 0.84! Some of that’s diversification, however a few of it’s as a result of a extra precisely calculated median SR per instrument is 0.15, greater than the 0.10 calculated earlier (as a result of, many causes, like buffering and what not). Still, that is a particularly excessive realised diversification of over 5. Let’s evaluate it to the ‘gold normal’ of single momentum fashions, EWMAC16,64 with the identical devices:

OK not pretty much as good, even counting the completely different vol ewmac is available in with a SR of 1.08, however their correlation is a comparatively lowly 0.3. That suggests a modest allocation to alpha persistence will earn some cash. Chucking 10% of your forecast weights into alpha persistence bumps up the SR of ewmac16,64 from 1.08 to 1.12. Going to the (arguably in pattern fitted) finest mannequin with one principal element improves the SR of the alpha mannequin by itself to 1.02, but in addition will increase the correlation with momentum; in order that the joint SR with 10% in alpha persistence and 90% in ewmac produces a just about unchanged SR of 1.13.

Summary

Research in systematic buying and selling tends to end in a variety of blind alleys. I believed this is able to be one other one. Certainly the concept of imply reverting the errors, a basic from the fairness stat arb crowd, does not actually work on this context. However there does appear to be some modest efficiency acquire to fundamental momentum from together with a PCA derived alpha persistence mannequin. The acquire is small nevertheless, so it is debatable whether or not it is price what can be various further work. Not a blind alley then, however not a really nice one to spend a lot time in.

This web page was created programmatically, to learn the article in its authentic location you may go to the hyperlink bellow:
https://qoppac.blogspot.com/2025/09/pca-analysis-of-futures-returns-for-fun.html
and if you wish to take away this text from our web site please contact us